Description
Your tasks:
- Contribute to a team responsible for the availability, scalability, and performance of a monitoring application;
- Build, maintain and automate monitoring systems to help adoption of solutions;
- Automate builds and building releases;
- Maintain and develop custom systems and tools to improve ability to test, integrate, automate and effectively monitor custom applications in a large-scale Linux environment;
- Assist in the rollout and deployment of new product features and installations;
- Create extensive documentation for current and future configuration processes and policies;
- Work with Service Management team on standard operating procedures including Incident, Problem and Change Management to ensure the monitoring of the systems 24/7.
- Minimum 13 years of relevant university studies and experience managing Unix/Linux infrastructure and related tools (DNS);
- Proven experience with IT Monitoring Applications like Nagios or similar;
- Prior experience in an Internet-facing technical operations role with high up-time requirements;
- Strong experience in web-based architectures and development, with tools such as Javascript, Node.js, Coffeescript, Ruby, Ruby on rails, HTML, Sinatra and MariaDB;
- Proven experience in Bash, Ruby, Perl and Python;
- Extensive experience in building CI/CD pipelines and in depth knowledge of software development life-cycle;
- Strong automation tooling experience in Ansible, Chef, Docker or similar;
- Good understanding of VPN and overall security requirements for Web based solutions implemented with a Linux infrastructure;
- Demonstrated ability to work with Web servers such as Apache and Puma;
- Prior experience in developing IOT solutions with raspberry PIs;
- Ability to take ownership of technical issues and be a productive member in the on-call rotation and certain off-hours shifts;
- Strong troubleshooting skills that span systems, network, and applications;
- Excellent command of the English language both oral and written;