Senior Site Reliability Engineer
Join our company in Cork and work on innovative automation technology.
Key Responsibilities:
* Manage incidents and conduct post-incident reviews
* Design and implement software to assist in operations and support
* Work closely with developers to ensure designed solutions meet non-functional requirements such as availability, performance, security, and maintainability
* Improve internal and external processes and systems by moving from ad-hoc practices to infrastructure and configuration as code throughout the organization
* Implement monitoring and alerting across in-house and cloud-based systems
* Improve system reliability throughout the organization and work with product teams to enhance their products
* Support production and in-house systems
* Define Service Level Objectives (SLOs) and Service Level Agreements (SLAs) for products
* Define and manage release budgets
* Oversee in-house development and Continuous Integration/Continuous Deployment (CI/CD) systems
* Ensure the design, implementation, and maintainability of robust, scalable, high-quality software and systems within the SRE domain
* Take responsibility for complex project tasks and contribute to technical decisions to ensure successful delivery
* Contribute to site architecture for all products
* Be an active member of the best-practice SRE function, striving for higher standards of individual and team performance
* Build relationships outside the team
* Drive and achieve knowledge sharing across all products
* Engage in continuous education and development of technical skills and demonstrate application to the domain
* Identify personal development opportunities, set goals, and deliver on them
* Mentor and train other engineers throughout the company and drive company-wide improvement
Essential Requirements:
* Bachelor's degree in a related field such as Computer Science, Computer Engineering, Electrical Engineering, or equivalent, and 5-7 years of professional development or operations experience
* 2-3 years of experience in a DevOps engineering or SRE role
* 2-3 years of proven experience in cloud computing
* Experience with VMware ESXi
* Experience with AWS
* Knowledge of configuration management technologies, such as Ansible, Puppet, Chef, or Salt
* Experience with infrastructure-as-code technologies, such as Terraform and CloudFormation
* Experience working with multiple operating systems, including Ubuntu, macOS, and Windows
* Scripting experience in languages such as Bash or Python
* Experience with CI/CD and build pipelines in Jenkins and GitLab
* Experience working with multiple teams to facilitate orderly project and release plans
* Experience in issue analysis in a cloud environment