Excellent opportunity for a Senior Site Reliability Engineer to work on-site in Cork with an innovative automation technology company.
Key Responsibilities:
* Incident management and conducting post-incident reviews
* Design and implementation of software to assist in operations and support
* Work closely with developers to ensure designed solutions meet non-functional requirements such as availability, performance, security, and maintainability
* Improving internal and external processes and systems, moving from ad hoc practices to infrastructure and configuration as code throughout the organization
* Implementing monitoring and alerting throughout in-house and cloud-based systems
* Improving the reliability of systems throughout the organization and working with product teams to help improve their products
* Supporting production and in-house systems
* Define SLOs and SLAs for Products
* Define and manage release budgets
* Managing the in-house development and CI/CD systems
* Take responsibility for the design, implementation, and maintainability of robust, scalable, high-quality software and systems within the SRE domain
* Take responsibility for complex project tasks and contribute to technical decisions to ensure successful delivery
* Contribute to the site architecture for all products
* Be an active member of the best-practice SRE function, striving for and achieving higher standards of individual and team performance
* Build relationships external to the team
* Drive and achieve knowledge sharing across all products
* Continuously develop your technical skill set and demonstrate how it is applied to the domain
* Identify personal development opportunities, set goals, and deliver on them
* Mentor and train other engineers throughout the company and drive company-wide improvement
Minimum Requirements:
* BSc in a related field such as Computer Science, Computer Engineering, Electrical Engineering, or equivalent and 5-7 years of professional development or operations experience
* 2-3 years in a DevOps engineering or SRE role
* 2-3 years of proven experience in cloud computing
* Experience in VMware ESXi
* Experience in AWS
* Experience with configuration management technologies such as Ansible, Puppet, Chef, or Salt
* Experience with infrastructure-as-code technologies such as Terraform and CloudFormation
* Experience with, and high-level understanding of, multiple operating systems including Ubuntu, macOS, and Windows
* Scripting in languages such as Bash or Python
* Experience with CI/CD and build pipelines in both Jenkins and GitLab
* Experience working with multiple teams to facilitate orderly project and release plans
* Experience in issue analysis in a cloud environment