Are you an experienced Site Reliability Engineer (SRE) or DevOps engineer looking to shape the future of cloud platform services? Here’s your chance to join a new team at an early stage and make a significant impact on systems and services offered to customers and partners in EMEA and beyond.
The Role:
As a Site Reliability Engineer, you’ll be responsible for critical areas within the Cloud Platform, including operations, availability, performance, monitoring, change management, emergency response, security, and capacity planning.
Key responsibilities include:
* Designing, implementing, and maintaining robust, scalable, high-quality software and systems within the SRE domain.
* Managing and supporting in-house development systems, CI/CD pipelines, tools, monitoring, and alerting, with a focus on automation to streamline activities and reduce toil.
* Collaborating closely with developers and architects to ensure designed solutions meet non-functional requirements such as availability, performance, security, and maintainability.
* Leading incident management, post-mortem reviews, and continuous improvement initiatives, contributing to the evolution of processes and systems within the organization.
* Defining key metrics and technical decisions driving products and their delivery, including SLOs and SLAs, architecture, best practices, and cost optimization.
* Taking responsibility for complex project tasks, striving for higher standards of individual and team performance.
* Building relationships external to the team and driving knowledge sharing across all products.
* Identifying personal development opportunities, setting goals, and delivering impactful contributions, while also mentoring and training other engineers throughout the company.
The Person:
To excel in this role, you should possess:
* 3-5 years of experience using AWS platform and services.
* Proficiency with common tools and technologies used within CI/CD and Build pipelines, including Terraform, Jenkins, Gitlab, Nexus, Ansible, Maven, Docker, and Helm.
* A BSc in an IT-related field or equivalent professional experience in cloud operations and/or cloud platforms in a DevOps engineering or SRE role.
* Familiarity with commonly used operating systems including Ubuntu and Windows, along with scripting experience in Bash, Python, or PowerShell.
* Experience troubleshooting issues in a cloud environment and working with multiple teams to facilitate orderly project and release plans.
* Familiarity with VMWare vSphere (ESXi, vCenter) desirable.
We’re looking for candidates with:
* Extensive experience across a wide range of AWS services, including compute, containerization, storage, database, networking, automation, IAM, security, monitoring, logging, backup, and configuration management.
* Previous experience in an SRE team and understanding of SRE principles.
* Experience in backup and restore processes, cloud-based multi-tenancy, emergency response, and on-call duties.
* Understanding of current best practices around Security Management, patching, branching strategy, release management, and Linux administration.
* Experience working in an agile development team using SCRUM methodologies.
If you are interested in learning more about this role and happy to be represented by Solas IT please email me with your CV ryan.wannenburg@solasit.ie. Alternatively please call me on 00 353 12449531
#J-18808-Ljbffr