We are collaborating with a client who is dedicated to transforming unused energy resources into valuable computational power. Our client aims to align the future of global computing with sustainable energy practices.
Location: Dublin
Contract: Full-Time, Permanent
Working Model: Hybrid
Reference number: BBBH 25914
Join our clients’ SRE team and play a crucial role in maintaining and enhancing the reliability and performance of their Data Centre infrastructure. As a Site Reliability Engineer, you'll be essential in upholding high Service Level Agreements by driving Service Level Indicators and Service Level Objectives to maintain optimal service quality. The clients’ SREs proactively tackle issues, automate error handling, and advise engineering teams on resilient code development, ensuring that our customers experience seamless access to critical services.
Key Responsibilities
1. Daily Operations: Start your day by reviewing overnight alerts and system performance metrics. Collaborate in team stand-ups to assess project status, recent incidents, and daily priorities.
2. Automation & Monitoring: Automate routine tasks, develop monitoring tools, and analyse system logs to strengthen our infrastructure's resilience.
3. Collaboration & Best Practices: Work closely with software engineering teams to guide best practices for code resilience and review updates before deployment.
4. Incident Management: Regularly participate in incident response drills, post-mortems, and root cause analysis sessions to drive continuous improvement and anticipate potential issues.
5. Documentation & Planning: Conclude each day by documenting insights, sharing findings with the team, and planning for upcoming challenges, always prioritizing customer satisfaction.
What You’ll need to be considered:
1. 5+ years in SRE roles with hands-on architecture, design, and scaling expertise.
2. Proficient in programming (Python, Go, or similar)
3. Experience with modern infrastructure tools (Docker, Kubernetes, Ansible, Cloud Formation, Terraform)
4. Skilled with CI/CD practices and tools (GitLab CI/CD, CircleCI, GitHub Actions)
5. Familiar with logging, monitoring, and alerting systems
6. Strong Unix/Linux experience and understanding of TCP/IP and network programming
If you’re passionate about delivering exceptional solutions and working in the Cloud Space as an SRE, we’d love to hear from you. Apply now to join a team dedicated to advancing sustainable computing solutions!
If you’re ready to take the next step in your career, we’d love to hear from you.
If this opportunity is of interest please apply with your latest CV alternatively email adrian@allenrec.com
#J-18808-Ljbffr