SRE - Monitoring Automation Engineer - Experienced Hire
Susquehanna International Group Dublin, Ireland
Overview
We are currently recruiting a Site Reliability Engineer to work with our cross-functional Monitoring & Automation Engineering team. You will have the opportunity to design highly available systems and to build, manage, monitor, and performance-tune them in support of Susquehanna's trading environment. We work in a complex, distributed environment where we continually strive to improve system capacity, increase speed, ensure stability, and mitigate risk.
Core Responsibilities
1. Administer and operate system/application monitoring tools in an HA configuration
2. Contribute as a developer to existing in-house frameworks and standards
3. Create solutions with a production end state in mind
4. Review code and provide peer feedback against best practices
5. Contribute to developing architecture of appropriate technical solutions & platforms
6. Create and maintain architectural diagrams and other relevant documentation
7. Design, develop, and implement software integrations based on user feedback
8. Troubleshoot and support non-performant and failing tasks/jobs/playbooks
9. Perform application and systems analysis of production infrastructure
10. Influence and negotiate with stakeholder teams to implement optimal solutions
11. Implement automation tools and frameworks (CI/CD pipelines)
What we're looking for
1. Minimum 3 years' experience with scripting (Python, PowerShell, Bash); a background in software development would be beneficial but is not essential
2. Experience at admin level with:
o Configuration/Automation management tools (Ansible, Chef, Tidal or similar)
o Log/data management platforms (ELK stack, Splunk or similar)
o Elasticsearch
o CI/CD tools (GitLab, Jenkins, Bamboo or similar)
o Infrastructure/application monitoring tools (Check_mk, Prometheus, Grafana or similar)
3. Understanding of network protocols (communications, management, security).
Beneficial Experience
1. Administration of Linux-based operating systems.
2. Familiarity/experience with application containerization technologies.
3. Experience with HA concepts and technologies.
4. Understanding of database concepts.