Software Development Engineer, Central Reliability & Response Engineering
Job ID: | Amazon Development Centre Ireland Limited
Are you ready to be the guardian of Amazon's global digital infrastructure?
CRRE, within Amazon's Engine organization, is where engineering meets impact at massive scale.
We're the team that ensures millions of customers can shop, stream, and connect without missing a beat - 24/7, across the globe.
About the Role:
Imagine building systems that protect Amazon's services across 20+ global marketplaces - that's the scale we operate at.
You'll join one of our elite teams that sits at the intersection of innovation and reliability, where your code will serve as the backbone of Amazon's operational excellence.
Whether you're preventing service disruptions before they happen or enabling lightning-fast incident response, your solutions will directly safeguard the shopping experience for hundreds of millions of customers worldwide.
We're looking for exceptional Software Development Engineers to join our dynamic team in Dublin, Ireland - a tech hub that's home to some of Amazon's most critical reliability and resilience engineering initiatives.
If you get excited about building real-time systems that process petabytes of data daily and want to work where your code impacts millions of customers globally, we want to talk to you.
Based in our state-of-the-art Dublin office, you'll collaborate with builders across Amazon to ensure seamless customer experiences across our vast network of backend and frontend services.
This isn't just any software engineering role - it's an opportunity to solve complex problems at a scale few engineers ever experience, where every line of code you write has the potential to impact global commerce in real-time.
Key job responsibilities:
- Design and implement large-scale systems processing petabytes of data daily
- Build and maintain high-quality, thoroughly tested software solutions
- Create tools and mechanisms that help service teams identify and prevent availability risks
- Develop real-time monitoring and analysis capabilities
- Collaborate with teams across Amazon to improve service resilience
- Participate in on-call rotations to support business-critical systems
A day in the life:
You'll work in an agile environment, designing and implementing solutions that operate at Amazon scale.
This could involve:
- Building real-time data processing systems that analyze service health
- Developing mechanisms to surface and prevent reliability risks
- Creating actionable insights that help teams deploy changes safely
- Collaborating with service teams to implement resilience best practices
- Contributing to systems that process and analyze logs from thousands of services
About the team:
You could join one of two specialized teams within Central Reliability & Response Engineering (CRRE):
Operational Intelligence (OI) Team:
- Owns Real-Time Log Analysis (RTLA), a critical platform used by thousands of internal customers
- Helps teams monitor and categorize service errors in real-time
- Enables root cause analysis within minutes of issues occurring
- Processes and analyzes massive amounts of log data daily
Resilience Insights and Safety Engineering (RISE) Team:
- Creates tools to help services maintain availability under any conditions
- Develops frameworks for assessing and improving service resilience
- Builds systems to ensure safe deployment of code and configuration changes
- Provides actionable insights for improving service reliability
Minimum Qualifications:
- Bachelor's degree or equivalent
- Experience programming with at least one modern language such as Java, C++, or C# including object-oriented design
- Experience contributing to the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
Preferred Qualifications:
- Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
- Experience building complex software systems that have been successfully delivered to customers
- Experience using or building tools in the Observability space, such as log analysis, tracing, or monitoring
Amazon is an equal opportunities employer.
We believe passionately that employing a diverse workforce is central to our success.
We make recruiting decisions based on your experience and skills.
We value your passion to discover, invent, simplify and build.
Amazon is committed to a diverse and inclusive workplace.
Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information.