The Senior Infrastructure Reliability Engineer is a key member of the Amazon Web Services (AWS) Infrastructure Reliability & Quality engineering team. Our team owns the design, planning, delivery, and operation of all AWS global infrastructure, ensuring our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain.
As a Senior Infrastructure Reliability Engineer, you will join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and people in other vital roles. You will collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers.
Our team provides engineering support for data center infrastructure equipment, including air handling units, switchgear, breakers, panel boards, uninterruptible power supplies, transformers, generators, and automatic transfer switches, as well as infrastructure security equipment such as cameras and access control systems. A critical piece of this support is ensuring the highest level of initial quality and ongoing support from our suppliers.
Responsibilities:
* Proactively identify, assess, and mitigate reliability risks for critical data center infrastructure equipment
* Conduct comprehensive root cause analysis of any critical equipment failures in the field
* Collaborate cross-functionally with internal and external partners to influence product specification, design, and reliability qualification
* Develop and maintain data center infrastructure reliability models and quantify reliability risks
* Monitor field performance and drive ongoing reliability improvements
* Serve as a subject matter expert and provide technical leadership on reliability engineering best practices
Requirements:
* Bachelor's or Master's degree in Reliability Engineering, Physics, Electrical Engineering, Mechanical Engineering, Materials Engineering, or a related field
* 8+ years of reliability engineering work experience in a high-reliability industry
* 5+ years of experience with failure analysis activities and root cause analysis
* 5+ years of experience with accelerated life testing, stress analysis, and finite element analysis