Job Summary
Software Development Manager, AWS Incident Tooling & Response
AWS Resilience owns services that prevent and respond to availability and security issues for all AWS Services. We're the people who keep the cloud running.
You'll join a diverse team of software, security experts, operations managers, and other vital roles. You'll collaborate with people across AWS to deliver high standards for safety, security, and availability.
AWS Incident Tooling is at the heart of Amazon Web Services' high availability. We make customer-impacting events shorter and less frequent by detecting early large-scale events and providing tooling to enable fast mitigation.
We provide automated tooling that quickly identifies issue causes and helps mitigate their impact. Our engineer time is spent on projects to improve the tooling and automation.
We also offer solutions for other AWS groups to manage their own events.
It's an exciting time to join our team as we're growing and expanding our offerings.
Job Description
As a Software Development Manager, you'll manage automated tooling roadmaps and delivery for the detection and resolution of issues within AWS infrastructure.
You'll work closely with teams managing incident response and leadership to gather new requirements.
You'll drive improvements into our automation, tooling, and processes based on learning from past incidents.
You'll coordinate across project teams to expand the use of our tooling to additional areas across Amazon.
This team offers great growth potential and the opportunity to make a huge impact.
Main Responsibilities:
* Define and Deliver Business Priorities: You'll be a key contributor and owner of the direction of the AWS Incident Management team.
* Cross-Site, Cross-Team Coordination: You'll coordinate with counterparts and sister teams to ensure a clear communication channel exists between AWS Incident tooling and Response teams.
* Performance Management/Team Health: You'll own performance and career management for the team, ensuring the operational load remains manageable.
Qualifications
BASIC QUALIFICATIONS
* Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle.
* Experience in engineering team management.
* Experience in leading the definition and development of multi-tier web services.
PREFERRED QUALIFICATIONS
* Experience communicating with users, technical teams, and senior leadership.
* Experience recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers.
* Experience managing high-caliber Software Engineers developing complex software systems.