Software Development Manager, AWS Incident Tooling & Response
AWS Resilience owns services that prevent and respond to availability and security issues for all AWS services. In other words, we're the people who keep the cloud running. We work on the most challenging problems, with a constant stream of new services and possible failure modes to prevent, and we're looking for talented people who want to help.
You'll join a diverse team of software engineers, security experts, operations managers, and people in other vital roles. You'll collaborate with people across AWS to help us deliver the highest standards for safety, security, and availability. You'll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
AWS Incident Tooling is at the heart of the high availability of Amazon Web Services. We make customer-impacting events shorter and less frequent by detecting large-scale events early and providing the tooling that enables fast mitigation. Our automated tooling quickly identifies the cause of an issue and helps mitigate its impact. Our engineers spend their time on projects that improve the tooling and automation. We also provide our solutions to other AWS groups so they can manage their own events. It's an exciting time to join our team as we are growing and expanding our offerings.
Key Job Responsibilities
1. Define and Deliver Business Priorities
o You will be a key contributor and owner of the direction of the AWS Incident Management team. You will define, plan, track, and deliver on strategic goals for the team while ensuring that the team remains unblocked and focused.
2. Cross-Site, Cross-Team Coordination
o You will be responsible for coordinating with your counterparts and sister teams to ensure that a clear communication channel exists between the AWS Incident Tooling and Response teams. You will also work closely with the alarming systems to create and maintain a proper end-to-end experience, from detection and alarming through to mitigation of incidents.
3. Performance Management/Team Health
o You will own all facets of performance and career management for the team. You will ensure the operational load on your team remains manageable and as low as possible.
Requirements
BASIC QUALIFICATIONS
* Knowledge of engineering practices and patterns for the full software/hardware/network development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and live site operations
* Experience in engineering team management
* Experience in engineering
* Experience in leading the definition and development of multi-tier web services
* Experience partnering with product and program management teams
PREFERRED QUALIFICATIONS
* Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
* Experience in recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers to improve their skills and make them more effective product software engineers
* Experience managing a team of high-caliber Software Engineers developing complex, world-class, scalable software systems that have been successfully delivered to customers
Estimated Salary: $200,000 - $300,000 per year