Site Reliability Engineering Leader - Security - Apple Services Engineering
Dublin, County Dublin, Ireland
Software and Services
People at Apple don’t just build products — they craft the kind of experience that have revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple Services Engineering (ASE) team builds and provides systems and infrastructure that fuel Apple’s services (such as iCloud, iTunes, Siri, and Maps). We are the foundation on which Apple’s software developers build the products that our customers love. We are looking for a passionate and talented Site Reliability Engineering Leader to continue our focus on providing our customers the highest quality Apple Services experience. Our services have to scale globally, stay highly available, and 'just work.' If you love designing, engineering, and running systems and infrastructure that will help millions of customers, then this is the place for you!
The services that this team manages are foundational security services. From host access and disk encryption to identity and authentication, these services are critical for operational security, as well as securing Apple's fleet and protecting our most critical data. You would lead a new SRE team dedicated to these security services, while partnering closely with the Security Development team to bring up and mature new services as part of our infrastructure investments.
Description
The SRE organization requires a strong SRE leader for its rapidly growing Security SRE team. In this role, you will oversee critical security infrastructure services and focus on improving the reliability and manageability of these services. You’ll establish a European SRE team to support these services and work in partnership as part of a global SRE team. You’ll be a senior engineering leader within the Infrastructure organization. You will lead the SRE teams responsible for the reliability and performance of critical security infrastructure services, and improve their reliability, observability, and manageability. These services are an important set of services responsible for all of Apple infrastructure; therefore, evolved approaches to changes, reliability, and resiliency are required for this role. The SRE teams are responsible for the reliability and performance of on-prem and cloud-based services. You’ll collaborate with multi-functional teams to design, implement, and maintain security measures, incident response protocols, and automation tools to strengthen our organization's overall security posture.
Minimum Qualifications
* 8 years plus engineering management experience.
* 5 years experience managing SRE teams, managing mission-critical production services, with progressively larger charters.
* Demonstrated success leading SRE teams and managing infrastructure development engineers.
* Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts.
* Proficient in at least one of Python, Golang, Java, or Rust. Experience working in a standard SDLC.
* Understanding of key Infrastructure Security concepts and principles.
Preferred Qualifications
* Proven experience with large scale, highly available, distributed, and fault-tolerant systems.
* Excellent understanding of operating systems concepts including multi-threading, memory management, networking and storage, performance and scale.
* Experience with Kubernetes, Docker, and containerization (CNCF Kubernetes Administrator or equivalent).
* Deep knowledge of Linux security primitives, systems, packaging, container security, and SELinux.
* Understanding of MacOS security primitives.
* BS/MS in Computer Science or Equivalent (5+ years of software development or production operations experience in a large-scale environment).
* Prior experience in security-related fields (or equivalent experience). Certs like OSCP, OSCE, OSEE, etc. helpful but not vital.
#J-18808-Ljbffr