Job Description
We are seeking a highly motivated Systems Engineer to join our team at Apple. As a member of the Fleet Operations Engineering group, you will play a critical role in supporting Apple's Global Compute Infrastructure and ensuring high uptime for services such as Apple Pay, iCloud, Music, TV, Maps, News, FaceTime, Siri, Claris, and more.
The ideal candidate will have the skills and experience to support Apple's cloud infrastructure, driving key goals in automation, process improvement, technical development, monitoring, and operational efficiency.
Responsibilities:
* Support customers of various platform services, collaborating with cross-functional teams to investigate and resolve customer issues.
* Troubleshoot and resolve container orchestration/management issues, specifically Docker, Kubernetes.
* Develop customer-facing product documentation and internally facing documentation/procedural run books.
* Work with engineering teams to deploy/migrate/upgrade applications.
* Monitor server and application performance, resource utilisation, and security/compliance.
* Configure, manage, and troubleshoot configuration management tools, such as Puppet, Chef, etc.
* Recognise and respond to failure trends due to hardware, software, or configurations to assist or isolate a potentially larger problem.
* Configure, manage, and troubleshoot load balancing technologies on both appliances or built-as-a-service on various hardware platforms.
* Configure, manage, and troubleshoot virtual technologies, such as VMWare, KVM, etc.
Minimum Qualifications:
* Bachelors or Masters in Computer Science, Computer Engineering, or equivalent experience.
* Customer-focused thinking and strong problem-solving skills with attention to detail.
* Understanding of base internet infrastructure services, including DNS, DHCP, LDAP, server virtualisation, and server monitoring in critical, large-scale distributed systems experience, combining hardware, operating systems, and software.
Preferred Qualifications:
* Fluency in Golang, Python, or similar languages.
* Experience with measuring, analysing, and optimising performance in host and guest environments.
* Prior experience working with diverse hardware, operating systems, container runtimes (lxc, docker, containerd), and virtualisation stacks (Qemu, KVM, Libvirt on x86, ARM).
* Prior working experience and deep understanding of node management agents (e.g., kubelet) and node orchestration solutions to manage large-scale infrastructures.
{lang_code}