Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products very quickly. Bring passion and dedication to your job, and there's no telling what we can accomplish together. We're looking for a hardworking and passionate person to join this amazing team, and if you feel this is you, we'd love to hear from you!
The Apple Services Engineering (ASE) organization is responsible for building powerful platforms that enable engineers to deliver incredible experiences to customers. Our compute team is responsible for designing and building the foundational pieces of our in-house cloud technologies. In this role, you will collaborate with teams across Apple to deliver forward-looking high-performance distributed storage solutions for various cloud platforms. The successful candidate is a highly motivated individual with strong technical and communication skills to create intuitive user experiences, who is passionate about quality, and has a keen eye for the details that surprise and delight our customers.
Join this team and you will design, implement, and run services and tools for infrastructure management such as SaltStack, Postgres, Zookeeper, DHCP, DNS, etc., alongside internally developed services. As a member of the EMEIA team, you will also engage with platform customers and support platform issues in the local timezone.
Description
In this role, you will be responsible for developing, debugging, and maintaining software to operate a large compute fleet. You will:
1. Closely collaborate with teams within and across organizations to support their workflows or integrate their technology into our platform.
2. Automate operations processes via services and tools.
3. Develop within configuration management and fleet orchestration via SaltStack, Ansible, Puppet, or others.
4. Design, implement, and maintain robust, scalable, and highly available services that support infrastructure management.
5. Monitor on-server system performance, identify bottlenecks, and implement solutions to enhance efficiency.
6. Conduct root cause analysis for on-server system failures and implement preventive measures.
7. Write and review code, generate and review design documentation.
8. Participate in qualifications and rollouts of software to production clusters.
9. Participate in a business-hours rotation where engineers respond to platform issues for same-day resolution.
Minimum Qualifications
* Familiarity with the mechanics behind infrastructure management.
* Fluency in Go (Golang), Python, or similar languages in a systems context.
* Customer-focused thinking and strong problem-solving with attention to detail.
* Strong systems programming skills and knowledge of operating system fundamentals.
* BS with 2+ years of experience in system software development or MS in Computer Science.
Preferred Qualifications
* Experience with large scale server provisioning and maintenance.
* Familiarity with node management systems like SaltStack, Ansible, Puppet.
* Strong proficiency in Linux/Unix internals, administration, and troubleshooting.
* Operational knowledge of Kubernetes clusters.
* Strong knowledge of networking protocols and security best practices.
* Experience with measuring, analyzing, and optimizing performance.
* Ability to effectively communicate within a team and with project stakeholders.
#J-18808-Ljbffr