Platform Engineering plays a crucial role in the Availability/Infrastructure Engineering organization, empowering development teams to easily assess product availability, promptly detect system disruptions and provide actionable insights into customer impact. This team owns platforms that support all key pillars of observability, reliability and availability. We create abstraction layers to simplify service ownership complexities and collaborate with development teams on cross-cutting initiatives to deliver dependable, high-performing systems and experiences for our customers.
We are hiring passionate Software Engineers (Senior and Lead levels) for the Platform Engineering team. The role involves designing, developing, and managing highly scalable, distributed platform services that empower engineering teams and ensure system reliability, scalability, and high-availability. The ideal candidates will have strong software development expertise, experience with Kubernetes and cloud technologies, and the ability to solve complex engineering problems.
Responsibilities
1. Design, develop, and optimize platform solutions using Python and/or Go to address key challenges in scalability, reliability, and automation.
2. Architect, build, and manage distributed systems and cloud-native applications.
3. Deploy, monitor, and manage containerized workloads using Kubernetes.
4. Automate infrastructure provisioning, configuration, and CI/CD pipelines using tools like Terraform.
5. Collaborate with cross-functional teams to deliver robust solutions aligned with business needs.
6. Drive software development best practices, including code quality, testing strategies, and design patterns.
7. Troubleshoot complex production issues, perform root cause analysis, and optimize system performance.
8. Mentor team members and lead high value initiatives.
9. Participate actively in Agile processes, including sprint planning, reviews, and retrospectives.
Required Skills/Experiences
1. 6+ years (Senior) / 8+ years (Lead) of hands-on software development experience with strong programming skills in Python and/or Go.
2. Proven experience in building and maintaining scalable, distributed systems and microservices.
3. Mandatory hands-on experience with Kubernetes for container orchestration and management.
4. Expertise with infrastructure-as-code tools like Terraform for cloud automation.
5. Experience with cloud platforms like AWS (GCP/Azure is a plus).
6. Solid understanding of CI/CD pipelines, automated deployments, and version control systems like Git.
7. Strong grasp of software design principles, coding standards and testing frameworks (unit and integration testing).
8. Ability to debug, troubleshoot, and optimize software systems for performance and reliability.
9. Nice to have: Experience with AI/ML technologies and their integration into platform services.
Additional Skills:
1. Ability to collaborate effectively with global teams and work in an Agile environment.
2. Strong communication and leadership skills (especially for Lead-level candidates).
#J-18808-Ljbffr