Principal Kubernetes Site Reliability Engineer
We are seeking an experienced Principal Kubernetes Site Reliability Engineer to join our Digital Assets Technical Operations Team. You will work closely with various engineering teams to design and implement a new multi-region, highly available, cloud-based deployment of our applications on AWS's Kubernetes Platform (EKS).
Responsibilities:
* Owning the design of a new multi-region, highly available, cloud-based deployment of our applications on EKS.
* Working with various engineering teams to implement and maintain the deployment.
* Collaborating with other organizations to coordinate the overall design of highly available, secure, scalable microservices-based applications in AWS.
Requirements:
* Several years of hands-on experience with AWS in a production environment.
* Production experience running Kubernetes workloads on AWS using EKS.
* Experience creating and deploying Helm charts & libraries.
* Specialist in AWS CloudFormation, IAM, VPC, and network security.
* Experience with monitoring tools e.g. Cloudwatch, Datadog, Splunk.
* Proficiency with Unix operating systems and shell scripting.
* Programming experience, e.g. Python, preferred.
* Experience with CDN Providers e.g. Akamai, preferred.
* Experience with the agile software development lifecycle, preferred.
* Automated CI/CD pipelines, e.g. JenkinsX (Kubernetes native), Jenkins Enterprise.
Skills:
* Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO.
* Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog, Splunk, and Kibana.
* Able to see problems as opportunities to automate.
* Ability to work independently with minimal direction.
* Track record of providing technical leadership to strong teams of Site Reliability Engineers.
* Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones.
* Able to communicate at all levels with track record of strong written and verbal communications.