Global Investment/Financial Services company is looking to hire an experienced Principal Kubernetes Site Reliability Engineer as part of their Digital Assets Technical Operations Team. You will work with various engineering teams to own the design of a new multi-region, highly available, cloud-based deployment of our applications to AWS’s Kubernetes Platform (EKS).
Experience
* Several years of hands-on experience with AWS in a production environment
* Production experience running Kubernetes workloads on AWS using EKS
* Experience creating and deploying Helm charts & libraries
* Specialist in AWS CloudFormation, IAM, VPC and network security
* Experience with monitoring tools e.g. Cloudwatch, Datadog, Splunk
* Proficiency with Unix operating systems and shell scripting
* Programming experience, e.g. Python, preferred
* Experience with CDN Providers e.g. Akamai, preferred
* Experience with the agile software development lifecycle preferred
Skills
* Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO
* Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog, Splunk, and Kibana
* See problems as opportunities to automate
* Ability to work independently with minimal direction
* Coordinate the overall design of highly available, secure, scalable microservices-based applications in AWS
* Track record of providing technical leadership to strong teams of Site Reliability Engineers
* Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones
* Work multi-functionally with other organizations and collaborate with our risk, product and engineering team leaders
* Ability to communicate at all levels with track record of strong written and verbal communications
To apply and find out more please reach out.
#J-18808-Ljbffr