About CVS Health and Signify Health
CVS Health, the parent company of Signify Health, is increasing investments in digital, data, analytics, and technology. Signify Health is excited to be involved in a dynamic new initiative for CVS Health that will run out of our state-of-the-art offices at Bonham Quay.
Transforming Healthcare with Technology
This is your opportunity to be involved with a pioneering business that is transforming healthcare in the United States by making customer experiences more seamless, convenient, and personalized.
Our Purpose
CVS Health is focused on driving business agility and growth through technology, data, digital, and experiential innovations. 'Digital First, Technology Forward, and Data Driven' is not simply an aspirational goal for the company, but a prerequisite to accelerated growth.
Opportunity Overview
Bring your heart to CVS Health. Every one of us at CVS Health shares a single, clear purpose: Bringing our heart to every moment of your health. This purpose guides our commitment to deliver enhanced human-centric healthcare for a rapidly changing world.
Flexible Work Arrangements
Anchored in our brand - with heart at its center - our purpose sends a personal message that how we deliver our services is just as important as what we deliver. Our Heart At Work Behaviors support this purpose. We want everyone who works at CVS Health to feel empowered by the role they play in transforming our culture and accelerating our ability to innovate and deliver solutions to make healthcare more personal, convenient, and affordable.
Join Enterprise Technology
We are hiring for a Staff Cloud Engineer to join our Enterprise Technology team. The successful candidate will put their engineering skills to use shaping the future of developer platforms for software engineers at Fortune 6 scale.
About the Role
The CVS Observability Platform gives engineers at CVS frictionless access to application instrumentation no matter where their applications run. As a Staff Cloud Engineer on the Observability Platform team, you will be architecting and scaling the data pipelines for billions of logs, metrics, and traces produced by thousands of workloads spanning multiple public cloud providers and datacenters.
Responsibilities
* Design and scale data pipelines for logs, metrics, and traces
* Develop custom software to drive the observability platform using technologies such as Java Spring Boot, Node JS, Golang, etc.
* Help engineers, who are primarily our customers, to the Observability Platform, in troubleshooting issues with Observability Platform instrumentations
* Implement OTEL client libraries for technologies such as Java Spring Boot, Node JS, Go-lang, etc.
* Participate in team 24/7/365 on-call rotations to ensure the health and stability of the Observability Platform
* Manage CI/CD pipelines for deploying and managing observability platform infrastructure to Kubernetes
* Deliver an exceptional customer experience by engaging with platform customers as they reach out with support questions
* Create comprehensive documentation for observability tools and technologies
* Watch the watchers by building and managing instrumentation and alerting for the Observability Platform itself to deliver a highly available platform
* Work closely with the SRE team to understand application team challenges in the observability space and identify opportunities to improve the Observability Platform to meet these challenges
Requirements
* 10+ years of experience in software engineering and/or site reliability engineering roles
* 10+ years of hands-on development experience with modern microservices using technologies such as Java Spring Boot, Go-Lang, Node JS, etc.
* Strong exposure to cloud platforms such as GCP, AWS, or Azure
* Strong familiarity with observability patterns and best practices including concepts like SLAs, SLOs, and SLIs
* Experience creating custom dashboards and views to understand system health and availability
* Extensive experience with modern infrastructure tooling like Docker, Kubernetes, Argo CD, Envoy/Istio
* Comfort using the Grafana Labs OSS stack: Loki, Grafana, Tempo, Mimir, et al.
Preferred Qualifications
* Understanding of the OpenTelemetry ecosystem, OTLP, and OTel Semantic Conventions
* Experience designing and scaling distributed systems
* Background in building and operating high-traffic backend services
* Familiarity with popular data-oriented open source technologies like Kafka and Postgres