OpenJaw Technologies is a leading online technology partner of the world’s biggest travel brands, with a customer portfolio that boasts Southwest Airlines, British Airways, Cathay Pacific, and Four Seasons (to name just a few).
We are seeking a highly skilled Senior Site Reliability Engineer to ensure the reliability, security, and scalability of our SaaS platform hosted on AWS. You will be a key player in maintaining 24/7 uptime, optimizing cloud infrastructure, automating deployments, and troubleshooting complex issues. This role requires deep AWS expertise, strong DevOps practices, and collaboration with DevOps, Support, and Engineering teams. This is a permanent, full-time role in an ambitious and dynamic company.
Key Responsibilities
Cloud Infrastructure & Operations
1. Manage and monitor AWS services (EC2, ECS/EKS, RDS, Lambda, S3, CloudFront, VPC, IAM, etc.).
2. Ensure high availability, performance, and cost efficiency of cloud resources.
3. Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or CDK.
4. Automate deployments using CI/CD pipelines (GitHub Actions, Jenkins, AWS CodePipeline).
5. Configure and optimize monitoring & alerting (CloudWatch, Datadog, Prometheus, OpenSearch).
Security & Compliance
1. Enforce cloud security best practices (IAM policies, encryption, WAF, security groups).
2. Conduct vulnerability assessments and collaborate on SOC 2/ISO 27001 compliance.
3. Troubleshoot network issues (VPN, VPC peering, NACLs, Route 53).
Incident Management & Support
1. Lead incident response for production outages and perform root cause analysis (RCA).
2. Work with Customer Support to resolve escalated technical issues.
3. Maintain runbooks, disaster recovery (DR), and backup strategies.
Collaboration & Optimization
1. Partner with DevOps and Engineering to improve scalability and reliability.
2. Mentor junior engineers and document best practices.
3. Continuously optimize cost, performance, and automation.
Required Skills & Experience
1. 5+ years of hands-on AWS cloud operations/engineering experience.
2. Strong expertise in Linux/Windows administration, networking, and scripting (Bash, Python).
3. Proficient in containerization (Docker, Kubernetes/EKS) and serverless (Lambda).
4. Experience with IaC (Terraform preferred), CI/CD, and configuration management (Ansible).
5. Knowledge of monitoring, logging, and APM tools (CloudWatch, ELK, OpenSearch, Datadog).
6. Understanding of security best practices (CIS benchmarks, penetration testing).
7. AWS Certifications (e.g., AWS Certified SysOps Administrator, Solutions Architect, DevOps Engineer) are a plus.
Soft Skills
1. Strong problem-solving and troubleshooting skills.
2. Ability to work in a fast-paced, 24/7 SaaS environment.
3. Excellent communication for cross-team collaboration.
Nice to Haves
1. Experience in travel tech or B2B SaaS.
2. Knowledge of multi-region architectures.
3. Familiarity with ITIL processes (change management, incident management).
If you wish to be considered for this position, please submit your CV to recruitment@openjawtech.com.
OpenJaw Technologies is an equal opportunities employer. No agency calls, please.