3 of 3 Chaos Engineering Jobs in London

Senior Site Reliability Engineer

Hiring Organisation
Moneycorp
Location
London, UK
Employment Type
Full-time
including SLO governance, resilience testing, and platform patterns, ensuring our systems meet the highest levels of operational resilience and regulatory compliance Key Responsibilities: Reliability Engineering & Observability Define and maintain SLOs/SLIs and error budgets for critical services Build and improve observability pipelines (metrics, logs, traces) Maintain dashboards … issues using telemetry Automation, DR & Resilience Testing Automate backup, restore, and failover processes Validate RTO/RPO through regular DR testing Design and run chaos engineering experiments Enhance self-healing and rollback automation Operational Excellence & Incident Leadership Lead SEV-1/SEV-2 incidents and authorize critical decisions ...

Senior Application and Platform Engineer

Hiring Organisation
London Metal Exchange
Location
London, UK
Employment Type
Full-time
Middle Office, Back Office, and Market Data mission-critical applications. This role blends traditional application support with SRE principles and platform engineering practices to ensure stability, scalability, and continuous improvement across systems serving internal teams and external clients. Core Responsibilities Reliability Engineering: Embed SRE best practices into operational … evaluating and enhancing support processes. Maintain up-to-date documentation for all supported systems and platforms. Lead operational resiliency exercises, including disaster recovery and chaos engineering tests. Identify, manage, and remediate security vulnerabilities across systems and applications. Technical Responsibilities Maintain and regularly test disaster recovery procedures. Recommend ...

Site Reliability Engineer

Hiring Organisation
Capital on Tap
Location
London, England, United Kingdom
Pulumi Experience working with a cloud monitoring solution (advantageous to have DataDog) Experience with Kubernetes and Docker Nice to have skills: Experience with Chaos Engineering practices Experience with IDPs Experience with software cataloguing Experience with observability and tracing best practices Experience in Go (preferred), Powershell (preferred), Python, C# … Video call) 🤝 Second stage : 75 minute technical & questions with SRE Team lead (Video call) 🤝 Final stage : 45 minute CV overview with Head of department & Engineering team (Video call) Diversity & Inclusion 🌈 We welcome, consider and encourage applications from anyone who shares our commitment to inclusivity. Join us in creating ...