8 of 8 Chaos Engineering Jobs in London

SRE Architect (68019) (DEAI DS) Cloud & Data Engineering United Kingdom

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

passion for achieving great things in the world are equally as important to us. Job Description Mandatory Skills: Observability, Resiliency, Service Management, Reliability, Performance engineering, Scalability, release management, Cloud cost management. Role Description Skills: ROLE PURPOSE Lead the Site Reliability Engineering practice, driving the transformation from reactive operations … proactive, engineering-led reliability. Own the definition and enforcement of non-functional requirements (NFRs) using FMEA-based resiliency frameworks, and champion observability, self-healing automation, automated incident management, and database operations automation. Ensure systems are resilient, performant, cost-optimised, and continuously improving. KEY RESPONSIBILITIES Define and enforce non-functional ...

Lead Site Reliability Engineer (AWS)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

excellence of the company's global digital platforms. With a strong specialism in AWS, along with demonstrable experience, you will work closely with engineering teams, architects, senior stakeholders, and our managed service partner to design, implement, and continuously improve resilient, scalable, and secure systems. You will leverage … incident management to deliver high system uptime and support evolving business needs. About the Team The role sits within the Digital and Technology Engineering team, which partners across the company to deliver customer‐centric digital products and services. The Engineering function brings together Architecture, Software Engineering ...

Principal/Senior Site Reliability Engineer

Hiring Organisation: Jobleads-UK
Location: City of Westminster, England, United Kingdom

organisation to design resilient, cloud-based systems for MLOps and HPC workloads at global scale. This is a role for someone who wants their engineering craft to have real impact on science and patients. The Opportunity: You architect Infrastructure as Code using Terraform, Pulumi, or CloudFormation to provision … resilience building disaster recovery and failover plans with auto-scaling and load balancing to keep critical systems available worldwide. You strengthen reliability through chaos engineering, running experiments that validate systems and surface weaknesses before they become incidents. You build deep observability with monitoring, logging, and alerting frameworks such ...

Senior DevOps Engineer

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

afraid to fail and enjoy tackling difficult problems head-on. What you will do: Provide senior DevOps expertise and leadership across Engineering at all layers of the stack Evangelise DevOps, security and reliability engineering across the Engineering team-at-large Provision resilient infrastructure across multiple regions … logs and dashboards Durably engineer away toil You will be a great fit here if you: Are passionate about DevOps, Security and Reliability engineering Are passionate about helping engineering teams become high performing teams Are passionate about helping build reliable and performant cloud-native applications Act like ...

Architect & Delivery Lead (68018)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

Make architectural decisions across IAM, Cloud, SRE, Network, Data, and Security — ensuring coherence, reusability, and alignment with business objectives Establish and chair the Architecture & Engineering Governance board, providing technical assurance across all workstreams Own the programme roadmap, resource plan, and financial model — tracking cost savings, team reduction trajectory … vendor and tool selection, ensuring standardisation across the programme and eliminating redundant tooling Build and lead high‐performing distributed teams, fostering a culture of engineering excellence, accountability, and continuous improvement Define the continuous improvement factory model, ensuring the transformation sustains beyond the initial programme Technical Skills & Expertise Broad ...

Advisory and Solution Architecture - Executive Director

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

defining and governing end-to-end solution architectures for critical platforms and transformation programs. You will partner closely with CTOs, Chief Architects, and senior engineering leaders to deliver business and technology strategy. Your role blends deep architectural expertise with executive presence, enabling you to influence both business and technology … senior stakeholders and provide hands‐on guidance to engineering teams as needed. You will help accelerate delivery, embed security and resiliency, and foster a high-performing architecture community. Job responsibilities Translates business strategy into pragmatic solution roadmaps and reference architectures Drives architecture decisions balancing resilience, scalability, latency, cost, risk ...

Sr. Software Engineer - Data Platform (London, Hybrid)

Hiring Organisation: Jobleads-UK
Location: Greater London, England, United Kingdom

worked with all these technologies – we value strong distributed systems fundamentals, the ability to write production-quality code, and the passion for solving complex engineering challenges. We'll support your growth in streaming technologies and expect you'll be comfortable collaborating with teams distributed across various geographies and time … centers Develop RESTful APIs and SDKs that enable other teams to easily integrate with streaming platform capabilities Work closely with platform consumers, SREs, and engineering teams across the organization to understand requirements and deliver scalable solutions Challenge the status quo by continuously improving platform performance, reliability, scalability, and developer ...

OAT Quality Engineer

Hiring Organisation: Hays Technology
Location: London, United Kingdom
Employment Type: Contract
Contract Rate: £398/day, Inside IR35

Service Management or mission-critical production environments. Strong analytical and problem-solving skills with the ability to identify and mitigate operational risks. Exposure to Chaos Engineering and/or DevOps practices would be highly advantageous. If you're currently SC Cleared and are interested in this role, click ...