Site Reliability Engineering Jobs in London

Employment Type

Remote Jobs

Hybrid/WFH 104

Sort By

Relevance
Date

Locations

Job Titles

Principal Site Reliability Engineer

London, UK

NielsenIQ

businesses to master market measurement, understand consumer behavior, and drive innovation. Job Description Key Responsibilities: Provide senior-level leadership and technical guidance to the Site Reliability Engineering team. Develop and execute a comprehensive technical strategy for ensuring high availability, scalability, and performance across GfK’s platforms, with … a focus on the next-generation platform, GFKnewron. Technical Expertise and Problem-Solving: Serve as the principal technical expert in all aspects of cloud engineering (especially Google Cloud Platform - GCP), container orchestration (Kubernetes), Infrastructure as Code (Terraform), GitOps practices, monitoring (Prometheus, Grafana), and CI/CD pipelines (GitLab). … Be the go-to person for resolving complex technical challenges and providing innovative solutions to ensure system reliability and performance. Team Leadership: Oversee and mentor team members, fostering a culture of continuous learning and improvement. Guide the implementation of best practices in site reliability engineering, ensuring More ❯

Posted: 5 days ago

Site Reliability Engineer - SRE Consultant

London, UK
Hybrid / WFH Options

Akkodis

Site Reliability Engineer - SRE Consultant Akkodis are currently working in partnership with a leading service provider to recruit an experienced Site Reliability Engineer with experience in ensuring reliability, scalability and efficiency of client platforms. Please note this is a fully remote role with travel to … you must be eligible to gain security clearance (do not need to hold currently). The Role As a Site Reliability Engineer (SRE) you will lead site reliability engineering initiatives with a strong emphasis on observability, ensuring high performance and reliability of applications & infrastructure. … Provide strategic insights to shape the overall SRE strategy while collaborating on the design and implementation of scalable and reliable solutions. Establish effective monitoring, alerting and incident response strategies to maintain system availability and promote continuous improvement by collaborating with team members to deliver observability best practices and SRE methodologies. More ❯

Posted: 5 days ago

Site Reliability Engineer - SRE Consultant

City of London, London, United Kingdom
Hybrid / WFH Options

Akkodis

Employment Type: Permanent

Salary: £45000 - £55000/annum

Posted: 12 days ago

Senior Service Assurance Engineer II

London, UK

American Express

to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. Site Reliability Engineering/Application Support (SRE/AS) is a continuous engineering discipline that effectively combines software development and systems engineering to build and run scalable, distributed, fault-tolerant … keeping an ever-watchful eye, automated, on capacity and performance. How will you make an impact in this role? This role will drive the SRE/AS mindset which strives to use software engineering to build and run better production systems. You will write software to optimize day to … Express digital assets and influence how millions of people interact with their cards, their merchants, and their money. The Senior Service Assurance Engineer II (SRE/AS Engineer) role is a hands-on Senior Architect Level position supporting American Express Site Reliability Engineering/Application Support team. More ❯

Posted: Yesterday

Senior Service Assurance Engineer II | London, UK

London, UK

American Express

Posted: 4 days ago

Senior Site Reliability Engineer (SRE)

London, UK
Hybrid / WFH Options

Stacklok

truly innovative and impactful. Learn more about Stacklok’s mission, virtues, and leadership, HERE . Location This is a hybrid role that requires on-site work at our London office three (3) days a week. Our office is conveniently located in WeWork at 1 Mark Square, London, EC2A 4EG. … effectively manage and maintain a robust security posture across the entire software supply chain. We are seeking a Senior Site Reliability Engineer (SRE) to support Stacklok Insight, our package intelligence service that empowers developers to make safer open source dependency choices. Embedded within the Stacklok Insight product team … exceptional service performance and reliability. In addition, this role will be part of a company-wide guild dedicated to unifying platform automation, observability, and reliability practices across all product lines, building a cohesive, high-performance SaaS platform with seamless observability and reliability throughout the Stacklok ecosystem. If site More ❯

Posted: Yesterday

Software Engineer, Site Reliability Engineering

London, United Kingdom

Google

Preferred Qualifications: Master's degree in Computer Science or Engineering, or a related field. About the Job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding … algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 5 days ago

Senior Systems Engineer, Site Reliability Engineering

London, UK

Google

Preferred Qualifications: Master's degree in Computer Science or Engineering, or a related field. About the Job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding … algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a More ❯

Posted: 4 days ago

Software Engineer II, Site Reliability Engineering

London, United Kingdom

Google

work, or Open Source projects. Preferred Qualifications: Master's degree in Computer Science or Engineering. About the Job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding … algorithms, complexity analysis, and large-scale system design. SRE's culture of intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 5 days ago

Software Engineer, Site Reliability Engineering, Google Cloud

London, United Kingdom

Google

automate routine tasks. Systematic problem-solving approach, coupled with effective verbal and written communication skills. About the Job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale unique to Google Cloud, while using your expertise in coding, algorithms, complexity … analysis, and large-scale system design. SRE's culture of intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame-free environment. More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 5 days ago

Site Reliability Engineering - QA Tester

London, UK
Hybrid / WFH Options

ZILO™

Site Reliability Engineering - QA Tester About: Step forward into the future of technology with ZILO. We're here to redefine what's possible in technology. While we're trusted by the global Transfer Agency sector, our technology is truly flexible and designed to transform any business at … scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious mind, and set … our progress and creates real impact. If you're ready to shape the future, let's talk. Requirements As a QA Tester within our Site Reliability Team, you will play a crucial role in maintaining and enhancing the quality and reliability of our SaaS platform. You will More ❯

Posted: 3 days ago

Site Reliability Engineering - QA Tester

London, UK
Hybrid / WFH Options

TN United Kingdom

Social network you want to login/join with: Site Reliability Engineering - QA Tester, London Client: ZILO Location: London, United Kingdom Job Category: Other EU work permit required: Yes Job Reference: 3d67b994eba8 Job Views: 3 Posted: 30.03.2025 Expiry Date: 14.05.2025 Job Description: About: Step forward into the … flexible and designed to transform any business at scale. We’ve created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can’t match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore … our progress and creates real impact. If you’re ready to shape the future, let’s talk. Requirements: As a QA Tester within our Site Reliability Team, you will play a crucial role in maintaining and enhancing the quality and reliability of our SaaS platform. You will More ❯

Posted: 2 days ago

Technical Product Manager - Observability

London, UK

Buscojobs

small business, we’ll be building a stronger economy that can change the world. About the team In Site Reliability Engineering (SRE), we drive and influence Xero to provide the most reliable experience for our customers. We are a global team based across New Zealand, Australia and … ownership. About the role The Technical Product Manager for Observability plays a crucial role in shaping the organization's observability strategy, collaborating closely with SRE and product engineering teams to define a comprehensive vision, translating technical challenges into actionable product initiatives. This role will work with various engineering … feedback and operational experiences. Our ideal candidate will bring a balanced mix of technical knowledge and product management expertise. You'll be passionate about SRE & DevOps, with strong communication skills, and empathy for engineering teams and the challenges they face. What you'll do : Foster a culture of continuous More ❯

Posted: 5 days ago

Site Reliability Engineer (SRE)

London Area, United Kingdom

Levy Global

Site Reliability Engineer (Observability) London- Hybrid/3 Days Contract Inside IR35- 6 Months initially We’re looking for a Site Reliability Engineer (SRE) to join our client to build and maintain observability systems and to ensure their core services remain reliable, scalable, and high-performing. Responsibilities: Deploy and … incident response. Build Grafana dashboards for system insights. Apply Infrastructure as Code (IaC) principles. Develop tooling in Golang (preferred) or Python . Advocate for SRE principles like SLOs, SLIs, and error budgets. Integrate monitoring with incident management workflows. Requirements: SRE principles and reliability engineering expertise. Solid familiarity with More ❯

Posted: 2 days ago

Site Reliability Engineer (SRE)

london, south east england, united kingdom

Levy Global

Posted: 8 days ago

Senior Site Reliability Engineer | UK

London, UK

Prima Assicurazioni

is used in context with load balancing, in order to optimize user experience. 1 day CookieNameProviderPurposeMaximum Storage DurationTypeNameProviderPurposeMaximum Storage DurationType London - United Kingdom **Senior Site Reliability Engineer | UK****Overview****Job description**IT technology lies at the very core of everything we do and our Engineering and Product … is, so we’ll go the extra mile to help you when we can. We are seeking an experienced Site Reliability Engineer (SRE) to join our Infrastructure team. As an SRE, your primary responsibility will be to ensure the reliability, availability, and performance of our technology platforms. … best security practices, participating in vulnerability assessments, and threat mitigation.Requirements: - Deep understanding and experience in Site Reliability Engineering and in implementing SRE Practices- Excellent knowledge of AWS services and hands-on experience in production environments- Proficiency with networking protocols, DNS principles, and container orchestration technologies (Kubernetes, Helm More ❯

Posted: 2 days ago

Head of Site Reliability Engineering

London, United Kingdom

Rewardgateway

for a Head of Site Reliability Engineering to join our team to help us transform our existing operational workloads to an SRE approach. Key Responsibilities Establishing and managing our new SRE function Operating and modernising our existing cloud infrastructure Partnering with our DevOps team to ensure fast … levels Acting as a key Incident Commander and escalation point Liaising closely with our SecOps teams to ensure timely vulnerability management Educating teams in SRE practices and maintaining high standards of compliance Implementing world-class observability standards utilising SLI/SLO/Error Budgets Continually evolving our observability platforms for … greater coverage Liaising with Product & Engineering teams for constant evolution of metrics Aligning SRE Sprints & Backlog with our roadmaps to meet business expectations Guiding our teams in a more Agile approach to demand management Actively taking part in our daily stand-ups and keeping our Sprints on track Keeping More ❯

Employment Type: Permanent

Salary: GBP Annual

Posted: 5 days ago

Lead Site Reliability Engineer

London, UK

Mentmore

on your skills and experience — talk with your recruiter to learn more. Base Pay Range My financial services client is looking for a Lead Site Reliability Engineer who will be responsible for ensuring the reliability and scalability of their infrastructure and services. This is a senior role …/mentoring experience and be able to balance technical delivery, team productivity, performance measurement, and collaboration across teams and stakeholders. Duties & Responsibilities: Hands-On Engineering & Technical Leadership Design, develop, and maintain cloud infrastructure (Azure/AWS) using Terraform and automation. Lead troubleshooting, performance optimisation, and incident resolution to enhance … security requirements, such as ISO 27001, PCI DSS, CE+ and SOX, with experience implementing compliance-driven engineering practices. Advocate for modern DevOps and SRE best practices, championing collaboration, transparency, automation, continuous learning, and continuous improvement across teams. Excellent communication skills, able to engage stakeholders, collaborate cross-functionally, and drive More ❯

Posted: 2 days ago

Group Data Operations Manager

London, UK
Hybrid / WFH Options

Howden Group Holdings

Howden. Job Purpose: The Operations Manager will be responsible for overseeing operational support teams and driving the development of Service Reliability Engineering (SRE) practices. This role will be pivotal in ensuring the reliability, availability, and performance of our data-driven services while fostering a culture of continuous … support professionals, fostering an environment of collaboration, accountability, and professional growth. Develop and implement strategies to enhance team performance and customer service delivery. Service Reliability Engineering Build and establish a comprehensive Service Reliability Engineering framework that aligns with the company's business objectives and enhances the … reliability of our data services. Collaborate closely with platform engineering teams to ensure that the architecture, design, and deployment of data platforms facilitate high availability, scalability, and resilience. Implement best practices for incident management, change management, and problem resolution to minimize service disruptions and enhance user experience. Continuous More ❯

Posted: 2 days ago

Site Reliability Engineer - Equity Trading Technology

London, UK

Millennium Management

Site Reliability … Engineer - Equity Trading Technology The Equity Trading Technology organization is seeking a skilled Software Engineer to join the Site Reliability Engineering (SRE) Team in our London office. This role involves designing, developing, and optimizing systems to enhance the reliability, scalability, and observability of trading applications. The … improve productivity and reduce operational toil. Stay current with industry trends: Continuously monitor and evaluate emerging trends, technologies and best practices within SDLC and SRE related fields. Identify and introduce the most relevant changes to improve development process, tools and methodologies. Qualifications and Skills 5+ years of professional software development More ❯

Posted: Yesterday

Site Reliability Engineer with Python

London, UK

TN United Kingdom

Social network you want to login/join with: Site Reliability Engineer with Python, London Client: Nexus Location: London, United Kingdom Job Category: Other EU work permit required: Yes Job Reference: 34191969ba68 Job Views: 3 Posted: 26.03.2025 Expiry Date: 10.05.2025 Job Description: Site Reliability Engineer with … Python Our Client is looking to bring on a Site Reliability Engineer to help deploy, manage, troubleshoot, and enhance our complex cloud-based set of internal tools and externally managed services for a variety of users across our wide-ranging organization. You will have at least 7 to … years of hands-on expertise working as a Site Reliability Engineer. You will work closely with IT, product, and engineering to extend and maintain this set of tools and services and to help debug and resolve problems. In addition, the ideal candidate will proactively look for system More ❯

Posted: 2 days ago

Site Reliability Engineer, Core Network

London, UK

Amazon

opportunity for you to join a world-class network team in a dynamic environment that has the feel of a start-up. As a Site Reliability Engineer you will help to deploy, manage, fix and reinvent the tools, services and components that network engineering rely on to … life harmony. Striking a healthy balance between your personal and professional life is crucial to your happiness and success here. Key job responsibilities A Site Reliability Engineer is responsible for maintaining their teams’ services, requiring them to troubleshoot and identify the root causes of any issues that arise … within their systems and any subcomponents. A Site Reliability Engineer will utilise testing, monitoring, and validations on their services, tools, and infrastructure to ensure their teams can continuously deploy new versions of the services with minimal interruption. A Site Reliability Engineer will identify areas to invent More ❯

Posted: 2 days ago

Senior Site Reliability Engineer

London, UK

Public Sector Resourcing, managed by AMS

On behalf of the Cabinet Office, we are looking for a Senior Site Reliability Engineer (Inside IR35) for a 6 Month contract based hybrid in London, Bristol or Manchester. The Cabinet Office supports the Prime Minister and ensures the effective running of government. The Cabinet Office is also … given to candidates who meet all of the essential criteria and hold active security clearance. Job Description: We are seeking a dedicated and skilled Site Reliability Engineer to join … our team. The ideal candidate will have expertise in maintaining and improving the reliability, performance, and scalability of our systems. As a senior SRE, you will work collaboratively with software engineers, operations, and other stakeholders to ensure a seamless and reliable service for our users. Key Responsibilities: Implement and More ❯

Posted: Yesterday

Site Reliability Engineer (SRE) - iCloud

London, UK

Apple Inc

Site Reliability Engineer (SRE) - iCloud London, England, United Kingdom Software and Services The Apple Service Engineering - iCloud SRE team is looking for Site Reliability Engineers to build and run the services that hundreds of millions of customers use every day. This team provides systems that … to provide extraordinary availability, scalability, and security for services that “just work.” We're looking for a talented and passionate person who loves designing, engineering, and running systems and infrastructure that will help millions of customers. Description The services that Apple and … iCloud run are massive; iCloud comprises a set of platforms and products which are foundational for both users and other Apple Services. As an SRE @ Apple, you'll need to solve problems using data, teamwork, and your own expertise. SREs @ Apple own the full infrastructure stack; from device driver performance More ❯

Posted: 4 days ago

Site Reliability Engineer

London, UK

JD.COM

and more, aiming to transform traditional business models with cutting-edge digital solutions. Know more about us: JD Corporate JD.com is seeking a passionate site reliability engineer who can ensure the stability of our eCommerce mobile apps and web apps in European countries such as the UK, France … Germany, and the Netherlands. In this role, you will be responsible for monitoring, incident management, automating deployments, scaling, reliability testing, incident post-mortems, and more. You will work closely with engineering and commercial teams globally to ensure a seamless, reliable user experience while balancing the demands of development … s degree in Computer Science, Software Engineering, or a related field. 3+ years of experience in DevOps, site reliability engineering (SRE), system stability assurance, operations and maintenance development, or related fields. Experience with common public cloud platform products (e.g., cloud hosting, cloud storage, object storage, CDN More ❯

Posted: 4 days ago

12 3 4 5

Salary Guide

Site Reliability Engineering
London

10th Percentile: £67,500
25th Percentile: £86,250
Median: £110,000
75th Percentile: £138,750

More Site Reliability Engineering insights »

1 to 25 of 287 Site Reliability Engineering Jobs in London