Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
Leeds Building Society
How you'll help us live our purpose SiteReliabilityEngineering Lead DevOps Core Banking We've been helping our members save for their future and buy a home of their own since 1845. By joining us, you'll play a big role in helping us to … It's a purpose that drives everything we do and one we're proud of. And you can play your part too as our SiteReliabilityEngineering Lead for Core Banking. This is a unique opportunity to be there from the beginning to shape and manage a … revolutionise the way we deliver solutions to our customers and colleagues. How you'll make a difference We are currently looking to recruit a SiteReliabilityEngineering Lead for Core Banking who is passionate about leading operational excellence across our core banking cloud platforms at enterprise scale. More ❯
Direct message the job poster from bet365 Recruitment Specialist - Infrastructure and Cloud Who we are looking for A SiteReliability Engineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software … engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices … features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture More ❯
The SiteReliabilityEngineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our … on-call and incident management functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failures scenarios, and design … systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information Security team to ensure that cloud infrastructure is properly secured, and that sufficient controls are in place to meet our compliance goals with respect to industry standards such as SOC 2. Role Responsibilities Write high-quality More ❯
and ensure Morrisons’ applications and infrastructure are resilient, efficient, and aligned with architectural goals. This is a key role for those passionate about advancing SRE practices at enterprise scale. Responsibilities Act as SME within their Domain teams for advice & guidance in terms of CI/CD, automation and product ways … of working and SRE/Engineering standards Drive the adoption of Engineering standards and Continuous Delivery principles within multiple domains The escalation point for SRE/Engineering ways of working Influence good practices and standards within SDLC throughout the business Influence partners Infrastructure best practices Implementation of … and patterns Engineering Tooling, Patterns, Framework and Standards Proprietary code quality management inclusive of technical debt About you Knowledge In depth understanding of SRE/Engineering, Architecture and Testing practices In depth understanding of the principals of CI/CD within SRE/Engineering In depth understanding More ❯
architecture and design of new and existing systems, establish best working practices, and deliver high-quality software products. With your knowledge of various software engineering methodologies, you’ll bring fresh ideas and approaches that have a real impact at the heart of our mission to keep the UK safe … to develop yourself and others. You might be reviewing pull requests, defining review, branching, and deployment strategies, or working with a range of software engineering frameworks. You operate at a deep technical level, leveraging your familiarity with languages such as JavaScript, Java, C++, Node, Python, Rust, Go, and .NET. … Importantly, you’ll bring a genuine excitement for discovering new software engineering techniques. You are part of a wider network of peers keen to share experiences, collaborate on projects, and learn from each other. With your experience, you set the standard, share innovative ways of working, and identify new More ❯
architecture and design of new and existing systems, establish best working practices, and deliver high-quality software products. With your knowledge of various software engineering methodologies, you’ll bring fresh ideas and approaches that have a real impact at the heart of our mission to keep the UK safe … to develop yourself and others. You might be reviewing pull requests, defining review, branching, and deployment strategies, or working with a range of software engineering frameworks. You operate at a deep technical level, leveraging your familiarity with languages such as JavaScript, Java, C++, Node, Python, Rust, Go, and .NET. … Importantly, you’ll bring a genuine excitement for discovering new software engineering techniques. You are part of a wider network of peers keen to share experiences, collaborate on projects, and learn from each other. With your experience, you set the standard, share innovative ways of working, and identify new More ❯
Lead SiteReliability Engineer Pay up to £89,995 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. Are you someone who has excellent stakeholder management and problem-solving skills? Do you like finding the root cause of a problem and … happen again? DWP. Digital with Purpose. We have a fantastic opportunity to join our community of tech experts at DWP Digital as a Lead SiteReliability Engineer. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly …/CD pipelines for efficient and reliable software delivery. Strong experience in resolving complex technical incidents, ensuring minimal downtime and swift recovery. Expertise in reliabilityengineering, including capacity and performance management through effective monitoring, logging, and alerting. Ability to engage with stakeholders at various levels, providing valuable feedback More ❯
Lead SiteReliability Engineer Pay up to £89,995 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. Are you someone who has excellent stakeholder management and problem-solving skills? Do you like finding the root cause of a problem and … happen again? DWP. Digital with Purpose. We have a fantastic opportunity to join our community of tech experts at DWP Digital as a Lead SiteReliability Engineer. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly …/CD pipelines for efficient and reliable software delivery. Strong experience in resolving complex technical incidents, ensuring minimal downtime and swift recovery. Expertise in reliabilityengineering, including capacity and performance management through effective monitoring, logging, and alerting. Ability to engage with stakeholders at various levels, providing valuable feedback More ❯
Lead SiteReliability Engineer Pay up to £89,995 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work-life balance. Are you someone who has excellent stakeholder management and problem-solving skills? Do you like finding the root cause of a problem and … happen again? DWP. Digital with Purpose. We have a fantastic opportunity to join our community of tech experts at DWP Digital as a Lead SiteReliability Engineer. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly …/CD pipelines for efficient and reliable software delivery. Strong experience in resolving complex technical incidents, ensuring minimal downtime and swift recovery. Expertise in reliabilityengineering, including capacity and performance management through effective monitoring, logging, and alerting. Ability to engage with stakeholders at various levels, providing valuable feedback More ❯
also assist with CloudOps activities. Are you an experienced IT professional with a strong background in DevOps and SiteReliabilityEngineering (SRE)? Are you passionate about working with cutting-edge technologies, driving agile methodologies, and implementing CI/CD practices? Do you have knowledge of infrastructure as … code? Requirements: - Solid experience in a similar role, working on DevOps or SRE initiatives within complex IT environments with Software Engineering - AWS environment - Proficiency in DevOps practices and related technologies, such as CI/CD pipelines & infrastructure as code tools such as Terraform, Ansible, Puppet or Bicep. - Strong understanding … to ensure system reliability and performance. - Any Linux experience would be a bonus Key Responsibilities: - Drive the strategy and implementation for DevOps and SRE practices. - Collaborate with cross-functional teams to design and implement CI/CD pipelines, ensuring efficient and reliable software delivery. - Establish and maintain best practices More ❯
Job Title: Senior SiteReliability Engineer Role Overview: As a Senior SiteReliability Engineer, you will be responsible for maintaining and enhancing the reliability, scalability, and performance of our clients' platform. You’ll collaborate with engineering teams to troubleshoot, prevent issues, and build proactive … monitoring, alerting, and diagnostic tools to identify and resolve infrastructure, platform, and application issues quickly and effectively. Proactively monitor system health to spot potential reliability, performance, and operational improvements. Lead the incident response process, conducting root cause analysis, and driving improvements to prevent future incidents. Optimise resource usage in … cloud environments, with a particular focus on AWS, to improve cost-efficiency and scalability. Create and maintain tools that promote best practices in service reliability, ensuring smooth adoption across the organisation. Write clean, efficient code that enhances system scalability, performance, maintainability, and security. Collaborate with cross-functional teams to More ❯
Manager, Software Engineering - Couchbase SRE Team Manchester, UK As industries race to embrace AI, traditional database solutions fall short of rising demands for versatility, performance, and affordability. Couchbase is leading the way with Capella, the developer data platform for critical applications in our AI world. By uniting transactional, analytical … of the Fortune 100, Couchbase is unlocking innovation, accelerating AI transformation, and redefining customer experiences. Come join our mission. Software Engineering Manager - Couchbase SRE The SiteReliabilityEngineering team is responsible for pro-active health and maintenance of Capella - Couchbase Cloud Offering. They pro-actively ensure … the health of Capella services meeting SLAs/SLOs. SRE is engaged for the entire lifecycle of service & features, ensuring metrics, monitoring, alerting, troubleshooting, and mitigations. SRE is promoting Observability and implementing required Infrastructure and Tooling. You will lead and manage the Capella Operational Tooling (FM) team and collaborate across More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
Couchbase
Manager, Software Engineering - Couchbase SRE Team Manchester, UK As industries race to embrace AI, traditional database solutions fall short of rising demands for versatility, performance, and affordability. Couchbase is leading the way with Capella, the developer data platform for critical applications in our AI world. By uniting transactional, analytical … of the Fortune 100, Couchbase is unlocking innovation, accelerating AI transformation, and redefining customer experiences. Come join our mission. Software Engineering Manager - Couchbase SRE The SiteReliabilityEngineering team is responsible for pro-active health and maintenance of Capella - Couchbase Cloud Offering. They pro-actively ensure … the health of Capella services meeting SLAs/SLOs. SRE is engaged for the entire lifecycle of service & features, ensuring metrics, monitoring, alerting, troubleshooting, and mitigations. SRE is promoting Observability and implementing required Infrastructure and Tooling. You will lead and manage the Capella Operational Tooling (FM) team and collaborate across More ❯
Manchester Area, United Kingdom Hybrid / WFH Options
LinuxRecruit
Time to enhance your scope; broaden your horizon by delving into SiteReliabilityEngineering (SRE). You’ll take the skills you have picked up in software engineering and apply these to improve overall system and application performance and reliability. You’ll work on internal developer … others! You will be developing solutions to complex monitoring, automation, and capacity management problems so prior working experience of approaching tasks methodically to solve engineering problems is key. In addition to your programming skills, knowledge of metrics, monitoring and observability would be highly beneficial. Experience with the full SDLC … and deployments of code through pipelines into containers - modern cloud native software engineering, would be beneficial. This role offers the chance to work on a large scale infrastructure that has to operate at high speed to meet substantial consumer demand. A 24/7 uptime is essential in this More ❯
re Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a SiteReliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred Experienced in production environments running in … troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues Familiarity working in an Agile environment True understanding of SiteReliabilityEngineering Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS More ❯
GTIS Public Cloud Engineering is a global team of circa 100 colleagues based in the UK, India, and the US. We are accountable for strategic engineering and delivery of Public Cloud services within Enterprise Technology. Our team is at the forefront of the migration of Barclays applications to … team to meet the objective of You Build It, You Own It . This role is for a SiteReliability Engineer (SRE) in a team that will be part of the whole lifecycle of feature development, from solution design all the way through to production support and … back again, helping to shape and drive our SRE capability. We are looking for an experienced Engineer who is a motivated, supportive, self-starter; able to thrive in a dynamic and fluid environment and to lead in technical design discussions as well as communicating to senior groups and stakeholders. You More ❯
Job Title QA Engineer - SRE Job Description Sage is a global leader in accounting, payroll, and financial management solutions. We empower businesses with innovative cloud-based software, simplifying and automating financial processes so they can thrive in a digital world. We're looking for a QA Engineer to join our … SiteReliabilityEngineering (SRE) team within Cloud Services Engineering & Operations. In this role, you'll play a key part in ensuring the reliability, performance, and resilience of our cloud-based accounting products, helping us deliver seamless, high-quality solutions to customers worldwide. This is a … contributing to continuous improvement initiatives. Coordinate User Acceptance Testing (UAT) to ensure smooth product releases. Mentor junior QA engineers in automation, cloud QA, and SRE methodologies. What We're Looking For Must-Have Skills: Experience in performance and load testing for cloud-based applications. Proficiency in UI test automation (e.g. More ❯
Reliability Engineer - Azure - Newcastle Location: Newcastle Upon Tyne, England, United Kingdom Role Overview: We are seeking a SiteReliability Engineer (SRE) with experience in Kubernetes/Openshift to work on a Terraform-centric opportunity in the Cloud. This role involves working with a team of high … and maintain Cloud environments. Utilize Terraform, Kubernetes, and other related technologies. Requirements: Active SC Clearance. Experience with Kubernetes and Terraform. Ability to work on-site 3 days a week in Newcastle (relocation provided). Compensation: £50,000 plus benefits including certification support, perkbox discounts, and a solid progression roadmap. More ❯
Newcastle upon Tyne, England, United Kingdom Hybrid / WFH Options
DWP Digital
Job Description Lead SiteReliability Engineer Pay up to £89,995 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work life balance. Are you someone who has excellent stakeholder management and problem-solving skills? Do you like finding the root cause of a … happen again? DWP. Digital with Purpose. We have a fantastic opportunity to join our community of tech experts at DWP Digital as a Lead SiteReliability Engineer. We're using fresh ideas and leading-edge tech to build and maintain digital solutions that will be used by nearly More ❯
re Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a SiteReliability Engineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry. Experienced in production environments running in AWS. … troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues. Familiarity working in an Agile environment. True understanding of SiteReliability Engineering. Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS & ECS. More ❯
Manchester, North West, United Kingdom Hybrid / WFH Options
Stealth IT Consulting Limited
SiteReliability Engineer (SRE) Global Consultancy Up to £55k + benefits Hybrid remote, based in Manchester, London, or Glasgow A global consultancy with extensive plans to expand their Digital teams throughout 2025 are looking for a SRE to join the team. We are ideally looking for a security … cleared SRE (Valid and transferable clearance) but we can consider candidates eligible for security clearance. The role will require out of hours support dependant on the clients request, please only apply if you are comfortable and are able to support this. Desired Skills and Experience: Proven experience in implementing SRE … the ability to collaborate effectively with developers and stakeholders. Experience with tools like Dynatrace, Prometheus, and Open Telemetry is a plus. Key Responsibilities: Implementing SRE principles (Dynatrace, Prometheus, and Open Telemetry are a bonus). Collaborating with teams to define and implement Service Level Indicators (SLIs) and Service Level Objectives More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Sage City
Job Description We are looking for a SiteReliability Engineer to join our SRE Enablement team, a specialised function within Cloud Operations focused on building reusable infrastructure, automation, and tools that enable CloudOps and Engineering teams to operate more efficiently. You will have the opportunity to be … a key driver for SRE adoption within Sage, taking the helm in developing scalable frameworks to improve developer experience, remove toil and ultimately focus on embedding SRE best practices within the wider business. If you have experience working with Terraform and modern CI/CD workflows this could be the … also engage with broader teams to help implement these new approaches. You will have oversight of the entirety of Sage's product-suite and SRE teams as you work closely with them to build tools to make them more successful. Please note this is a hybrid role - you will be More ❯
Role Title: Service Management Enterprise Architect Location : Chester - 2 to 3 days per week on site Rate: £750 to £850 via umbrella company Length: Initial 3 months with strong potential for extension We are seeking an experienced Service Management Architect to lead the design, implementation, and optimization of our … and cloud-based environment. The Service Management Architect will define service strategies, establish best practices, and drive continuous improvement to enhance service delivery, system reliability, and customer satisfaction at an enterprise level. Key Responsibilities: Service Strategy and Design: Develop and implement a comprehensive Service Management Strategy aligned with enterprise … or highly regulated industries. Certification in COBIT or other IT governance frameworks. Experience with cloud-native service management solutions and microservices architecture. Familiarity with SRE (SiteReliabilityEngineering) principles and practices. More ❯
DevOps SiteReliability DevOps Engineer Location: West Yorkshire Hybrid, 2 days a week in the office Salary: £45,000 - £56,500 + benefits Working with a leading E-commerce company we are seeking an experienced DevOps Engineer to optimise software deployment and enhance the performance of high-traffic … GDPR, PCI-DSS) and OWASP best practices. Support disaster recovery planning and backup strategies. What We're Looking For: Considerable experience in DevOps/SiteReliabilityEngineering for high-traffic systems. Experience with CI/CD tools (Jenkins, GitLab) and performance monitoring (New Relic, Lighthouse). Strong More ❯
GTIS Public Cloud Engineering is a global team of circa 100 colleagues based in the UK, India, and the US. We are accountable for strategic engineering and delivery of Public Cloud services within Enterprise Technology. Our team is at the forefront of the migration of Barclays applications to … team to meet the objective of You Build It, You Own It . This role is for a SiteReliability Engineer (SRE) in a team that will be part of the whole lifecycle of feature development, from solution design all the way through to production support and … back again, helping to shape and drive our SRE capability. You must demonstrate strong problem-solving techniques and display an ability to reach decisions under conditions of uncertainty or high risk. Microsoft Azure Accreditation Experience with the Azure platform, including its services, architecture, and best practices. Proficiency in Python Intermediate More ❯