The SiteReliability Engineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our platform … on-call and incident management functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failures scenarios, and design … systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information Security team to ensure that cloud infrastructure is properly secured, and that sufficient controls are in place to meet our compliance goals with respect to industry standards such as SOC 2. Role Responsibilities Write high-quality More ❯
The SiteReliability Engineering (SRE) team at Pendo is responsible for provisioning and maintaining cloud infrastructure from development through production for all product initiatives, and working with developers and product managers to ensure that our products are not only reliable and performant, but also cost-efficient. Our platform … on-call and incident management functions, supporting a high-throughput platform which processes more than 15 billion events per day. To ensure the reliability of this environment for our customers, SREs work closely with developers and product managers to understand service level objectives, think through failures scenarios, and design … systems which balance cost with reliability objectives. Additionally, SREs collaborate with the Information Security team to ensure that cloud infrastructure is properly secured, and that sufficient controls are in place to meet our compliance goals with respect to industry standards such as SOC 2. Role Responsibilities Write high-quality More ❯
Manchester, Lancashire, United Kingdom Hybrid / WFH Options
bet365 Group
A SiteReliabilityEngineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability … of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration … is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands More ❯
A SiteReliabilityEngineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability … of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration … is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands More ❯
Manchester, England, United Kingdom Hybrid / WFH Options
bet365 Group
A SiteReliabilityEngineer, who will enhance system reliability, observability and performance through a strong engineering approach and assist with incident resolution and best practices. You will have software engineering skills, focusing on system reliability and observability. You will monitor the health, performance and availability … of critical systems, directly impacting operational efficiency. Using your engineering expertise, you will implement solutions that enhance reliability, including service instrumentation with tools such as Open Telemetry, improve logging practices and develop features for maintainability. You will also help engineer tools and automation for effective service management. Collaboration … is key, working across multiple functions to integrate reliability and observability best practices into the software development life cycle. By supporting governance standards set by the central teams, you will foster a culture where these principles are integral to development. Your contributions will ensure our systems meet user demands More ❯
LinuxRecruit, specialising in Software Engineering - Golang, Python, Rust... Time to enhance your scope; broaden your horizon by delving into SiteReliability Engineering (SRE). You’ll take the skills you have picked up in software engineering and apply these to improve overall system and application performance and reliability. … free gym membership, 5 weeks holiday plus bank holidays and hefty bonus too. Seniority level Mid-Senior level Employment type Full-time Job function SiteReliabilityEngineer #J-18808-Ljbffr More ❯
re Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a SiteReliabilityEngineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred Experienced in production environments running … troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues Familiarity working in an Agile environment True understanding of SiteReliability Engineering Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS & ECS More ❯
re Looking For: Basic Required Qualifications: Bachelor's degree in Computer Science, Information Technology, or a related field. 5+ years of experience as a SiteReliabilityEngineer or equivalent in a similar role. Proficient in application and infrastructure observability, Splunk OpenTelemetry preferred Experienced in production environments running … troubleshooting and problem-solving skills with a knack for identifying and resolving complex technical issues Familiarity working in an Agile environment True understanding of SiteReliability Engineering Ability to build and maintain a system and culture that supports and implements SLOs. Familiar with Docker & Kubernetes, specifically EKS & ECS More ❯
Newcastle Upon Tyne, Tyne And Wear, United Kingdom
Sage City
Job Description We are looking for a SiteReliabilityEngineer to join our SRE Enablement team, a specialised function within Cloud Operations focused on building reusable infrastructure, automation, and tools that enable CloudOps and Engineering teams to operate more efficiently. You will have the opportunity to be … a key driver for SRE adoption within Sage, taking the helm in developing scalable frameworks to improve developer experience, remove toil and ultimately focus on embedding SRE best practices within the wider business. If you have experience working with Terraform and modern CI/CD workflows this could be the … also engage with broader teams to help implement these new approaches. You will have oversight of the entirety of Sage's product-suite and SRE teams as you work closely with them to build tools to make them more successful. Please note this is a hybrid role - you will be More ❯
Job Description We are looking for a SiteReliabilityEngineer to join our SRE Enablement team, a specialised function within Cloud Operations focused on building reusable infrastructure, automation, and tools that enable CloudOps and Engineering teams to operate more efficiently. You will have the opportunity to be … a key driver for SRE adoption within Sage, taking the helm in developing scalable frameworks to improve developer experience, remove toil and ultimately focus on embedding SRE best practices within the wider business. If you have experience working with Terraform and modern CI/CD workflows this could be the … also engage with broader teams to help implement these new approaches. You will have oversight of the entirety of Sage’s product-suite and SRE teams as you work closely with them to build tools to make them more successful. *** Please note this is a hybrid role – you will be More ❯
Job Description SiteReliabilityEngineer Exciting opportunity to join a growing technical leader, in a specialist technical capacity Hybrid based position (2 days a week on site) Salary up to £60,000 Central Manchester based client Based out of our revamped central Manchester office, you will … a wide range of technologies like Terraform, AWS/GCP, Splunk, New Relic, Grafana, Python, and Golang We need you to have Experience in SRE/DevOps focused positions An appreciation of the Software Delivery lifecycle A finger on the pulse for the latest technologies and trends To be Considered More ❯
and ensure Morrisons’ applications and infrastructure are resilient, efficient, and aligned with architectural goals. This is a key role for those passionate about advancing SRE practices at enterprise scale. Responsibilities Act as SME within their Domain teams for advice & guidance in terms of CI/CD, automation and product ways … of working and SRE/Engineering standards Drive the adoption of Engineering standards and Continuous Delivery principles within multiple domains The escalation point for SRE/Engineering ways of working Influence good practices and standards within SDLC throughout the business Influence partners Infrastructure best practices Implementation of least privilege approach … strategy and patterns Engineering Tooling, Patterns, Framework and Standards Proprietary code quality management inclusive of technical debt About you Knowledge In depth understanding of SRE/Engineering, Architecture and Testing practices In depth understanding of the principals of CI/CD within SRE/Engineering In depth understanding of Cloud More ❯
Job Description Lead SiteReliabilityEngineer Pay up to £89,995 plus 28.97% employer pension contributions, hybrid working, flexible hours, and a truly great work-life balance. Are you someone who has excellent stakeholder management and problem-solving skills? Do you like finding the root cause of More ❯
Leeds, Yorkshire, United Kingdom Hybrid / WFH Options
Leeds Building Society
How you'll help us live our purpose SiteReliability Engineering Lead DevOps Core Banking We've been helping our members save for their future and buy a home of their own since 1845. By joining us, you'll play a big role in helping us to put … It's a purpose that drives everything we do and one we're proud of. And you can play your part too as our SiteReliability Engineering Lead for Core Banking. This is a unique opportunity to be there from the beginning to shape and manage a team … revolutionise the way we deliver solutions to our customers and colleagues. How you'll make a difference We are currently looking to recruit a SiteReliability Engineering Lead for Core Banking who is passionate about leading operational excellence across our core banking cloud platforms at enterprise scale. Your More ❯
How you'll help us live our purpose SiteReliability Engineering Lead | DevOps | Core Banking We've been helping our members save for their future and buy a home of their own since 1845. By joining us, you'll play a big role in helping us to put … It's a purpose that drives everything we do and one we're proud of. And you can play your part too as our SiteReliability Engineering Lead for Core Banking. This is a unique opportunity to be there from the beginning to shape and manage a team … revolutionise the way we deliver solutions to our customers and colleagues. How you'll make a difference We are currently looking to recruit a SiteReliability Engineering Lead for Core Banking who is passionate about leading operational excellence across our core banking cloud platforms at enterprise scale. Your More ❯
Time to enhance your scope; broaden your horizon by delving into SiteReliability Engineering (SRE). You’ll take the skills you have picked up in software engineering and apply these to improve overall system and application performance and reliability. You’ll work on internal developer tooling, using More ❯
have moved to their own platform, which is a great time to join their business. As part of this growth, they are looking for SiteReliability Engineers to work closely with Product and Engineering teams to ensure that services and components meet all agreed performance-related targets alongside More ❯
You will work with Development and Product Management to design and deliver new functionality. You will perform deep dives into both systemic and latent reliability issues; partner with software engineers across the organization to produce and roll out fixes. You will drive standardization efforts across multiple disciplines and services … a solid understanding of continuous integration, deployment and operations concepts. You have production experience of managing Windows Infrastructure running IIS workloads. Passion for resolving reliability issues and identify strategies to mitigate going forward. Automation mindset - if you can automate it, do it. Fluency in English. What you'll gain More ❯