SiteReliabilityEngineer - SRE, Kubernetes, Observability, Prometheus, Dynatrace, OpenTelemetry Description for the SiteReliabilityEngineer role:- This is a fantastic opportunity with a consulting company, who are looking to fill multiple SRE roles to play a key role in managing their clients platforms. There … occasional client visits will be necessary for meetings but these will be fully funded. Experience required for the SiteReliabilityEngineer (SRE) roles:- 2yrs+ commercial experience in a Platform/DevOps/SRE role At least 6mths in an SiteReliabilityEngineer (SRE) role … to have lived in the UK for 5yrs+ to be able to obtain Security clearance Salary for the SiteReliabilityEngineer (SRE) roles will be negotiable dependent on experience but expect £55,000 basic + an excellent benefits package + Fantastic training Applicants need to be eligible more »
Manchester Area, United Kingdom Hybrid / WFH Options
Inara
Role: DevOps/SiteReliabilityEngineer Industry: Digital, SaaS, Compliance Location: Manchester …/Hybrid (on-site working at least 1 day a week) Are you an experienced DevOps/SiteReliabilityEngineer (SRE) looking to advance your career? We're looking for a DevOps Engineer to join an innovative tech company that's making a global impact. … with search, analysis, and reporting. Search & Download: Simple archive searches and downloads. What You'll Do: You will be a key part of the SRE team, focusing on maintaining and improving the reliability of our services. This role involves hands-on technical work, contributing to automation, and supporting various more »
A global consultancy planning to significantly expand its Digital teams in 2025 seeks a SiteReliabilityEngineer (SRE). The ideal candidate will have valid and transferable security clearance, although we may also consider applicants eligible for clearance. Permanent role Remote with Occasional client visits Up to … Applicants should only apply if they are willing and able to accommodate this requirement. What our client is looking for: Proven experience in implementing SRE principles is essential, and familiarity with Dynatrace, Prometheus, and OpenTelemetry would be advantageous. A solid grasp of the role of a SiteReliabilityEngineer (SRE) and the contribution to the team is essential. Knowledge of microservices and container orchestration is important, as is experience in facilitating continuous delivery pipelines. Understanding the construction and deployment of pipelines is crucial. Effective communication skills are necessary, along with the confidence to engage with developers more »
Cambridge, Cambridgeshire, United Kingdom Hybrid / WFH Options
Adecco
SiteReliabilityEngineer - support, SRE, degree, AWS, Location: Hybrid/Cambridge Salary: Competitive + Benefits About the Company: Our client is one of the UK's most innovative software houses, renowned for their cutting-edge advancements in artificial intelligence. With a series of prestigious awards, they continue … core of their success, fostering a collaborative environment where every suggestion and idea is valued. Job Opportunity: We have an impressive opening for a SiteReliabilityEngineer who is eager to join our award-winning software house in Cambridge. Ideal Candidate Profile: * Problem Solver: Enjoys operating in … CV will be treated in the strictest confidence and we would always speak to you before discussing your CV with any potential employer. Keywords: SRE, CAMBRIDGE, DEVOPS, AWS, AZURE, CI, CD, WEB, CLOUD, SECURITY, AUTOMATION more »
exciting opportunity to join a LEADING EdTech organisation as a Senior SiteReliabilityEngineer! Senior SiteReliabilityEngineer (SRE) The Company... A leader in the EdTech space, offering a vast set of products used by schools in more than 70 countries! What are the … system backups, and foster a culture of continuous improvement and professional development within the team. What skills are they looking for? Experience within a SRE position Windows Server AWS (particularly CDK - Cloud Development KIT) TypeScript/Python (Web-application maintenance) Terraform experience would be ideal! What's in it for more »
and delightful experiences for every user by maintaining highly available, performant, and observable cloud infrastructure. You will collaborate across Development, DevOps, InfoSec, QA, and SRE teams to continuously improve system reliability, deployment strategies, and alerting infrastructure. Key duties & responsibilities: Maintain and enhance monitoring, logging, and alerting systems to proactively … detect and resolve potential issues across our digital channels Collaborate with Development, Platform/DevOps, InfoSec, QA, and SRE teams, as well as with Technical Architects and the Digital Ops Manager, to ensure reliability and observability of infrastructure and applications. Optimise deployment strategies and streamline recovery processes to support … to incident response strategies, including detection, communication, and swift recovery processes, without on-call obligations outside of office hours Essential Skills: Can articulate core SRE principles (e.g. Golden Signals, SLIs and SLOs, SRE metrics, release engineering, blameless retrospective, process capability) and apply them in practice Excellent log analysis and incident more »
Clapham, England, United Kingdom Hybrid / WFH Options
The Gym Group
and delightful experiences for every user by maintaining highly available, performant, and observable cloud infrastructure. You will collaborate across Development, DevOps, InfoSec, QA, and SRE teams to continuously improve system reliability, deployment strategies, and alerting infrastructure. Key duties & responsibilities: Maintain and enhance monitoring, logging, and alerting systems to proactively … detect and resolve potential issues across our digital channels Collaborate with Development, Platform/DevOps, InfoSec, QA, and SRE teams, as well as with Technical Architects and the Digital Ops Manager, to ensure reliability and observability of infrastructure and applications. Optimise deployment strategies and streamline recovery processes to support … to incident response strategies, including detection, communication, and swift recovery processes, without on-call obligations outside of office hours Essential Skills: Can articulate core SRE principles (e.g. Golden Signals, SLIs and SLOs, SRE metrics, release engineering, blameless retrospective, process capability) and apply them in practice Excellent log analysis and incident more »
Baltimore, Maryland, United States Hybrid / WFH Options
Fearless
What you'll be doing We're looking to change the world by building software with a soul, and we want your help. The SiteReliabilityEngineer III leads the design and implementation of reliable infrastructure solutions that solve customer and user problems. They enable efficient delivery … recent) years as a DevOps Engineer/SRE. Programming language experience with Java, Python, and Bash Prior experience leading a small team of SRE/DevOps Engineers (served as a Tech Lead or Project Lead). Strong demonstrated working experience managing Kubernetes Clusters using EKS within an AWS environment. … have deep AWS cloud experience in a production environment (e.g. network, security, deployment, automation, server-less technologies). Shall have experience and understanding in SRE principles for highly scalable and reliable systems. Shall have strong experience with Configuration Management and Infrastructure as Code. Experience with core infrastructure capabilities: operating systems more »
San Antonio, Texas, United States Hybrid / WFH Options
Fearless
What you'll be doing We're looking to change the world by building software with a soul, and we want your help. The SiteReliabilityEngineer III leads the design and implementation of reliable infrastructure solutions that solve customer and user problems. They enable efficient delivery … through efficient architectures, infrastructure, and pipelines. They bring expertise in digesting complex tasks and business requirements and aligning a group around an implementation. The SiteReliabilityEngineer III also supports operations of the production environments including observability and troubleshooting issues (sometimes outside of normal business hours if … mandated by the contract). We need your SiteReliability skills! What other skills will help you succeed at Fearless? Glad you asked! We're excited about candidates who can accomplish the following: Responsibilities and Contributions Organizational and Leadership Role Synthesizes business requirements and objectives and drives the more »
City of London, London, United Kingdom Hybrid / WFH Options
Client Server
Senior SREEngineer/Technical Lead London/WFH to £130k Are you an SRE technologist seeking a role where you can make the technology choices, influence strategy and remain hands-on? You could be progressing your career in an impactful role at a global CFD trading company that … has been consistently voted as one of the UKs top employers. As a Senior SREEngineer/Technical Lead you will focus on improving and raising the bar for SRE operations across the firm. You establish SLOs, leveraging public cloud, containerisation, reliability testing and observability, liaising with business … stakeholders to establish the product roadmap and providing technical leadership to a small team of experienced SRE Engineers. Location/WFH: There's a hybrid model with two days a week work from home, when you are in the office you'll be based in the City with an upbeat more »
Do you want to shape the future of software delivery in the financial services industry? Kosli is looking for a SiteReliabilityEngineer to join our growing team. As part of a fast-paced startup, this role is about building and maintaining a large scale data and … us. We are funded by leading VC investors such as Heavybit (investors in Snyk, LaunchDarkly, CircleCI, Netlify, Tailscale). About the Role As a SiteReliabilityEngineer, you will be responsible for building and maintaining Kosli's infrastructure, ensuring our platform's reliability, and developing solutions … track record of improving system reliability and deployment processes Familiarity with Python, Go and shell scripting Passion for quality infrastructure code and modern SRE practices Strong problem-solving abilities and attention to detail Clear communication skills and ability to collaborate in a distributed team Enthusiasm for being an early more »
Washington, Washington DC, United States Hybrid / WFH Options
Technica Corporation
business process knowledge as a trusted advisor in support of our Department of Defense and other Federal Agency customers. Responsibilities Technica is seeking a SiteReliabilityEngineer is responsible for both the operations and maintenance of the Atlassian Developer Tools in support of our developer customers, and … the service in our organizational structure as it supports our customers in utilizing but not limited to the following: Bamboo, Bitbucket, Crucible, & Fisheye. The SiteReliabilityEngineer conducts technical project milestone reviews, codes architecture sessions, provides resource estimation, and utilizes development best practices. He/she will … requests, and other formal documentation Ensures that software deployments minimally impact production workloads running in production environments This is a hybrid opportunity, 50% on-site at Patriots Plaza II location in Washington, D.C. and 50% remote. The government has established policies for on-site vs remote work which more »
London, England, United Kingdom Hybrid / WFH Options
Arrows
SiteReliabilityEngineer - SRE - Azure - Kubernetes - Terraform - Cyber Security - Paying up to £570 (Inside IR35) - Remote - Immediate Starters My client is passionate about the utilisation of modern DevOps practices, agile support and deployment processes. You will be a leading member of our team working with a diverse more »
Manchester Area, United Kingdom Hybrid / WFH Options
bet365
Who we are looking for A SiteReliabilityEngineer who will develop software solutions, consult with development teams and work with modern telemetry data to maintain and improve the performance of key systems. The sitereliability team provide an increasingly important service to our technology … department. Focusing on application performance, reliability, availability, capacity and health, you will work with other teams across the platform department to help ensure our critical systems are reliable and observable. You will be working to provide solutions to help minimise toil and provide operational efficiency at scale on our … critical systems. This role is eligible for inclusion in the Company’s hybrid working from home policy. Preferred skills and experience Excellent knowledge of SRE principles, including the creation and management of effective SLI’s and SLO’s for reliability and customer satisfaction. Knowledge of contemporary observability tools, techniques more »
City of London, London, United Kingdom Hybrid / WFH Options
Stealth IT Consulting Limited
SiteReliabilityEngineer Global consultancy Up to £55k + benefits Hybrid remote … out of either Manchester, London or Glasgow A global consultancy with extensive plans to expand their Digital teams throughout 2025 are looking for a SRE to join the team. We are ideally looking for a security cleared SRE (Valid and transferable clearance) but we can consider candidates eligible for clearance. … only apply if you are comfortable and are able to support this. The ideal candidate will have the following key skills: Proven experience implementing SRE principles- Dynatrace, Prometheus and OpenTelemetry would be a bonus Strong understanding of what a SRE does and the part you play in the team An more »
or email dave.henderson@searchability.com Who are we … Our client are a bleeding edge software house, specialising in HR software among other things. Their SRE function is a key part of their development operations and you will be the pivot between a number of teams including development and Infrastructure alongside project … be doing?... You will be focusing on maintaining and developing all aspects of the clients technical estate, having an existing working background within SRE and Linux administration before that. Part of your day to day duties will include creating infrastructure solutions and automating the findings, Improving CI/CD … submit (subject to required skills) your application to our client in conjunction with this vacancy only. KEY SKILLS – AWS/Terraform/Golang/SRE/MySQL more »
Senior SiteReliabilityEngineer We are working with an exciting business on a mission to put themselves at the forefront of digital marketplace consumer services. To rival the corporate, and to put the little people - the merchant and the consumer - at the forefront of business success. Having … accumulated many accolades from the tech and marketplace industries, the company is ready to scale. A crucial role in streamlining software delivery pipelines, enhancing reliability, performance and scalability of systems, and driving continuous improvement across the software lifecycle. There are no passengers in this engineering team, everyone contributes and … up your sleeves and get stuck in! Senior level technical capability with strong problem solving skills. Proven experience with GCP as either a DevOps Engineer or SRE. A good depth of experience using Terraform and Ansible. Strong proficiency in programming languages such as Python, Go, or Java. Experience with more »
Senior SiteReliabilityEngineer We are working with an exciting business on a mission to put themselves at the forefront of digital marketplace consumer services. To rival the corporate, and to put the little people - the merchant and the consumer - at the forefront of business success. Having … accumulated many accolades from the tech and marketplace industries, the company is ready to scale. A crucial role in streamlining software delivery pipelines, enhancing reliability, performance and scalability of systems, and driving continuous improvement across the software lifecycle. There are no passengers in this engineering team, everyone contributes and … up your sleeves and get stuck in! Senior level technical capability with strong problem solving skills. Proven experience with GCP as either a DevOps Engineer or SRE. A good depth of experience using Terraform and Ansible. Strong proficiency in programming languages such as Python, Go, or Java. Experience with more »
Honolulu, Hawaii, United States Hybrid / WFH Options
OMW Consulting
Role - SiteReliabilityEngineer Location - Honolulu - Hybrid - 1-2 days a week on site Security clearance - Minimum Secret - need this … ahead of applying Salary - $150k-$200k + Equity I am partnered with a leading defense tech scale up who are looking to add an SRE to their team based in Hawaii. This role is hybrid with an expectation of 1-2 days on site in Honolulu, however there is … some weeks where you will not need to go on site at all. Due to the nature of the client you must hold an active secret clearance as a minimum ahead of applying for this position. To be considered for this position you must have experience with the following more »
San Antonio, Texas, United States Hybrid / WFH Options
General Dynamics Information Technology
impact by advancing the Department of Defense's mission to keep our country safe and secure. Job Description GDIT is in need of a SiteReliabilityEngineer in the San Antonio, TX area supporting the Air Force Black Label Day 1 Day 2 (D1D2) program supporting the … other technical degree Required Experience: 10+ years of overall related experience. At least three years experience in the following areas: Site Reliabilty Engineering (SRE) principles, tools, and automation Service Level Objectives (SLO's) & Error Budgets Reducing Toil Monitoring and Service Level Indicators (SLIs) Anti-fragility, performance management, and incident … SCI eligibility required at start Certification: DoD 8570.01-M, IASAE Level II or higher (i.e. CompTIA CASP or higher) Location: Hybrid (Remote and Client Site as required) US Citizenship Required GDIT IS YOUR PLACE: 401K with company match Comprehensive health and wellness packages Internal mobility team dedicated to helping more »
help you grow. We're never one-size-fits-all. Our careers are as unique as you are. We are looking for a Lead SiteReliabilityEngineer to be responsible for ensuring the reliability, availability, operational excellence and performance of our systems and services. You will … You will be accountable for: Leadership and Team Management: Leading and mentoring a team of SREs, providing guidance and support. Fostering a culture of reliability and continuous improvement within the team. Ensuring the team have the right skills and training to be able to deliver the services and technologies … required. System Reliability and Performance: Designing, implementing, and maintaining scalable, secure and reliable systems. Monitoring system performance, identify bottlenecks, and optimize for efficiency. Developing and maintaining automation tools to improve system reliability, delivery processes and reduce manual intervention. Supporting shared services that are used in development, testing and more »
Leeds, West Yorkshire, Yorkshire, United Kingdom Hybrid / WFH Options
Evri
help you grow. We're never one-size-fits-all. Our careers are as unique as you are. We are looking for a Lead SiteReliabilityEngineer to be responsible for ensuring the reliability, availability, operational excellence and performance of our systems and services. You will … You will be accountable for: Leadership and Team Management: Leading and mentoring a team of SREs, providing guidance and support. Fostering a culture of reliability and continuous improvement within the team. Ensuring the team have the right skills and training to be able to deliver the services and technologies … required. System Reliability and Performance: Designing, implementing, and maintaining scalable, secure and reliable systems. Monitoring system performance, identify bottlenecks, and optimize for efficiency. Developing and maintaining automation tools to improve system reliability, delivery processes and reduce manual intervention. Supporting shared services that are used in development, testing and more »
Morley, England, United Kingdom Hybrid / WFH Options
Evri
help you grow. We're never one-size-fits-all. Our careers are as unique as you are. We are looking for a Lead SiteReliabilityEngineer to be responsible for ensuring the reliability, availability, operational excellence and performance of our systems and services. You will … You will be accountable for: Leadership and Team Management: Leading and mentoring a team of SREs, providing guidance and support. Fostering a culture of reliability and continuous improvement within the team. Ensuring the team have the right skills and training to be able to deliver the services and technologies … required. System Reliability and Performance: Designing, implementing, and maintaining scalable, secure and reliable systems. Monitoring system performance, identify bottlenecks, and optimize for efficiency. Developing and maintaining automation tools to improve system reliability, delivery processes and reduce manual intervention. Supporting shared services that are used in development, testing and more »
Pittsburgh, Pennsylvania, United States Hybrid / WFH Options
General Dynamics Mission Systems
work performed within our facilities, U.S. citizenship is required. Responsibilities for this Position ROLE AND POSITION OBJECTIVES: Relocation package available. As a Senior Principal SiteReliabilityEngineer for GDMS's Space and Intelligence Systems line of business, you'll be a member of a cross functional team … responsible for maintaining survivability and reliability of mission critical resources. We encourage you to apply if you have any of these preferred skills or experiences: Ensuring Uptime of Critical Systems Automating Systems Administration Activities Configuring, Monitoring, and Troubleshooting Enterprise Services Experience administrating Linux systems What sets you apart: Creative … programs, employee resource and social groups, and more See more at Workplace Options: This position allows flex time (mixed work from home and on-site). Please note that this position is not 100% remote. While on-site, you will be a part of the Pittsburgh Office Target more »
Title : Platform Observability Engineer (junior & senior levels being considered) Client : Quant Fund – Global collaborative firm run by passionate Computer Scientists Salary : Up to £150,000 Starting Base+ Exceptional bonus/benefits package Location : London, Liverpool Street (Hybrid/Remote) This firm is a dynamic global quant fund that is more »