SiteReliabilityEngineer - SRE One of our biggest customers based in the Financial Services sector is looking for an experienced SiteReliabilityEngineer - SRE to join them as they look to create a newly appointed team. SiteReliabilityEngineer: We have … an exciting brand-new opportunity to join a dynamic IT Team as a SiteReliability Engineer. We are looking for an expert in this field who has extensive experience and knowledge in managing APM tools such … as Dynatrace and has demonstrable experience (at least 3 years) as a SiteReliability Engineer. The SiteReliabilityEngineer (SRE) will take ownership of the observability suite, leveraging deep DevOps skills and experience to proactively enhance the performance and stability of APIs and applications. This more »
SRE/SiteReliabilityEngineer (Azure PagerDuty DataDog) Reigate/WFH to £85k Do you have expertise with SRE within an Azure cloud environment? You could be progressing your career in an impactful role at a global FinTech. As an SRE/SiteReliabilityEngineer … week for team meet-ups and stakeholder meetings with the other three days work from home. About you: You have experience in a similar SRE/SiteReliabilityEngineer position You have experience of running 24x7 services in the public cloud - Azure You have experience with observability … keen to take ownership of projects and happy to collaborate with senior stakeholders and mentor others What's in it for you: As a SRE/SiteReliabilityEngineer you will receive a competitive salary plus a range of perks and benefits: Up to £95k salary plus more »
City of London, London, United Kingdom Hybrid / WFH Options
Tec Partners
Job Title: SiteReliabilityEngineer (Software Dev Background) Type: Permanent Location: Fully remote Salary: £55-65K Our client are growing their team and are looking for a SiteReliabilityEngineer - (ideally from a software development/software engineering background) to contribute to the … our cloud infrastructure, help with the definition of best practices for infrastructure management and to support development processes with CI/CD. As a SiteReliabilityEngineer, you will play a crucial role in ensuring the reliability, scalability, and security of their systems. Responsibilities: Design, build … functional teams to identify opportunities for automation, streamlining workflows, and improving efficiency. Stay up-to-date with the latest trends and technologies in the SRE/DevOps space, making recommendations for adoption based on industry best practices. Requirements: Proven experience as a SiteReliabilityEngineerSRE or more »
London (city), London, England Hybrid / WFH Options
T Rowe Price
invite you to explore the opportunity to join us and grow your career with us. Job Title: Principal SiteReliabilityEngineer (SRE) Department: CDO Technology Group Summary: We are seeking a highly motivated and experienced Principal SiteReliabilityEngineer (SRE) to join the CDO … Technology leadership team to stand up and lead the SRE function within CDO Technology. In this role, you will be responsible for ensuring the availability, latency, performance, efficiency, and stability of our critical infrastructure, which supports a range of data platforms, applications, and services. You will collaborate closely with development … infrastructure, and anticipate significant risks. Work with development teams to review architecture design to ensure high availability and proper disaster recovery strategy Collaborate with reliability and infrastructure engineering team in T Rowe Price to build synergy in tooling for the implementation of observability, tracing, and alerting Qualifications: Bachelor's more »
Winchester, Hampshire, United Kingdom Hybrid / WFH Options
Context Recruitment
SiteReliabilityEngineer/DevOps Engineer (Azure) Opportunity to join one of the top UK Insurers who are on a mission to become the leading 'digital first' insurer in the UK. As a SiteReliabilityEngineer, you will be the backbone of their … Azure environment, ensuring it's **scalability, reliability, and operational excellence**. You will work closely with cross-functional teams to build and maintain a robust infrastructure that supports their dynamic needs. Key Responsibilities: Assume responsibility for the observability suite, encompassing tools for monitoring, logging, and alerting, to guarantee a … on-call schedule, offering support for resolving incidents and conducting necessary troubleshooting. Qualifications : Experience in a DevOps/SiteReliabilityEngineer ( SRE ) position, dedicated to ensuring the high availability, reliability, and scalability of live systems. Proficient in observability tools like Prometheus, ELK stack, Grafana, and Azure more »
Cloud Infrastructure SiteReliabilityEngineer (SRE) £55,000 - £65,000 Fully remote Due to the nature of the position candidates must be eligible and willing to undergo Security Clearance My client are a household name and global organisation who deliver innovative, digitally enabled solutions to transform, simplify … and support their customers. They are recruiting for a Site … reliabilityengineer to support their customers using their public cloud infrastructure. Job Description: The Cloud Infrastructure SiteReliabilityEngineer (SRE) supports the public cloud infrastructure used to deliver public cloud hosted managed services to customers. We will have a high customer focus being actively involved more »
Cloud Infrastructure SiteReliabilityEngineer (SRE) £55,000 - £65,000 Fully remote Due to the nature of the position candidates must be eligible and willing to undergo Security Clearance My client are a household name and global organisation who deliver innovative, digitally enabled solutions to transform, simplify … and support their customers. They are recruiting for a Site … reliabilityengineer to support their customers using their public cloud infrastructure. Job Description: The Cloud Infrastructure SiteReliabilityEngineer (SRE) supports the public cloud infrastructure used to deliver public cloud hosted managed services to customers. We will have a high customer focus being actively involved more »
About the role: Loftware is expanding its worldwide 24x7 Cloud Operations Team and we are looking for a technically motivated English speaking Cloud Operations SiteReliabilityEngineer with a strong cloud-based Linux and Windows knowledge. The Cloud Operations SiteReliabilityEngineer will be … troubleshooting customer environments for mission-critical application use across the range of cloud platforms used by Loftware, including AWS and Azure. The Cloud Operations SiteReliabilityEngineer is someone that is a team player with the desire and passion for modern technology and keen to take on … large-scale responsibility for the cloud environment. The Cloud Operations SiteReliabilityEngineer will work with the rest of the Cloud Operations team and alongside QA and Development to continually improve automated infrastructure and application deployment, to build and maintain reliable cloud infrastructure and services and to more »
SiteReliabilityEngineer London (Hybrid 2 days a week on site) Permanent £75,000 - £85,000 p/a The Background We are partnered with an innovative IT consultancy based in London but with a global presence who are leading advisors in their industry by creating … lasting value for their clients. Due to growth within the business they are looking for a highly skilled Systems Engineer to join their Corporate IT Team and focus on the Applications side of their IT offering. This is an exciting opportunity for someone with a passion for technology to … flexible benefits fund. You… In order to be a successful SiteReliabilityEngineer you will have… Previous experience working as an SRE/at system administrator level In-depth knowledge of Windows Operating Systems and VMware with a good understanding of Linux Operating Systems In depth knowledge more »
Chester, Cheshire, North West, United Kingdom Hybrid / WFH Options
Searchability (UK) Ltd
SiteReliabilityEngineer Role Description: An opportunity for an experienced sitereliabilityengineer to work for a globally recognised company in the heart of Chester on a hybrid working basis has arisen. You will join a team who are responsible for building a suite … illness Use of a flex fund to use towards benefits Wellbeing helpline, mental health first aiders and virtual GP service Main Responsibilities of a SiteReliabilityEngineer: Maintain and enhance network monitoring, orchestration, and automation solutions, encompassing tasks such as inventory reconciliation, workflow automation, network configuration validation more »
Lead SiteReliabilityEngineer Leeds - once a month in the office on average £80,000-£90,000 + benefits A leading global organisation are seeking a Lead SiteReliabilityEngineer to play a pivotal role in the development, implementation … and ongoing maintenance of its core Infrastructure and Cloud-based platforms. This role encompasses diverse responsibilities, including leading and managing a small DevOps/SRE team. The Lead SiteReliabilityEngineer will lead the charge in selecting, configuring, and supporting Cloud Platform components and tooling. Proficiency in more »
SiteReliabilityEngineer … Global Quantitative Investment Management Permanent/Contract - London, UK - Competitive We are seeking a highly skilled and motivated SiteReliabilityEngineer (SRE) to join a leading quantitative research and technology firm specializing in leveraging innovative data science and cutting-edge technology to deliver unparalleled insights and solutions. … You will be working at the intersection of technology and finance ensuring the reliability, availability, performance, and cost-efficiency of their critical systems and infrastructure. You will work closely with development, operations, and research teams to build and maintain robust, scalable systems using AWS, Terraform, Ansible, and Kubernetes. Key more »
London, England, United Kingdom Hybrid / WFH Options
Bayside Solutions
SiteReliabilityEngineer Contract Salary Range: £91,400 - £108,000 per year Location: London, England - Hybrid Role Job Summary: We seek a SiteReliabilityEngineer to join our team and play a crucial role in ensuring our applications and services' reliability, availability, and … Willingness to adapt and learn new tools and technologies as needed Availability to participate in on-call rotations as required Desired Skills and Experience SiteReliability, Java, AWS, Azure, Kubernetes, GIT, CD Bayside Solutions, Inc. may collect your personal information during the position application process. Please reference Bayside more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »
architecture. Key Responsibilities : Design and manage Java based microservices, bash scripts, Redis, High-Availability design, while strictly adhering to SiteReliability Engineering (SRE) principles. Thrive in high-pressure environments, working swiftly and reliably to maintain system integrity and meet service level objectives (SLOs) and service level indicators (SLIs … Lead initiatives to enhance current systems and implement innovative solutions in collaboration with a fast-paced, mission-driven team, focusing on the implementation of SRE best practices. Conduct thorough root-cause analyses for production incidents and generate high-quality RCA reports, leveraging SRE methodologies to prevent recurrence. Apply software engineering … principles to rectify operational challenges and optimize system performance, with a specific focus on implementing SRE-driven solutions. Ensure the availability, latency, performance, efficiency, and security of our infrastructure, adhering rigorously to SRE principles and best practices. Design and maintain robust production monitoring systems to ensure timely detection and resolution more »