businesses to master market measurement, understand consumer behavior, and drive innovation. Job Description Key Responsibilities: Provide senior-level leadership and technical guidance to the SiteReliabilityEngineering team. Develop and execute a comprehensive technical strategy for ensuring high availability, scalability, and performance across GfK’s platforms, with … a focus on the next-generation platform, GFKnewron. Technical Expertise and Problem-Solving: Serve as the principal technical expert in all aspects of cloud engineering (especially Google Cloud Platform - GCP), container orchestration (Kubernetes), Infrastructure as Code (Terraform), GitOps practices, monitoring (Prometheus, Grafana), and CI/CD pipelines (GitLab). … Be the go-to person for resolving complex technical challenges and providing innovative solutions to ensure system reliability and performance. Team Leadership: Oversee and mentor team members, fostering a culture of continuous learning and improvement. Guide the implementation of best practices in sitereliabilityengineering, ensuring More ❯
SiteReliability Engineer - SRE Consultant Akkodis are currently working in partnership with a leading service provider to recruit an experienced SiteReliability Engineer with experience in ensuring reliability, scalability and efficiency of client platforms. Please note this is a fully remote role with travel to … you must be eligible to gain security clearance (do not need to hold currently). The Role As a SiteReliability Engineer (SRE) you will lead sitereliabilityengineering initiatives with a strong emphasis on observability, ensuring high performance and reliability of applications & infrastructure. … Provide strategic insights to shape the overall SRE strategy while collaborating on the design and implementation of scalable and reliable solutions. Establish effective monitoring, alerting and incident response strategies to maintain system availability and promote continuous improvement by collaborating with team members to deliver observability best practices and SRE methodologies. More ❯
City of London, London, United Kingdom Hybrid / WFH Options
Akkodis
SiteReliability Engineer - SRE Consultant Akkodis are currently working in partnership with a leading service provider to recruit an experienced SiteReliability Engineer with experience in ensuring reliability, scalability and efficiency of client platforms. Please note this is a fully remote role with travel to … you must be eligible to gain security clearance (do not need to hold currently). The Role As a SiteReliability Engineer (SRE) you will lead sitereliabilityengineering initiatives with a strong emphasis on observability, ensuring high performance and reliability of applications & infrastructure. … Provide strategic insights to shape the overall SRE strategy while collaborating on the design and implementation of scalable and reliable solutions. Establish effective monitoring, alerting and incident response strategies to maintain system availability and promote continuous improvement by collaborating with team members to deliver observability best practices and SRE methodologies. More ❯
to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. SiteReliabilityEngineering/Application Support (SRE/AS) is a continuous engineering discipline that effectively combines software development and systems engineering to build and run scalable, distributed, fault-tolerant … keeping an ever-watchful eye, automated, on capacity and performance. How will you make an impact in this role? This role will drive the SRE/AS mindset which strives to use software engineering to build and run better production systems. You will write software to optimize day to … Express digital assets and influence how millions of people interact with their cards, their merchants, and their money. The Senior Service Assurance Engineer II (SRE/AS Engineer) role is a hands-on Senior Architect Level position supporting American Express SiteReliabilityEngineering/Application Support team. More ❯
to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. SiteReliabilityEngineering/Application Support (SRE/AS) is a continuous engineering discipline that effectively combines software development and systems engineering to build and run scalable, distributed, fault-tolerant … keeping an ever-watchful eye, automated, on capacity and performance. How will you make an impact in this role? This role will drive the SRE/AS mindset which strives to use software engineering to build and run better production systems. You will write software to optimize day to … Express digital assets and influence how millions of people interact with their cards, their merchants, and their money. The Senior Service Assurance Engineer II (SRE/AS Engineer) role is a hands-on Senior Architect Level position supporting American Express SiteReliabilityEngineering/Application Support team. More ❯
truly innovative and impactful. Learn more about Stacklok’s mission, virtues, and leadership, HERE . Location This is a hybrid role that requires on-site work at our London office three (3) days a week. Our office is conveniently located in WeWork at 1 Mark Square, London, EC2A 4EG. … effectively manage and maintain a robust security posture across the entire software supply chain. We are seeking a Senior SiteReliability Engineer (SRE) to support Stacklok Insight, our package intelligence service that empowers developers to make safer open source dependency choices. Embedded within the Stacklok Insight product team … exceptional service performance and reliability. In addition, this role will be part of a company-wide guild dedicated to unifying platform automation, observability, and reliability practices across all product lines, building a cohesive, high-performance SaaS platform with seamless observability and reliability throughout the Stacklok ecosystem. If siteMore ❯
Preferred Qualifications: Master's degree in Computer Science or Engineering, or a related field. About the Job SiteReliabilityEngineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding … algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame More ❯
Preferred Qualifications: Master's degree in Computer Science or Engineering, or a related field. About the Job SiteReliabilityEngineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding … algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a More ❯
work, or Open Source projects. Preferred Qualifications: Master's degree in Computer Science or Engineering. About the Job SiteReliabilityEngineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding … algorithms, complexity analysis, and large-scale system design. SRE's culture of intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame More ❯
automate routine tasks. Systematic problem-solving approach, coupled with effective verbal and written communication skills. About the Job SiteReliabilityEngineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both … our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale unique to Google Cloud, while using your expertise in coding, algorithms, complexity … analysis, and large-scale system design. SRE's culture of intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame-free environment. More ❯
SiteReliabilityEngineering - QA Tester About: Step forward into the future of technology with ZILO. We're here to redefine what's possible in technology. While we're trusted by the global Transfer Agency sector, our technology is truly flexible and designed to transform any business at … scale. We've created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can't match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore new ideas with a curious mind, and set … our progress and creates real impact. If you're ready to shape the future, let's talk. Requirements As a QA Tester within our SiteReliability Team, you will play a crucial role in maintaining and enhancing the quality and reliability of our SaaS platform. You will More ❯
Social network you want to login/join with: SiteReliabilityEngineering - QA Tester, London Client: ZILO Location: London, United Kingdom Job Category: Other EU work permit required: Yes Job Reference: 3d67b994eba8 Job Views: 3 Posted: 30.03.2025 Expiry Date: 14.05.2025 Job Description: About: Step forward into the … flexible and designed to transform any business at scale. We’ve created a unified platform that adapts to diverse needs, offering the scalability and reliability legacy systems simply can’t match. At ZILO, our DNA is built on Character, Creativity, and Craftsmanship. We face every challenge with integrity, explore … our progress and creates real impact. If you’re ready to shape the future, let’s talk. Requirements: As a QA Tester within our SiteReliability Team, you will play a crucial role in maintaining and enhancing the quality and reliability of our SaaS platform. You will More ❯
small business, we’ll be building a stronger economy that can change the world. About the team In SiteReliabilityEngineering (SRE), we drive and influence Xero to provide the most reliable experience for our customers. We are a global team based across New Zealand, Australia and … ownership. About the role The Technical Product Manager for Observability plays a crucial role in shaping the organization's observability strategy, collaborating closely with SRE and product engineering teams to define a comprehensive vision, translating technical challenges into actionable product initiatives. This role will work with various engineering … feedback and operational experiences. Our ideal candidate will bring a balanced mix of technical knowledge and product management expertise. You'll be passionate about SRE & DevOps, with strong communication skills, and empathy for engineering teams and the challenges they face. What you'll do : Foster a culture of continuous More ❯
SiteReliability Engineer (Observability) London- Hybrid/3 Days Contract Inside IR35- 6 Months initially We’re looking for a SiteReliability Engineer (SRE) to join our client to build and maintain observability systems and to ensure their core services remain reliable, scalable, and high-performing. Responsibilities: Deploy and … incident response. Build Grafana dashboards for system insights. Apply Infrastructure as Code (IaC) principles. Develop tooling in Golang (preferred) or Python . Advocate for SRE principles like SLOs, SLIs, and error budgets. Integrate monitoring with incident management workflows. Requirements: SRE principles and reliabilityengineering expertise. Solid familiarity with More ❯
SiteReliability Engineer (Observability) London- Hybrid/3 Days Contract Inside IR35- 6 Months initially We’re looking for a SiteReliability Engineer (SRE) to join our client to build and maintain observability systems and to ensure their core services remain reliable, scalable, and high-performing. Responsibilities: Deploy and … incident response. Build Grafana dashboards for system insights. Apply Infrastructure as Code (IaC) principles. Develop tooling in Golang (preferred) or Python . Advocate for SRE principles like SLOs, SLIs, and error budgets. Integrate monitoring with incident management workflows. Requirements: SRE principles and reliabilityengineering expertise. Solid familiarity with More ❯
is used in context with load balancing, in order to optimize user experience. 1 day CookieNameProviderPurposeMaximum Storage DurationTypeNameProviderPurposeMaximum Storage DurationType London - United Kingdom **Senior SiteReliability Engineer | UK****Overview****Job description**IT technology lies at the very core of everything we do and our Engineering and Product … is, so we’ll go the extra mile to help you when we can. We are seeking an experienced SiteReliability Engineer (SRE) to join our Infrastructure team. As an SRE, your primary responsibility will be to ensure the reliability, availability, and performance of our technology platforms. … best security practices, participating in vulnerability assessments, and threat mitigation.Requirements: - Deep understanding and experience in SiteReliabilityEngineering and in implementing SRE Practices- Excellent knowledge of AWS services and hands-on experience in production environments- Proficiency with networking protocols, DNS principles, and container orchestration technologies (Kubernetes, Helm More ❯
for a Head of SiteReliabilityEngineering to join our team to help us transform our existing operational workloads to an SRE approach. Key Responsibilities Establishing and managing our new SRE function Operating and modernising our existing cloud infrastructure Partnering with our DevOps team to ensure fast … levels Acting as a key Incident Commander and escalation point Liaising closely with our SecOps teams to ensure timely vulnerability management Educating teams in SRE practices and maintaining high standards of compliance Implementing world-class observability standards utilising SLI/SLO/Error Budgets Continually evolving our observability platforms for … greater coverage Liaising with Product & Engineering teams for constant evolution of metrics Aligning SRE Sprints & Backlog with our roadmaps to meet business expectations Guiding our teams in a more Agile approach to demand management Actively taking part in our daily stand-ups and keeping our Sprints on track Keeping More ❯
on your skills and experience — talk with your recruiter to learn more. Base Pay Range My financial services client is looking for a Lead SiteReliability Engineer who will be responsible for ensuring the reliability and scalability of their infrastructure and services. This is a senior role …/mentoring experience and be able to balance technical delivery, team productivity, performance measurement, and collaboration across teams and stakeholders. Duties & Responsibilities: Hands-On Engineering & Technical Leadership Design, develop, and maintain cloud infrastructure (Azure/AWS) using Terraform and automation. Lead troubleshooting, performance optimisation, and incident resolution to enhance … security requirements, such as ISO 27001, PCI DSS, CE+ and SOX, with experience implementing compliance-driven engineering practices. Advocate for modern DevOps and SRE best practices, championing collaboration, transparency, automation, continuous learning, and continuous improvement across teams. Excellent communication skills, able to engage stakeholders, collaborate cross-functionally, and drive More ❯
Howden. Job Purpose: The Operations Manager will be responsible for overseeing operational support teams and driving the development of Service ReliabilityEngineering (SRE) practices. This role will be pivotal in ensuring the reliability, availability, and performance of our data-driven services while fostering a culture of continuous … support professionals, fostering an environment of collaboration, accountability, and professional growth. Develop and implement strategies to enhance team performance and customer service delivery. Service ReliabilityEngineering Build and establish a comprehensive Service ReliabilityEngineering framework that aligns with the company's business objectives and enhances the … reliability of our data services. Collaborate closely with platform engineering teams to ensure that the architecture, design, and deployment of data platforms facilitate high availability, scalability, and resilience. Implement best practices for incident management, change management, and problem resolution to minimize service disruptions and enhance user experience. Continuous More ❯
SiteReliability … Engineer - Equity Trading Technology The Equity Trading Technology organization is seeking a skilled Software Engineer to join the SiteReliabilityEngineering (SRE) Team in our London office. This role involves designing, developing, and optimizing systems to enhance the reliability, scalability, and observability of trading applications. The … improve productivity and reduce operational toil. Stay current with industry trends: Continuously monitor and evaluate emerging trends, technologies and best practices within SDLC and SRE related fields. Identify and introduce the most relevant changes to improve development process, tools and methodologies. Qualifications and Skills 5+ years of professional software development More ❯
Social network you want to login/join with: SiteReliability Engineer with Python, London Client: Nexus Location: London, United Kingdom Job Category: Other EU work permit required: Yes Job Reference: 34191969ba68 Job Views: 3 Posted: 26.03.2025 Expiry Date: 10.05.2025 Job Description: SiteReliability Engineer with … Python Our Client is looking to bring on a SiteReliability Engineer to help deploy, manage, troubleshoot, and enhance our complex cloud-based set of internal tools and externally managed services for a variety of users across our wide-ranging organization. You will have at least 7 to … years of hands-on expertise working as a SiteReliability Engineer. You will work closely with IT, product, and engineering to extend and maintain this set of tools and services and to help debug and resolve problems. In addition, the ideal candidate will proactively look for system More ❯
opportunity for you to join a world-class network team in a dynamic environment that has the feel of a start-up. As a SiteReliability Engineer you will help to deploy, manage, fix and reinvent the tools, services and components that network engineering rely on to … life harmony. Striking a healthy balance between your personal and professional life is crucial to your happiness and success here. Key job responsibilities A SiteReliability Engineer is responsible for maintaining their teams’ services, requiring them to troubleshoot and identify the root causes of any issues that arise … within their systems and any subcomponents. A SiteReliability Engineer will utilise testing, monitoring, and validations on their services, tools, and infrastructure to ensure their teams can continuously deploy new versions of the services with minimal interruption. A SiteReliability Engineer will identify areas to invent More ❯
On behalf of the Cabinet Office, we are looking for a Senior SiteReliability Engineer (Inside IR35) for a 6 Month contract based hybrid in London, Bristol or Manchester. The Cabinet Office supports the Prime Minister and ensures the effective running of government. The Cabinet Office is also … given to candidates who meet all of the essential criteria and hold active security clearance. Job Description: We are seeking a dedicated and skilled SiteReliability Engineer to join … our team. The ideal candidate will have expertise in maintaining and improving the reliability, performance, and scalability of our systems. As a senior SRE, you will work collaboratively with software engineers, operations, and other stakeholders to ensure a seamless and reliable service for our users. Key Responsibilities: Implement and More ❯
SiteReliability Engineer (SRE) - iCloud London, England, United Kingdom Software and Services The Apple Service Engineering - iCloud SRE team is looking for SiteReliability Engineers to build and run the services that hundreds of millions of customers use every day. This team provides systems that … to provide extraordinary availability, scalability, and security for services that “just work.” We're looking for a talented and passionate person who loves designing, engineering, and running systems and infrastructure that will help millions of customers. Description The services that Apple and … iCloud run are massive; iCloud comprises a set of platforms and products which are foundational for both users and other Apple Services. As an SRE @ Apple, you'll need to solve problems using data, teamwork, and your own expertise. SREs @ Apple own the full infrastructure stack; from device driver performance More ❯
and more, aiming to transform traditional business models with cutting-edge digital solutions. Know more about us: JD Corporate JD.com is seeking a passionate sitereliability engineer who can ensure the stability of our eCommerce mobile apps and web apps in European countries such as the UK, France … Germany, and the Netherlands. In this role, you will be responsible for monitoring, incident management, automating deployments, scaling, reliability testing, incident post-mortems, and more. You will work closely with engineering and commercial teams globally to ensure a seamless, reliable user experience while balancing the demands of development … s degree in Computer Science, Software Engineering, or a related field. 3+ years of experience in DevOps, sitereliabilityengineering (SRE), system stability assurance, operations and maintenance development, or related fields. Experience with common public cloud platform products (e.g., cloud hosting, cloud storage, object storage, CDN More ❯