with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable reading, modifying, and writing code in Python and/or Node.js Nice to Have More ❯
with relational databases in production environments (e.g., Postgres, MySQL), including basic performance troubleshooting, migrations, backups, and access control. Familiarity with observability tools such as Prometheus, Grafana, ELK stack, or OpenTelemetry Experience with container orchestration platforms, particularly Kubernetes Ability to systematically troubleshoot and debug distributed systems Comfortable reading, modifying, and writing code in Python and/or Node.js Nice to Have More ❯
and automated policy and quality controls in delivery pipelines, ideally with Azure DevOps. Extensive change management experience and evidential metrics-based delivery capability. Standardised logging/metrics/tracing (OpenTelemetry, Prometheus/Grafana); built comprehensive dashboards and runbooks. Owned Service reliability: SLOs/SLIs, error budgets, chaos experiments, game days, incident command. Nice-to-Haves Mortgage origination and servicing, Payments More ❯
of CI/CD pipelines using GitLab and ArgoCD. Design and operate containerised workloads with EKS, Fargate, and Kubernetes. Manage Kubernetes deployments using Helm charts. Implement observability solutions using OpenTelemetry (OTel), Grafana, and Splunk. Optimise infrastructure with Karpenter for autoscaling and cost efficiency. Ensure robust AWS networking (VPC, Transit Gateway, PrivateLink, Route 53) and enforce security best practices. Drive incident … response, monitoring, and performance tuning. Key Technologies: AWS (EKS, Fargate, EC2, S3), Terraform, CloudFormation, GitLab, ArgoCD, Docker, Kubernetes, Helm, Cassandra, OTel, Grafana, Splunk, Karpenter, Python, Bash. Desirable: Experience with Google Cloud Platform (GCP), Apigee Hybrid, and hybrid/multi-cloud environments. Carbon60, Lorien & SRG - The Impellam Group STEM Portfolio are acting as an Employment Business in relation to this vacancy. More ❯
Lambda, ECS/Fargate, S3, DynamoDB, CloudWatch, API Gateway) Data & Messaging: PostgreSQL, Redis, Kafka or SQS CI/CD & Infrastructure: Docker, Terraform, GitHub Actions, CloudFormation Monitoring & Observability: Prometheus, Grafana, OpenTelemetry Testing: Pytest, integration and load testing frameworks Key Skills & Expertise Proven experience designing and delivering production systems using Python on AWS . Strong understanding of distributed systems, API design, and More ❯
Lambda, ECS/Fargate, S3, DynamoDB, CloudWatch, API Gateway) Data & Messaging: PostgreSQL, Redis, Kafka or SQS CI/CD & Infrastructure: Docker, Terraform, GitHub Actions, CloudFormation Monitoring & Observability: Prometheus, Grafana, OpenTelemetry Testing: Pytest, integration and load testing frameworks Key Skills & Expertise Proven experience designing and delivering production systems using Python on AWS . Strong understanding of distributed systems, API design, and More ❯
Preferred Qualifications: OpenShift certifications (e.g., Red Hat Certified Specialist in OpenShift Administration). Experience with multi-cluster and hybrid cloud OpenShift deployments. Familiarity with monitoring and logging tools (e.g., oTel, Grafana, Splunk stack). Knowledge of OpenShift Operators and Helm charts. Experience with large-scale migration projects. More ❯
Preferred Qualifications: OpenShift certifications (e.g., Red Hat Certified Specialist in OpenShift Administration). Experience with multi-cluster and hybrid cloud OpenShift deployments. Familiarity with monitoring and logging tools (e.g., oTel, Grafana, Splunk stack). Knowledge of OpenShift Operators and Helm charts. Experience with large-scale migration projects. More ❯
developer self-service capabilities. Programming/scripting proficiency in Python and Bash , with a focus on automation frameworks and platform tooling. Familiarity with modern observability stacks (Grafana, Prometheus, Loki, OpenTelemetry ) and centralised logging/search platforms . Strong collaboration and communication skills, with the ability to explain technical concepts clearly to non-technical stakeholders. Solid understanding of AWS security and More ❯
developer self-service capabilities. Programming/scripting proficiency in Python and Bash , with a focus on automation frameworks and platform tooling. Familiarity with modern observability stacks (Grafana, Prometheus, Loki, OpenTelemetry ) and centralised logging/search platforms . Strong collaboration and communication skills, with the ability to explain technical concepts clearly to non-technical stakeholders. Solid understanding of AWS security and More ❯
will drive DevOps adoption, build scalable telemetry systems, integrate with monitoring tools, and optimize performance across cloud and on-prem infrastructure. Key Responsibilities: Design and implement observability solutions using OpenTelemetry across storage platforms. Develop and maintain CI/CD pipelines , distributed tracing, metrics, and logging. Integrate telemetry with tools like Prometheus, Grafana, Kafka, Splunk, Loki . Analyze telemetry data, optimize More ❯
will drive DevOps adoption, build scalable telemetry systems, integrate with monitoring tools, and optimize performance across cloud and on-prem infrastructure. Key Responsibilities: Design and implement observability solutions using OpenTelemetry across storage platforms. Develop and maintain CI/CD pipelines , distributed tracing, metrics, and logging. Integrate telemetry with tools like Prometheus, Grafana, Kafka, Splunk, Loki . Analyze telemetry data, optimize More ❯
with relational database schemas Excellent problem solving and communication skills, with a collaborative mindset Proficient in incremental software delivery leveraging agile processes Experience with software observability practices (distributed tracing, OpenTelemetry, etc.) Basic understanding of artificial intelligence concepts, with curiosity and enthusiasm for learning how AI tools can be used to improve processes and drive efficiency. Interest in exploring AI systems More ❯
with observability tools, APM, log analytics, and infrastructure monitoring. Proficiency in scripting or programming languages (e.g., Java, Python, JavaScript). Certifications in Dynatrace, AWS, Azure, or GCP. Familiarity with OpenTelemetry, FluentBit, Cribl, or similar data pipeline tools.Ability to translate technical capabilities into business value, aligning observability solutions with customer KPIs and strategic goals Excellent communication and presentation skills. Ability to More ❯
Infrastructure as Code principles Hands-on experience with CI/CD systems and release automation Comfortable working with Python, Bash, or Go Monitoring and alerting tools (Prometheus, CloudWatch, Grafana, OpenTelemetry, etc.) Passionate about developer tooling, DevOps culture, and improving engineering workflows Any experience in Fintech or with public APIs would be a bonus Sound Interesting? Feel free to reach-out More ❯
Infrastructure as Code principles Hands-on experience with CI/CD systems and release automation Comfortable working with Python, Bash, or Go Monitoring and alerting tools (Prometheus, CloudWatch, Grafana, OpenTelemetry, etc.) Passionate about developer tooling, DevOps culture, and improving engineering workflows Any experience in Fintech or with public APIs would be a bonus Sound Interesting? Feel free to reach-out More ❯
development. Familiarity with testing frameworks (Vitest, Playwright) for both API and end-to-end testing. Experience with Docker, Helm, YAML, Kubernetes, and cloud-native deployments. Telemetry tools; Prometheus, Grafana, OpenTelemetry, DataDog, APM tools Understanding of infrastructure-as-code and CI/CD pipelines. Ability to improve codebases and influence architectural direction. Experience mentoring or coaching engineers. Please send updated CV More ❯
architectures . Deep understanding of CI/CD (GitHub Actions, Jenkins, or AWS CodePipeline). Proven ability to secure and scale production systems. Monitoring and observability tools (CloudWatch, Grafana, OpenTelemetry). Familiar with data exchange formats (JSON, YAML, Parquet) and API design. Leadership & Delivery 4–8 years in software development and/or DevOps , including 2+ in a management or More ❯
ML lifecycle tools, model monitoring, and versioning Exposure to tools like KServe, Ray Serve, Triton, or vLLM a big plus Bonus Points: Experience with observability frameworks like Prometheus or OpenTelemetry Knowledge of ML libraries: TensorFlow, PyTorch, HuggingFace Exposure to Azure or GCP Passion for financial services Requirements: Degree in Computer Science, Engineering, Data Science or similar What We Offer A More ❯