Senior Site Reliability Engineer
- Hiring Organisation: Moneycorp
- Location: London, England, United Kingdom
including SLO governance, resilience testing, and platform patterns, ensuring our systems meet the highest levels of operational resilience and regulatory compliance.

Key Responsibilities:

Reliability Engineering & Observability
- Define and maintain SLOs/SLIs and error budgets for critical services
- Build and improve observability pipelines (metrics, logs, traces)
- Maintain dashboards … issues using telemetry

Automation, DR & Resilience Testing
- Automate backup, restore, and failover processes
- Validate RTO/RPO through regular DR testing
- Design and run chaos engineering experiments
- Enhance self-healing and rollback automation

Operational Excellence & Incident Leadership
- Lead SEV-1/SEV-2 incidents and authorize critical decisions ...