Lead Site Reliability Engineer
Wallingford, England, United Kingdom
Hybrid / WFH Options
Hybrid / WFH Options
Dexory
oriented product that integrates autonomous robot systems and data insights. Your role will be pivotal in developing and maintaining company-wide monitoring, alerting, and management systems. You will work across various teams to implement robust incident management strategies and support engineering teams in collecting and publishing critical … documentation and runbooks to handle changes and incidents efficiently. Your key responsibilities will include: Monitoring and maintaining our systems for metrics collection, alerting, and incident management. Working across teams to ensure a robust incident management strategy is in place. Preparing documentation and runbooks for handling changes and … have: Extensive experience in site reliability engineering or a related field, with a focus on hardware-oriented products. Strong knowledge of monitoring, alerting, and incident management systems. Proven experience in developing and implementing incident management strategies. Proficiency in creating and maintaining documentation and runbooks. Experience with more »
Posted: