Manchester, England, United Kingdom Hybrid / WFH Options
AJ Bell
talent with the majority. VMware - ESXi, vROps, vRA, NSX, vRNI, vRLI, HCX. SD-WAN. Fortinet firewall products. Linux server configuration and management. SolarWinds, Zabbix, Opsgenie monitoring software. iSCSI SAN technology, Dell EqualLogic & NetApp SANs. Microsoft Windows Server. ADFS, SSO, MFA. Microsoft Exchange Server. Office 365. Windows VDI, VMware Horizon.
Bash/PowerShell). System Knowledge: Hands-on experience with Linux and Windows. Preferred Skills: Familiarity with Refinitiv TREP and DevOps tools (GitHub, Slack, OpsGenie).
and logging solutions, e.g. Prometheus, AWS CloudWatch, Grafana, OpenTelemetry, Honeycomb, ELK etc. Basic SRE knowledge, and experience in alerting and incident management platforms (e.g. Opsgenie, PagerDuty). Proven ability to provide and support strong and scalable CI/CD pipelines. Linux, Git, Docker and good scripting skills in e.g. …
define, implement and improve business performance SLOs. 2+ years of experience with Production operations including 24x7 on-call support, escalation/paging with OpsGenie, incident management, RCA (Root Cause Analysis) and retrospective analysis. 2+ years in hands-on technical roles (such as site reliability engineer, software …
Prometheus, AWS CloudWatch, Grafana, OpenTelemetry, Honeycomb, and ELK. Basic knowledge of Site Reliability Engineering (SRE) and experience with alerting and incident management systems like Opsgenie and PagerDuty. Demonstrated capability to develop and maintain robust and scalable Continuous Integration/Continuous Deployment (CI/CD) pipelines. Familiar with Linux, Git …
technical architecture, service management, 24/7 operations, and client support. Familiarity with monitoring, observability, and incident management tools (e.g., Datadog, New Relic, Prometheus, Grafana, Opsgenie, PagerDuty, ServiceNow). Excellent communication skills, with the ability to explain technical concepts to non-technical stakeholders. Strong client-facing experience, with the ability …
Back End, and APIs. There's also a strong DevOps and observability culture, so you'll get stuck into tooling like Dynatrace, Splunk, and OpsGenie, and help improve reliability and performance from the ground up. This is a role for someone who wants to own the quality space and …
frontend, backend, and APIs. There's also a strong DevOps and observability culture, so you'll get stuck into tooling like Dynatrace, Splunk, and OpsGenie, and help improve reliability and performance from the ground up. This is a role for someone who wants to own the quality space and …
effort and the escalation and prioritisation of those items. Monitor hardware, applications and environmental conditions of our Order Management systems using tools such as OpsGenie & CheckMK (Nagios). Manage production releases of our Order Management systems. Participate in Disaster Recovery planning, updating run books and DR tests. Ensure that …
distributed environment, debug and solve it in a structured manner. Knowledge of Kubernetes is optional. Knowledge of modern MLA stacks (Prometheus, Grafana, Loki, Vector, Opsgenie). Knowledge of DPUs is a plus. Python programming skills are a plus. Postgres optimization skills are a plus. WHAT WE OFFER With us …
distributed environment, debug and solve it in a structured manner. Knowledge of Kubernetes (optional). Knowledge of modern MLA stacks (Prometheus, Grafana, Loki, Vector, Opsgenie). Location: 108 E 16th Street, New York, NY 10003
Automation & IaC – Use Python, PowerShell, Terraform, and Ansible to automate configurations, monitoring, and troubleshooting. Monitoring & Observability – Maintain and improve system observability with Grafana, Splunk, OpsGenie, and PRTG to proactively address issues. Incident & Disaster Recovery – Manage incident response, root cause analysis, and DR plans to ensure business continuity. Security & Compliance … troubleshooting skills in Linux & Windows environments. Deep knowledge of cloud platforms (Azure, AWS), VMware, Citrix, and Office 365. Expertise in monitoring tools (Grafana, Splunk, OpsGenie, PRTG). Hands-on experience with Terraform & Ansible for system configuration. Proficiency in Python & PowerShell for automation. Strong leadership & stakeholder engagement experience. Familiarity with ITIL …
Swindon, England, United Kingdom Hybrid / WFH Options
Vision Municipal Solutions