Manchester, England, United Kingdom Hybrid / WFH Options
AJ Bell
talent with the majority. VMware - ESXi, vROps, vRA, NSX, vRNI, vRLI, HCX. SD WAN. Fortinet firewall products. Linux server configuration and management. SolarWinds, Zabbix, Opsgenie monitoring software. iSCSI SAN technology, Dell EqualLogic & NetApp SANs. Microsoft Windows Server. ADFS, SSO, MFA. Microsoft Exchange Server. Office 365. Windows VDI, VMware Horizon. More ❯
Bash/PowerShell). System Knowledge: Hands-on experience with Linux and Windows. Preferred Skills: Familiarity with Refinitiv TREP and DevOps tools (GitHub, Slack, OpsGenie). More ❯
and logging solutions, e.g. Prometheus, AWS Cloudwatch, Grafana, OpenTelemetry, Honeycomb, ELK etc. Basic SRE knowledge, and experience in alerting and incident management platforms (eg. Opsgenie, Pagerduty). Proven ability to provide and support strong and scalable CI/CD pipelines. Linux, Git, Docker and good scripting skills in e.g. More ❯
Prometheus, AWS CloudWatch, Grafana, OpenTelemetry, Honeycomb, and ELK. Basic knowledge of Site Reliability Engineering (SRE) and experience with alerting and incident management systems like Opsgenie and PagerDuty. Demonstrated capability to develop and maintain robust and scalable Continuous Integration/Continuous Deployment (CI/CD) pipelines. Familiar with Linux, Git More ❯
technical architecture, service management, 24 7 operations, and client support. Familiarity with monitoring, observability, and incident management tools (e.g., Datadog, New Relic, Prometheus, Grafana, Opsgenie, PagerDuty, ServiceNow). Excellent communication skills, with the ability to explain technical concepts to non-technical stakeholders. Strong client-facing experience, with the ability More ❯
technical architecture, service management, 24 7 operations, and client support. Familiarity with monitoring, observability, and incident management tools (e.g., Datadog, New Relic, Prometheus, Grafana, Opsgenie, PagerDuty, ServiceNow). Excellent communication skills, with the ability to explain technical concepts to non-technical stakeholders. Strong client-facing experience, with the ability More ❯
effort and the escalation and prioritisation of those items. Monitor hardware, applications and environmental conditions of our Order Management systems using tools such as OpsGenie & CheckMK (Nagios). Manage production releases of our Order Management systems. Participate in Disaster Recovery planning, updating run books and DR tests. Ensure that More ❯
Additional Tools & Practices : Experience with JIRA, SCRUM, and Confluence to support project management and communication across teams. Incident Management Tools: Familiarity with tools like OpsGenie or similar for effective incident response. Understanding of SRE Concepts such as SLIs (Service Level Indicators), SLOs (Service Level Objectives), error budgets, and operational More ❯
effort and the escalation and prioritisation of those items. Monitor hardware, applications and environmental conditions of our Order Management systems using tools such as OpsGenie & CheckMK (Nagios). Manage production releases of our Order Management systems. Participate in Disaster Recovery planning, updating run books and DR tests. Ensure that More ❯
distributed environment, debug and solve it in a structured manner. Knowledge of Kubernetes is optional. Knowledge of modern MLA stacks (Prometheus, Grafana, Loki, Vector, Opsgenie). Knowledge of DPUs is a plus. Python programming skills are a plus. Postgres optimization skills are a plus. WHAT WE OFFER With us More ❯
distributed environment, debug and solve it in a structured manner. Knowledge of Kubernetes (optional). Knowledge of modern MLA stacks (Prometheus, Grafana, Loki, Vector, Opsgenie). Location: 108 E 16th Street, New York, NY 10003 #J-18808-Ljbffr More ❯
Automation & IaC – Use Python, PowerShell, Terraform, and Ansible to automate configurations, monitoring, and troubleshooting. Monitoring & Observability – Maintain and improve system observability with Grafana, Splunk, OpsGenie, and PRTG to proactively address issues. Incident & Disaster Recovery – Manage incident response, root cause analysis, and DR plans to ensure business continuity. Security & Compliance … troubleshooting skills in Linux & Windows environments Deep knowledge of cloud platforms (Azure, AWS), VMware, Citrix, and Office 365 Expertise in monitoring tools (Grafana, Splunk, OpsGenie, PRTG) Hands-on experience with Terraform & Ansible for system configuration Proficiency in Python & PowerShell for automation Strong leadership & stakeholder engagement experience Familiarity with ITIL More ❯
Swindon, England, United Kingdom Hybrid / WFH Options
Vision Municipal Solutions
Your Privacy We use cookies and similar technologies to help personalise content, tailor and measure ads, and provide a better experience. By clicking "All Cookies", you agree to this, as outlined in our Cookie Policy. By clicking "Essential Cookies" you More ❯