Bash/PowerShell). System Knowledge: Hands-on experience with Linux and Windows. Preferred Skills: Familiarity with Refinitiv TREP and DevOps tools (GitHub, Slack, OpsGenie).
Experience with frontend SPA frameworks (e.g., React, Angular, Vue). Experience with container technology (e.g., Docker, Kubernetes). Experience with observability and telemetry (e.g., Opsgenie, Splunk). Experience with SQL Server or other relational database systems. Experience with any non-relational database service (e.g., MongoDB, DynamoDB). Ability to …
are willing to present and defend your ideas to technical and non-technical audiences. Additional Desired Skills: Experience with incident management platforms like PagerDuty, OpsGenie, or similar tools. Understanding of SLO/SLA management and implementations. Knowledge of industry-standard incident management frameworks and best practices. Familiarity with automated …
to have: Either coding or scripting experience. Tools: Kubernetes, Docker and message queues (e.g., Kafka, RabbitMQ, HiveMQ) or similar queuing technology, Prometheus, Grafana, BetterStack, OpsGenie, and Status Page. Soft skills: Leadership, mentoring, stakeholder management, problem-solving, strategic thinking, cultural leadership, and being a self-starter. Diversity and Inclusion: We believe that …
/distributed environment, debug and solve it in a structured manner. Knowledge of Kubernetes optional. Knowledge of modern MLA stacks (Prometheus, Grafana, Loki, Vector, OpsGenie). Knowledge of DPUs a plus. Python programming skills a plus. Postgres optimization skills a plus. WHAT WE OFFER: With us, you will work …
effort and the escalation and prioritisation of those items. Monitor hardware, applications and environmental conditions of our Order Management systems using tools such as OpsGenie & CheckMK (Nagios). Manage production releases of our Order Management systems. Participate in Disaster Recovery planning, updating runbooks and DR tests. Ensure that …
Automation & IaC – Use Python, PowerShell, Terraform, and Ansible to automate configurations, monitoring, and troubleshooting. Monitoring & Observability – Maintain and improve system observability with Grafana, Splunk, OpsGenie, and PRTG to proactively address issues. Incident & Disaster Recovery – Manage incident response, root cause analysis, and DR plans to ensure business continuity. Security & Compliance … troubleshooting skills in Linux & Windows environments. Deep knowledge of cloud platforms (Azure, AWS), VMware, Citrix, and Office 365. Expertise in monitoring tools (Grafana, Splunk, OpsGenie, PRTG). Hands-on experience with Terraform & Ansible for system configuration. Proficiency in Python & PowerShell for automation. Strong leadership & stakeholder engagement experience. Familiarity with ITIL …