Slurm Workload Manager Jobs in London

1 to 25 of 33 Slurm Workload Manager Jobs in London

Senior HPC AI Engineer

London, England, United Kingdom
NVIDIA
up large scale performance platforms. What you will be doing: Design, implement and maintain large scale HPC/AI clusters with monitoring, logging and alerting Manage Linux job/workload schedules and orchestration tools Develop and maintain continuous integration and delivery pipelines Develop tooling to automate deployment and management of large-scale infrastructure environments, to automate operational monitoring and … of HPC and AI solution technologies from CPU’s and GPU’s to high speed interconnects and supporting software Experience with job scheduling workloads and orchestration tools such as Slurm, K8s Excellent knowledge of Windows and Linux (Redhat/CentOS and Ubuntu) networking (sockets, firewalld, iptables, wireshark, etc.) and internals, ACLs and OS level security protection and common protocols More ❯
Posted:

Dev Ops Lead , , London

London, England, United Kingdom
Bit Warmer Ltd
social responsibility. SUSE Stack: System is based on SUSE’s transactional Leap Micro, using Salt Stack configuration through Uyuni. High-Performance Computing (HPC): Experience with HPC and tools like SLURM would be advantageous; either as a user or admin. Distributed Workloads: Background in managing distributed systems. Single Pane Solutions: Familiarity with Rancher, Azure Arc, etc. Workflow Orchestration Tools: Knowledge More ❯
Posted:

Platform Services Specialist- Global Investment Management

London, England, United Kingdom
Oxford Knight
AWS and GCP). disaster recovery management leveraging cloud specific capabilities. Cloud Storage concepts (Block storage/Blob storage). job scheduling tools such as Airflow, Prefect Scheduler and Slurm (or other HPC scheduler). designing and maintaining CICD pipelines to ensure fast delivery and integration of the platform services. Contact If this sounds like you, or you'd More ❯
Posted:

Research Engineer, Machine Learning

London, England, United Kingdom
Hybrid / WFH Options
Mistral AI
or equivalent proven track record) 4 + years working on large-scale ML codebases Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed/FSDP/SLURM/K8s) Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops Strong software-design instincts: testing, code review, CI/CD Self-starter, low More ❯
Posted:

Quantitative Developer

London, England, United Kingdom
Hybrid / WFH Options
Tower Research Capital
learn, PyTorch) Experience building distributed systems with message buses (Kafka, ZeroMQ) and asynchronous I/O Experience with cloud or on-prem orchestration and scheduling frameworks (Kubernetes, HT Condor, SLURM) Benefits Tower’s headquarters are in the historic Equitable Building, right in the heart of NYC’s Financial District and our impact is global, with over a dozen offices More ❯
Posted:

Machine Learning Software Engineer, Research

London, England, United Kingdom
Hybrid / WFH Options
PhysicsX Ltd
for computer vision, geometry processing, or scientific computing; software engineering concepts and best practices (e.g., versioning, testing, CI/CD, API design, MLOps); container-ization and orchestration (Docker, Kubernetes, Slurm); writing pipelines and experiment environments, including running experiments in pipelines in a systematic way. What we offer Be part of something larger: Make an impact and meaningfully shape an More ❯
Posted:

Machine Learning Software Engineer, Research

London, England, United Kingdom
Hybrid / WFH Options
PhysicsX
for computer vision, geometry processing, or scientific computing; software engineering concepts and best practices (e.g., versioning, testing, CI/CD, API design, MLOps); container-ization and orchestration (Docker, Kubernetes, Slurm); writing pipelines and experiment environments, including running experiments in pipelines in a systematic way What We Offer Be part of something larger: Make an impact and meaningfully shape an More ❯
Posted:

Biostatistics HPC Solution Architect

London, England, United Kingdom
Hybrid / WFH Options
ZipRecruiter
days per week. This is a 6 month temporary contract, to start ASAP. Day rate: Competitive Market rate. The right candidate should have a strong understanding of HPC (Slurm) including the installation and configuration. Key Requirements: Strong understanding of Infrastructure (Azure, On-premises and other cloud techs) Knowledge of Cloud Platforms: Understanding of cloud platforms and Azure, in particular … R : The candidate should have a good understanding of R HPC Skills: The candidate should have a strong understanding of HPC (Slurm) including the installation and configuration Experience with Python: The candidate should have experience with the Python installation and configuration on Linux system Associates should have deep understanding of Biostatistics and Life science domain (especially Clinical) knowledge Basic More ❯
Posted:

HPC Solution Architect

London, England, United Kingdom
Whitehall Resources Ltd
currently seeking an HPC Solution Architect based in Hertfordshire or London for an initial 6-month contract. Note: *** INSIDE IR35 *** The candidate should have a strong understanding of HPC (Slurm), including installation and configuration. Main Responsibilities: Contribute to the development and understanding of various architectural levels. Key Skills: Linux Azure Cloud HPC Python Posit Component (Rstudio) GitHub SAS Design More ❯
Posted:

HPC Solution Architect

London, United Kingdom
Queen Square Recruitment Ltd
clearly across technical and non-technical teams Required Skills & Knowledge: Strong understanding of Infrastructure (Azure, On-premises, Cloud) Proficiency in R and Python environments Experience with HPC systems (e.g., Slurm) Basic SAS knowledge Deep understanding of Life Science and Biostatics Desirable: Background in Life Sciences or Clinical Data Broad knowledge of infrastructure solutions If you have the relevant skills More ❯
Employment Type: Contract
Posted:

Platform Applications Specialist

London, England, United Kingdom
Squarepoint Capital
software development tooling, such as Gitlab, Artifactory, or Docker. Experience with infrastructure automation and configuration management, such as Ansible and Terraform. Experience with HPC and orchestration technologies, such as Slurm or Kubernetes. Experience with Databases and Observability systems, such as Elasticsearch, Datadog, Prometheus, PostgreSQL. #J-18808-Ljbffr More ❯
Posted:

Platform Specialist - Scheduler and Orchestrators

London, England, United Kingdom
Squarepoint Capital
4+ years of experience in DevOps, SRE, or platform engineering roles. Experience with software development (Python, Git) Experience with system administration (Bash, Linux, Containerization) Deep knowledge of HPC (e.g. Slurm) or orchestration technologies (e.g. Kubernetes) Excellent written and verbal communication skills. Ability to work well in a fast-paced environment. Nice to have: Experience with other orchestration technologies (Prefect More ❯
Posted:

Quant Developer (C++)

London, England, United Kingdom
Squarepoint Capital
RAD) Experience working on a global team Python, Q/kdb+ Testing methodologies (unit tests, regression tests) Dev workflow – SVN, GIT, JIRA, Code Reviews, etc Grid & cluster tools (especially SLURM) The minimum base salary for this role is $60,000 if located in New York. This expectation is based on available information at the time of posting. This role More ❯
Posted:

GCP Public Cloud Infrastructure Architect (HPC, GKE)

London, England, United Kingdom
Hybrid / WFH Options
Derisk360
What You Bring 10+ years in cloud infrastructure design or DevOps roles. Proven expertise in Google Cloud infrastructure, GKE, and HPC architecture. Strong background in batch scheduling, job queuing (Slurm), and distributed storage systems. Proficient in Kubernetes internals, pod autoscaling, node management. Skilled in Infrastructure as Code (Terraform, Deployment Manager). Hands-on experience with Docker, Helm, Istio … Trivy or Aqua. Fluent in English, with excellent communication and problem-solving skills. Certification: Google Professional Cloud Architect (mandatory). Nice To Have Experience with GPU/TPU workloads, Slurm, Intel MPI/OpenMPI. Exposure to hybrid or multi-cloud setups using Anthos or GCVE. Familiarity with GitOps (ArgoCD, Flux), workload identity, and K8s RBAC. Experience in life More ❯
Posted:

Head of Engineering

City of London, London, United Kingdom
Hybrid / WFH Options
Enertek Group
engineering teams. Passion for open-source and decentralized infrastructure. Excellent communication and executive presence. Preferred Tech Stack Languages: Go, Rust, Python, Solidity Infrastructure: Kubernetes, Docker, GPU Scheduling (e.g., Kubeflow, Slurm), CI/CD pipelines Blockchain: EVM, Cosmos SDK, ZK/L2 solutions AI Stack (plus): PyTorch, Hugging Face, Ray, ONNX What We Offer Competitive salary + equity/token More ❯
Posted:

Head of Engineering

London Area, United Kingdom
Hybrid / WFH Options
Enertek Group
engineering teams. Passion for open-source and decentralized infrastructure. Excellent communication and executive presence. Preferred Tech Stack Languages: Go, Rust, Python, Solidity Infrastructure: Kubernetes, Docker, GPU Scheduling (e.g., Kubeflow, Slurm), CI/CD pipelines Blockchain: EVM, Cosmos SDK, ZK/L2 solutions AI Stack (plus): PyTorch, Hugging Face, Ray, ONNX What We Offer Competitive salary + equity/token More ❯
Posted:

HPC Infrastructure Engineer

London, England, United Kingdom
PhysicsX Ltd
drive business growth. What You Will Do Enhance our CPU, GPU, HPC, and cloud infrastructure Implement upgrades, patching, and system enhancements Provide expertise with technologies such as Linux, CUDA, SLURM, Python etc. Innovate to maintain the highest standards for our technology stack Drive IT solutions that align with our business objectives Research and evaluate new technology solutions Collaborate with More ❯
Posted:

Member of Technical Staff (Infrastructure)

London, England, United Kingdom
Hybrid / WFH Options
Reka AI
logging tools (e.g., Prometheus, Grafana). A deep understanding of cloud computing platforms (e.g., AWS, GCP, Azure). Strongly desired: Experience with HPC/GPU cluster management tools (e.g., Slurm, GPU monitoring tools, distributed file systems). The ability to build in a fast-paced environment under some uncertainty. Reka's Mission Reka's mission is to build useful More ❯
Posted:

HPC Infrastructure Engineer

London, England, United Kingdom
PhysicsX
drive business growth. What You Will Do Enhance our CPU, GPU, HPC, and cloud infrastructure Implement upgrades, patching, and system enhancements Provide expertise with technologies such as Linux, CUDA, SLURM, Python etc. Innovate to maintain the highest standards for our technology stack Drive IT solutions that align with our business objectives Research and evaluate new technology solutions Collaborate with More ❯
Posted:

Platform Engineer Observability

London, England, United Kingdom
Sahomelocator
Experience with Gitlab, Bitbucket, and CI tools like GitHub or Bamboo. Willingness to engage in technical discussions and produce high-quality code. Enthusiasm to learn and grow. Knowledge of Slurm and HPC is a bonus. The role involves developing in Python within an SRE team, impacting a greenfield set of services that will enhance a leading European trading platform. More ❯
Posted:

Quantitative Developer, Systematic Equities

London, England, United Kingdom
Millennium Management
well as related efforts, such as the preparation and transformation of data and other operational tasks. Preferred Location London or Dubai preferred Principal Responsibilities Partner closely with the Portfolio Manager to develop data engineering and prediction tools primarily for the systematic trading of equities Develop software engineering solutions for quantitative research and trading Assist in designing, coding, and maintaining … Skills Expert in Python and/or KDB/Q Proficient in modern data science tools stacks (Jupyter, pandas, numpy, sklearn) with machine learning experience Good understanding of using Slurm or similar parallel computing tools Bachelor's or Master's degree in Computer Science, Mathematics, Statistics, or related STEM field from top ranked University Proficient in quantitative analysis, mathematical More ❯
Posted:

HPC Linux Operations Engineer Chicago & London

London, United Kingdom
Jump Trading, LLC
compute, storage, and interconnects. Technologies involved include RDMA fabrics, parallel filesystems, HPC batch schedulers, FUSE filesystems, internal Jump software, multi-vendor hardware, cybersecurity requirements, a challenging and unpredictable client workload, and high user expectations Solve problem reports and questions posed by members of Jump's research community, escalating as needed and managing the entire problem lifecycle Respond to alerts … desire for operational work as primary job function 2+ years of professional experience with Linux systems High performance computing (HPC), including parallel filesystems (e.g., Lustre, GPFS), batch systems (e.g., Slurm, Grid Engine), and high-performance network interconnects experience is a plus, but not required High proficiency with at least one programming/scripting language (e.g., Go, Python, C) and More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Infrastructure Architect (we have office locations in Cambridge, Leeds & London)

London, England, United Kingdom
Hybrid / WFH Options
Genomics England
both on-premise and AWS. About the Tech Stack Our HPC clusters are built in our on-premises data centres and in AWS. We use IBM LSF for our workload management currently. Hardware wise we have a large footprint of FGPA Servers (DRAGEN) both on-premises and in AWS, as well as standard HPC Compute nodes both on-premises … certifications, we are primarily interested in your real-world experience. Essential Skills and Experience: Extensive knowledge and understanding of HPC Technologies – Including but not limited to IBM LSF, NextFlow, Slurm, AWS Batch. Experience working within an On-Premises estate and working to build/design platform on premise considering physical networking, Bare Metal Servers, and Hardware Lifecycles. Strong Experience More ❯
Posted:

HPC Engineer

London, United Kingdom
LinuxRecruit
the future of healthcare today. This company is on the hunt for HPC Engineers to power their 25 Petabyte system Sound good? Well there's more! Imagine working with Slurm clusters and GPFS storage, all while being an integral part of groundbreaking translational research. You will work in adynamic team of five, where your hands-on expertise will support More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Senior Quantitative Researcher – Systematic Macro & Execution Alpha | London, UK | In-Office

London, England, United Kingdom
Eka Finance
high-caliber team of junior researchers (2–3 people), contributing to both leadership and hands-on research Leverage a modern research stack that includes distributed computing environments (e.g. AWS, Slurm), large-scale data tools (e.g. kdb+, Exasol), and advanced methods in statistics and machine learning Ideal Candidate Will Have: 3+ years of experience in a quantitative trading or research More ❯
Posted: