4 of 4 Slurm Workload Manager Jobs in the UK

HPC AI Cloud Engineer

Hiring Organisation
WWT EMEA UK LIMITED
Location
Manchester, North West, United Kingdom
Employment Type
Contract
Contract Rate
£77 per hour
Hands-on with NVIDIA (CUDA/NCCL), AMD (ROCm), and TPUs (XLA/JAX/TF) Solid knowledge of HPC concepts (MPI, RDMA, InfiniBand, Slurm/Kubernetes) Experience with performance benchmarks (MLPerf, HPL, NCCL, STREAM) Proficiency in Python, Bash, and IaC tools (Terraform/Ansible) Ability to analyze profiling ...

HPC & AI Infrastructure Engineer — AWS, Slurm & GPUs

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
A leading research technology company in the Greater London area is seeking a skilled professional to join their research computing team. In this role, you will manage and optimize a high-performance environment, supporting advanced ...

Site Reliability Engineer

Hiring Organisation
Jobleads-UK
Location
Greater London, England, United Kingdom
DevOps, Sysadmin, and/or HPC engineering experience. Great verbal and written communication skills in English. Experience deploying and operating Kubernetes and/or SLURM clusters. Experience in writing Go, Python, Bash. Experience using Ansible, Terraform, and other automation or IAC tools. Strong engineering background, preferably in Computer Science … Software Engineering, Math, Computer Engineering, or similar fields. Nice To Haves You have built and operated an AI workload at 1000+ GPU scale. You have built multi-tenant, hyperscale Kubernetes based services. You have physically deployed infrastructure in a datacenter, managed bare metal hardware via MaaS or Netbox, etc. ...

Senior HPC Engineer

Hiring Organisation
Jobleads-UK
Location
Bristol, England, United Kingdom
Linux system administration. Familiarity with at least one scripting language (e.g., Bash, Python). Interest in high-performance computing and willingness to learn Slurm, xCAT, and Ansible. Strong problem‐solving skills and attention to detail. Good communication and collaboration skills. Ability to work independently with mentorship and as part … week. No hybrid/remote working option. Internship or academic experience in a research computing or HPC environment. Exposure to job schedulers (e.g., Slurm, LSF). Familiarity with version control systems (e.g., Git). Coursework or projects involving distributed systems, networking, or parallel computing. Understanding of basic cybersecurity concepts. ...