Slurm Workload Manager Jobs in the UK

1 to 25 of 27 Slurm Workload Manager Jobs in the UK

Senior HPC Infrastructure Engineer

Hampshire, England, United Kingdom
Hybrid / WFH Options
Hays Specialist Recruitment Limited
It's a great opportunity for someone who thrives in project-led infrastructure work and wants to help shape cutting-edge HPC solutions. What you'll need to succeed Slurm: Proven experience managing and tuning HPC job schedulers. Infiniband and RoCE: Deep knowledge of high-speed networking technologies. Ansible: Proficiency in using Ansible for automation and configuration management. Networking More ❯
Employment Type: Full-Time
Salary: £100,000 - £130,000 per annum
Posted:

HPC Engineer

London, United Kingdom
Red - The Global SAP Solutions Provider
or engineering large-scale computing environments (HPC, HTC, or Big Compute). Proven ability to drive innovation and integrate emerging technologies into HPC solutions. Administration experience with cluster and workload management software (eg, Slurm , LSF , Grid Engine ). Strong knowledge of Linux system administration , TCP/IP Networking , and storage systems . Experience managing parallel file systems (eg More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

HPC Engineer (High-Performance Computing)

United Kingdom, UK
Hybrid / WFH Options
RED Global
or engineering large-scale computing environments (HPC, HTC, or Big Compute). Proven ability to drive innovation and integrate emerging technologies into HPC solutions. Administration experience with cluster and workload management software (e.g., Slurm , LSF , Grid Engine ). Strong knowledge of Linux system administration , TCP/IP networking , and storage systems . Experience managing parallel file systems (e.g. More ❯
Employment Type: Part-time
Posted:

HPC Engineer (High-Performance Computing)

Greater London, England, United Kingdom
Hybrid / WFH Options
RED Global
or engineering large-scale computing environments (HPC, HTC, or Big Compute). Proven ability to drive innovation and integrate emerging technologies into HPC solutions. Administration experience with cluster and workload management software (e.g., Slurm , LSF , Grid Engine ). Strong knowledge of Linux system administration , TCP/IP networking , and storage systems . Experience managing parallel file systems (e.g. More ❯
Posted:

HPC Engineer (High-Performance Computing)

london, south east england, united kingdom
Hybrid / WFH Options
RED Global
or engineering large-scale computing environments (HPC, HTC, or Big Compute). Proven ability to drive innovation and integrate emerging technologies into HPC solutions. Administration experience with cluster and workload management software (e.g., Slurm , LSF , Grid Engine ). Strong knowledge of Linux system administration , TCP/IP networking , and storage systems . Experience managing parallel file systems (e.g. More ❯
Posted:

HPC Engineer (High-Performance Computing)

slough, south east england, united kingdom
Hybrid / WFH Options
RED Global
or engineering large-scale computing environments (HPC, HTC, or Big Compute). Proven ability to drive innovation and integrate emerging technologies into HPC solutions. Administration experience with cluster and workload management software (e.g., Slurm , LSF , Grid Engine ). Strong knowledge of Linux system administration , TCP/IP networking , and storage systems . Experience managing parallel file systems (e.g. More ❯
Posted:

Lead HPC & AI Infrastructure Engineer

Dorset, England, United Kingdom
Hybrid / WFH Options
Hays Specialist Recruitment Limited
infrastructure solutions across compute, storage, and networking Producing detailed technical documentation: hardware specs, data centre layouts, cabling, power and cooling Installing and tuning Linux-based operating systems and configuring SLURM job schedulers Optimising high-speed networking technologies (Infiniband, RoCE) Automating deployments and maintenance using Ansible, Terraform, Bash, and Python Troubleshooting complex distributed systems and mentoring junior engineers This is … building systems that scale, this role is for you. What you'll need to succeed Proven experience designing and scaling large HPC clusters (hundreds to thousands of nodes) Strong SLURM configuration skills - partitions, priorities, resource management Advanced Linux administration and performance tuning Expertise in high-performance networking (Infiniband, RoCE, RDMA) Experience with distributed file systems (Lustre, Ceph, WEKA, VAST More ❯
Employment Type: Full-Time
Salary: £130,000 per annum
Posted:

R&D Solution Architect

United Kingdom
Elanco Tiergesundheit AG
." - "Practical experience designing and implementing solutions that adhere to FAIR data principles (Findable, Accessible, Interoperable, Reusable)." - Experience architecting for High-Performance Computing (HPC) environments, including knowledge of workload schedulers (e.g., SLURM) and applying cloud-native patterns to scientific, batch-processing workloads. - Familiarity with scientific workflow management tools (e.g., Nextflow, Snakemake) and the use of containerization (Docker More ❯
Employment Type: Permanent
Salary: GBP Annual
Posted:

Associate Technical Architect - Platform Engineering

United Kingdom
Quantiphi
to design, optimize, and scale infrastructure for GenAI workloads. The ideal candidate will have deep hands-on experience in GPU profiling, parallelization strategies, and scheduling compute-intensive jobs using Slurm and Red Hat OpenShift. This role also includes supporting the build-out of GenAI platform foundations and contributing to customer-facing projects in production environments. Key Responsibilities: Design and … implement infrastructure architectures for LLM and GenAI workloads on multi-GPU systems. Perform GPU profiling, benchmarking, and performance optimization across distributed training workloads. Manage and schedule jobs on Slurm-based clusters and containerized environments like Red Hat OpenShift/Kubernetes. Enable and optimize NVIDIA GPU stack components (CUDA, cuDNN, NCCL, Triton Inference Server, RAPIDS, etc.) for GenAI and DL … e.g., Terraform, Helm charts) for deploying GPU-ready environments. Contribute to internal capability development (e.g., workshops, PoCs) and translate solutions to client delivery engagements. Required Skills: Strong expertise in Slurm job scheduler and distributed training environments. Experience with Red Hat OpenShift and/or Kubernetes-based orchestration. Knowledge of NVIDIA GPU ecosystem – CUDA, cuDNN, NCCL, Nsight Systems, and NVIDIA More ❯
Posted:

Junior HPC Engineer

northamptonshire, midlands, united kingdom
Arcus Search
science and engineering. Youll gain exposure to technologies that power large-scale modeling, including FEA, and data-driven research, and develop your skills across Linux systems, compute clusters, and workload management tools. Responsibilities: Assist in the setup, monitoring, and maintenance of HPC clusters, storage, and interconnects. Support Linux system administration tasks (RHEL, Rocky), with a focus on stability and … uptime. Help configure and troubleshoot workload managers such as Slurm. Work with senior engineers to monitor performance of key applications and identify opportunities for improvement. Contribute to scripting and automation tasks (Bash, Python) to streamline system operations. Support end-users by responding to tickets, preparing documentation, and guiding researchers on best practices. Learn about parallel computing concepts (MPI, OpenMP … or Python). Exposure to bare metal environments (installing, configuring, and troubleshooting physical servers). Interest in high-performance computing, scientific computing, or distributed systems. Eagerness to learn about workload managers (Slurm or similar). Good problem-solving skills, with the ability to troubleshoot technical issues. Strong communication skills and a collaborative mindset. This role is ideal for More ❯
Posted:

Junior HPC Engineer

Oxfordshire, England, United Kingdom
Arcus Search
and engineering. You’ll gain exposure to technologies that power large-scale modeling, including FEA, and data-driven research, and develop your skills across Linux systems, compute clusters, and workload management tools. Responsibilities: Assist in the setup, monitoring, and maintenance of HPC clusters, storage, and interconnects. Support Linux system administration tasks (RHEL, Rocky), with a focus on stability and … uptime. Help configure and troubleshoot workload managers such as Slurm. Work with senior engineers to monitor performance of key applications and identify opportunities for improvement. Contribute to scripting and automation tasks (Bash, Python) to streamline system operations. Support end-users by responding to tickets, preparing documentation, and guiding researchers on best practices. Learn about parallel computing concepts (MPI, OpenMP … or Python). Exposure to bare metal environments (installing, configuring, and troubleshooting physical servers). Interest in high-performance computing, scientific computing, or distributed systems. Eagerness to learn about workload managers (Slurm or similar). Good problem-solving skills, with the ability to troubleshoot technical issues. Strong communication skills and a collaborative mindset. This role is ideal for More ❯
Posted:

Junior HPC Engineer

oxford district, south east england, united kingdom
Arcus Search
and engineering. You’ll gain exposure to technologies that power large-scale modeling, including FEA, and data-driven research, and develop your skills across Linux systems, compute clusters, and workload management tools. Responsibilities: Assist in the setup, monitoring, and maintenance of HPC clusters, storage, and interconnects. Support Linux system administration tasks (RHEL, Rocky), with a focus on stability and … uptime. Help configure and troubleshoot workload managers such as Slurm. Work with senior engineers to monitor performance of key applications and identify opportunities for improvement. Contribute to scripting and automation tasks (Bash, Python) to streamline system operations. Support end-users by responding to tickets, preparing documentation, and guiding researchers on best practices. Learn about parallel computing concepts (MPI, OpenMP … or Python). Exposure to bare metal environments (installing, configuring, and troubleshooting physical servers). Interest in high-performance computing, scientific computing, or distributed systems. Eagerness to learn about workload managers (Slurm or similar). Good problem-solving skills, with the ability to troubleshoot technical issues. Strong communication skills and a collaborative mindset. This role is ideal for More ❯
Posted:

Unix Systems Specialist

Abingdon, Oxfordshire, South East, United Kingdom
Rullion Limited
customers, including the network infrastructure, security, server, storage, end user compute and device management. Role Overview : The UNIX Systems Specialist reports to the Unix Systems Group lead, Infrastructure Systems Manager (UNIX), and is responsible for design, management and support in the Linux System Administration team, manage the day-to-day running of the UKAEA Linux based IT Systems, HPC …/BPSS level minimum). Desirable o Experience of managing Linux systems at scale. o Experience managing IT projects. o Experience setting up and supporting batch queueing systems (i.e. slurm) o Experience setting up and supporting Nvidia GPU systems o Ability to write well documented code in a high-level language or script (Python/Perl) o Experience in More ❯
Employment Type: Contract
Posted:

Staff Software Engineer

City of London, London, United Kingdom
Motive Group
ll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant More ❯
Posted:

Staff Software Engineer

London Area, United Kingdom
Motive Group
ll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant More ❯
Posted:

Staff Software Engineer

slough, south east england, united kingdom
Motive Group
ll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant More ❯
Posted:

Staff Software Engineer

london, south east england, united kingdom
Motive Group
ll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant More ❯
Posted:

Staff Software Engineer

london (city of london), south east england, united kingdom
Motive Group
ll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant More ❯
Posted:

Linux Systems Administrator

cambridgeshire, east anglia, united kingdom
Hyper Recruitment Solutions
related field 2. Proven industry experience in building, deploying, and maintaining Linux servers (Red Hat/Rocky Linux) 3. A working knowledge and practical experience with batch queuing systems (Slurm) and cloud computing, particularly AWS Key Words: Linux Systems Administrator/Scientific Computing/Red Hat/Rocky Linux/Slurm/AWS/Oracle DBA/IT More ❯
Posted:

Linux Systems Administrator

cambridge, east anglia, united kingdom
Hyper Recruitment Solutions
related field 2. Proven industry experience in building, deploying, and maintaining Linux servers (Red Hat/Rocky Linux) 3. A working knowledge and practical experience with batch queuing systems (Slurm) and cloud computing, particularly AWS Key Words: Linux Systems Administrator/Scientific Computing/Red Hat/Rocky Linux/Slurm/AWS/Oracle DBA/IT More ❯
Posted:

Linux Systems Administrator

Cambridge, Cambridgeshire, UK
Hyper Recruitment Solutions
related field 2. Proven industry experience in building, deploying, and maintaining Linux servers (Red Hat/Rocky Linux) 3. A working knowledge and practical experience with batch queuing systems (Slurm) and cloud computing, particularly AWS Key Words: Linux Systems Administrator/Scientific Computing/Red Hat/Rocky Linux/Slurm/AWS/Oracle DBA/IT More ❯
Employment Type: Full-time
Posted:

Linux Systems Administrator

Cambridgeshire, England, United Kingdom
Hyper Recruitment Solutions Ltd
a related field2. Proven industry experience in building, deploying, and maintaining Linux servers (Red Hat/Rocky Linux)3. A working knowledge and practical experience with batch queuing systems (Slurm) and cloud computing, particularly AWSKey Words: Linux Systems Administrator/Scientific Computing/Red Hat/Rocky Linux/Slurm/AWS/Oracle DBA/IT Security More ❯
Employment Type: Full-Time
Salary: Competitive salary
Posted:

HPC Engineer

United Kingdom, UK
Hybrid / WFH Options
IO Associates
involves designing, managing, and optimising HPC clusters. The successful candidate will work flexibly and collaborate with security-cleared teams. Responsibilities: Manage and maintain HPC clusters, monitoring performance (e.g., Ganglia, Slurm) and troubleshooting hardware/software issues for 24/7 uptime. Optimise job scheduling (e.g., Slurm, Grid Engine, IBM) and tune MPI-based applications for genomic and health More ❯
Employment Type: Full-time
Posted:

Senior Linux HPC Systems Administrator/Engineer

Stevenage, England, United Kingdom
Cognizant
research demands and IT infrastructure. Leverage any scientific computing experience to optimize system performance and manage specialized applications. Assist with management of high-performance compute resources, including experience with Slurm, clustering, and related HPC technologies. Work closely with other technical teams and stakeholders to align IT services with organizational needs. Build and maintain strong stakeholder relationships, communicating complex technical … 9. Proven experience with high-end workstation hardware setups and scientific application support. Demonstrated knowledge of scientific computing and experience in high performance compute environments, including experience with Slurm and clustering, is highly desirable. Strong troubleshooting skills for both hardware and software issues. Desirable Skills: Working knowledge of ServiceNow and its application in incident and service management. Familiarity with More ❯
Posted:

Senior Linux HPC Systems Administrator/Engineer

stevenage, east anglia, united kingdom
Cognizant
research demands and IT infrastructure. Leverage any scientific computing experience to optimize system performance and manage specialized applications. Assist with management of high-performance compute resources, including experience with Slurm, clustering, and related HPC technologies. Work closely with other technical teams and stakeholders to align IT services with organizational needs. Build and maintain strong stakeholder relationships, communicating complex technical … 9. Proven experience with high-end workstation hardware setups and scientific application support. Demonstrated knowledge of scientific computing and experience in high performance compute environments, including experience with Slurm and clustering, is highly desirable. Strong troubleshooting skills for both hardware and software issues. Desirable Skills: Working knowledge of ServiceNow and its application in incident and service management. Familiarity with More ❯
Posted:
Slurm Workload Manager
25th Percentile
£107,500
Median
£115,000
75th Percentile
£126,250
90th Percentile
£128,500