Slurm Workload Manager Jobs in the UK

8 of 8 Slurm Workload Manager Jobs in the UK

HPC Engineer

London, United Kingdom
Opus Recruitment Solutions
role Remote £550 Inside ir35 6 Months contract Key Skills needed - Design/implementing Unix/Linux system and services open-source solutions and performance tuning. - HPC technologies: Lustre, Slurm - Configuration systems such as Ansible and Terraform - Unix/Linux scripting. - Networking: TCP/IP, DHCP, VLANs, spanning tree protocol, link aggregation for performance (MTU settings) and reliability requirements. More ❯
Employment Type: Contract
Rate: £500 - £550/day Remote
Posted:

HPC Engineer

London, South East, England, United Kingdom
Opus Recruitment Solutions Ltd
role Remote£550 Inside ir35 6 Months contract Key Skills needed - Design/implementing Unix/Linux system and services open-source solutions and performance tuning.- HPC technologies: Lustre, Slurm- Configuration systems such as Ansible and Terraform- Unix/Linux scripting.- Networking: TCP/IP, DHCP, VLANs, spanning tree protocol, link aggregation for performance (MTU settings) and reliability requirements. More ❯
Employment Type: Contractor
Rate: £500 - £550 per day
Posted:

High-Performance Computing Engineer

oxford district, south east england, united kingdom
Ellison Institute of Technology
Computing Facility, the HPC Engineer will design, deploy, and optimise systems that enable large-scale data processing, AI-driven analytics, and simulation workloads across. For example deploying Kubernetes and Slurm to enable real-time data analysis from instruments, MLOps, or scientific workflow managers. We will be hiring either at the regular or senior level, depending on the applicant's … computational research workloads. Evaluate and integrate advanced technologies including GPU/TPU acceleration, high-speed interconnects, and parallel file systems. Manage HPC environments, including Linux-based clusters, schedulers (e.g., Slurm), and high-performance storage systems (e.g., Lustre, BeeGFS, GPFS). Implement robust monitoring, fault-tolerance, and capacity management for high availability and reliability. Develop automation scripts and tools (Python … or cloud computing) in scientific or research settings. Proficiency in Linux system administration, networking, and parallel computing (MPI, OpenMP, CUDA, or ROCm). Experience with using HPC job schedulers (Slurm preferred) and parallel file systems (Lustre, BeeGFS, GPFS). At the senior level: Extensive experience designing, deploying, and managing HPC clusters (or cloud computing) in scientific or research settings. More ❯
Posted:

Solution Architect - NVIDIA Cluster (End-to-End Design & Validation)

London, United Kingdom
Hybrid/Remote Options
WNTD
to NVIDIA reference architectures (NVAIE, Base Command, DGX SuperPod specs, etc.). Cluster Integration & Validation Define and execute validation test plans for GPU cluster performance, resilience, networking throughput, and workload behaviour. Oversee integration of GPU nodes, networking, and storage systems into the existing datacenter environment. Collaborate with DevOps/Platform teams to validate cluster orchestration (Kubernetes, Slurm, Bright … Cluster Manager, or equivalents). Validate firmware, drivers, NCCL, CUDA libraries, and container environments for production readiness. Deployment & Delivery Oversight Provide technical leadership across the full deployment life cycle. Partner with datacenter operations to ensure correct rack layouts, cabling, airflow and power design. Support delivery teams during build-out phases, ensuring the design is executed correctly. Participate in factory … on understanding of GPU interconnects (NVLink/NVSwitch) and DGX/HGX/SuperPod architectures. Deep knowledge of InfiniBand and high-performance networking architectures. Experience with cluster orchestration: Kubernetes , Slurm, PBS, or similar. Familiarity with AI/ML workload requirements, CUDA, Docker/OCI containers, and NVIDIA software stacks (NCCL, CUDA Toolkit). Comfort with Linux systems engineering More ❯
Employment Type: Contract
Rate: GBP Annual
Posted:

Senior Linux Engineer

Stevenage, Hertfordshire, South East, United Kingdom
Anson Mccade
scripting, particularly Bash, Python, and at least one other language. Clustering: Experience with clustered environments and cluster orchestration tools. Storage: Experience with clustered, parallel file systems (e.g., Lustre). Workload Management: Experience managing batch scheduling systems (PBS Pro, Slurm, SGE/UGE, etc.). HPC Knowledge: Knowledge of HPC management systems (e.g., Bright). Networking/Storage Admin More ❯
Employment Type: Permanent
Salary: £65,000
Posted:

Staff Software Engineer

London, UK
Motive Group
ll help shape the orchestration layer for one of the most advanced AI compute environments in the world. Your work will involve: Designing core platform services for cluster provisioning, workload orchestration, and resource management APIs. Building integrations with schedulers (Kubernetes, Slurm) and container runtimes for reliable, high-performance GPU workloads. Developing automation for deployment, imaging, and multi-tenant More ❯
Posted:

HPC Platform Management Engineer

London, UK
Hybrid/Remote Options
Client Server
hands-on role at a global systematic trading firm with $25 billion under management, earning significant bonuses. As a HPC Platform Management Engineer you'll develop and support scalable workload scheduling solutions for HPC environments using tools such as YellowDog within a large scale computing environment with both on-premise and cloud (AWS) based services. You'll collaborate with … with flexibility to work from home 1-2 days a week. About you: You have experience of engineering and supporting at least one HPC scheduler, such as YellowDog, Ray, Slurm or IBM Symphony You have a deep knowledge of Linux You have a good understanding of both loosely coupled and tightly coupled HPC workloads and experience of working on More ❯
Posted:

Senior Linux HPC Systems Administrator/Engineer

London, UK
Cognizant
computing (HPC) initiatives. The position requires hands-on expertise with high-end workstation hardware and scientific applications, as well as a strong background in HPC techniques, including clustering and workload management with tools like Slurm. The ideal candidate will be proficient in RedHat Enterprise Linux (RHEL 8 & 9) and have experience with scientific and high-performance computing environments, and … research demands and IT infrastructure. Leverage any scientific computing experience to optimize system performance and manage specialized applications. Assist with management of high-performance compute resources, including experience with Slurm, clustering, and related HPC technologies. Collaboration and Stakeholder Management: Work closely with other technical teams and stakeholders to align IT services with organizational needs. Build and maintain strong stakeholder … 9. Proven experience with high-end workstation hardware setups and scientific application support. Demonstrated knowledge of scientific computing and experience in high performance compute environments, including experience with Slurm and clustering, is highly desirable. Strong troubleshooting skills for both hardware and software issues. Interpersonal Skills: Excellent communication skills with a proven ability to engage and build relationships with stakeholders More ❯
Posted:
Slurm Workload Manager
25th Percentile
£107,500
Median
£115,000
75th Percentile
£126,250
90th Percentile
£128,500