Research Scientist, Systems and Infrastructure (PhD) We build systems that allow billions of people all over the world to connect and communicate using whatever devices they have available. Our researchers and engineers are constant innovators as they design and build scalable, fast, reliable, and efficient systems. Moreover, fast … evolving social apps and highly dynamic social workloads present many unique research opportunities. From distributedsystems, to data centers, hardware, storage, mobile and beyond, the entire Meta platform is our lab for research, development, and innovation. As a Research Scientist at Meta, you will help build the systems … people around the globe. You will have a keen interest in relevant engineering fields, including (but not limited to) machine learning and artificial intelligence, distributed software systems, storage systems, data warehousing and analytics, database systems, operating systems, networking systems, programming languages, compilers & runtime systemsMore ❯
feature of Dremio's query engine that combines the power of materialized views with sophisticated automatic query rewrites through deep integration in Dremio's distributed query optimizer. You will grow as a leader through mentoring, collaborating with other developers and taking ownership of complex issues to deliver high quality … distributedsystems at a massive scale. What you'll be doing Own design, implementation, testing, and support of next-generation features related to Dremio's Query Planner and Reflections technologies Work with open source projects like Apache Calcite and Apache Iceberg Use modular design patterns to deliver an … level software Fluency in Java/C++ Strong foundation in data structures, algorithms, multi-threaded and asynchronous programming models, and their use in developing distributed and scalable systems Strong database fundamentals including SQL, performance, and schema design is a big plus Experience in developing complex and scalable distributedMore ❯
Job Title: Staff Software Engineer (DistributedSystems) Location: US (EST)/UK (Remote) We are working with a fast-growing and award-winning FinTech company that has recently achieved unicorn status with a valuation exceeding $1 billion. They are transforming the fixed-income eTrading space, working with top … top engineering talent. The Role: Our client is looking for a Staff Software Engineer to take the lead in designing and developing mission-critical distributedsystems that power one of the largest corporate bond e-trading platforms. This role will involve solving complex technical challenges in distributed … role where you will influence technical strategy, mentor a high-performing engineering team, and deliver scalable and resilient solutions. Key Responsibilities: Design and Architect Systems: Lead the development of scalable, fault-tolerant, and low-latency distributedsystems to support trading workflows. Drive High-Impact Projects: Own and More ❯
Senior Software Engineer | DistributedSystems | Sustainable Cloud Native Platform 💸Up to £100,000 per annum plus bonus plus stock 🏠Central London office 🖱️Golang, Rust, C, Kubernetes, Linux, Cloud Native, Distributed systems... Are you a Senior Software Engineer with extensive knowledge of building distributedsystems? Looking … looking for in a Senior Software Engineer: Tech agnostic but preferable experience working with Golang, Rust or C/C++. In-depth knowledge of distributedsystems, building solutions at scale where performance, reliability and availability are key considerations. Experience working on software infrastructure or building Cloud native software … using Kubernetes Understanding of systems design and Open Source components. If you have knowledge of building virtual machines - Virtual servers, virtual networking or network programming - that's a plus... but definitely not needed! If this sounds like you, feel free to apply and I can run through more details More ❯
Lead Software Engineer | DistributedSystems | Sustainable Cloud Native Platform 💸Up to £130,000 per annum plus bonus plus stock 🏠Central London office 🖱️Golang, Rust, C, Kubernetes, Linux, Cloud Native, Distributed systems... Are you a Lead Software Engineer with extensive knowledge of building distributedsystems? Looking … a Lead Software Engineer: Tech agnostic but preferable experience working with Golang, Rust, Java and/or C/C++. In-depth knowledge of distributedsystems, building solutions at scale where performance, reliability and availability are key considerations. Confident in leading the definition and creation of solutions from … solving ambiguous and complex challenges. Experience working on software infrastructure or building Cloud native software using Kubernetes. Understanding of systems design and Open Source components. If you have knowledge of building virtual machines - Virtual servers, virtual networking or network programming - that's a plus... but definitely not needed! If More ❯
Job Title: Software Engineer (DistributedSystems) Client: Elite Technology Firm Experience Level: 5+ years' Salary: Up to £180k + Bonus Location: London (Hybrid) An elite tech firm are searching for Software Developers with experience in DistributedSystems to join a group of the highest-regarded talent … environment - it's allowed them to attract the best talent worldwide Skills/Experience Strong knowledge of Python (background in JVM preferred) Knowledgeable in distributedsystems or infrastructure (Kubernetes experience etc) STEM degree - strong computer science fundamentals (Preferred More ❯
System Software Engineer (GPU, HPC) We are looking for system engineers with a passion for distributedsystems, HPC, operating systems and cloud-native applications to join our client in Edinburgh and work on cutting edge research and development in the field of systems infrastructure and heterogeneous … hardware. You will be involved in analysis of new and emerging technology trends in systems infrastructure related technologies, and the design and implementation of key technologies and related algorithms. The successful candidate should demonstrate some or all of the following: PhD/MSc in Computer Science or related discipline. … In-depth knowledge of distributedsystems, HPC, GPU, compilers, operating systems or cloud computing. Solid system programming skills in C/C++, Go, Python, Rust etc. System level & low-level development experience either through industry work or academic papers. Key Words: System Engineer/Software Engineer/ More ❯
Dropbox, and Ginkgo Bioworks. The Role As a Senior Software Engineer focused on full-stack product development, you'll build the critical interfaces and systems that enable scientists to work with our cell manufacturing platform. You'll create intuitive, reliable software that abstracts away complexity while giving users powerful … tools to customize and monitor cell growth protocols. Any experience with distributedsystems will translate well to our environment, where you'll find familiar concepts like asynchronous communication and managing state consistency across components - with the added excitement of components including physical hardware! Your work will directly impact … our frontend applications using TypeScript, React, and GraphQL to create intuitive interfaces for complex biological workflows. Build backend services that communicate with our hardware systems and ensure data integrity throughout the cell manufacturing process. If you've worked on microservices you already understand many of the challenges we face More ❯
C++ Software Engineer - Trading Systems (JS-272) Location: Central London, England Sector: IT Salary: £95,000.00 to £115,000.00 per annum Benefits: Bens + Bonus C++ Software Engineer required with 1-3 years of C++ and Python experience, combined with a strong academic background and skills in distributedsystems, network programming, C++ template metaprogramming (TMP), low latency, multithreading, and concurrency. This role is offered by a world-leading boutique quantitative investment company to join a small, highly skilled international engineering team building bespoke core systematic trading systems at the forefront of systematic trading. This is … graduate/associate-level engineer to join a world-leading boutique quantitative investment company and work in a collaborative team on cutting-edge trading systems with entrepreneurial and problem-solving mindsets. The team is based in central London (hybrid role: 4 days per week in the office). Key More ❯
organization's needs with the obvious benefit of pay-what-you-use model. If you are an engineer and interested in building hyper-scale distributed database services in the cloud, have direct and immediate impact on hundreds of thousands of users who are using AWS database services, come and … PostgreSQL Limitless team is looking for an engineer to work and collaborate with other engineers, database engineers (DBEs), stakeholders and build the high quality distributed databases for cloud. An ideal candidate for this position will be an engineer who has extensive experience working on designs and implementation of highly … available communication layer components and protocols, experience with building scalable components in distributedsystems, and knowledge of distributed databases. BASIC QUALIFICATIONS 7+ years of non-internship professional software development experience 7+ years of programming with at least one software programming language experience 5+ years of leading design More ❯
Lead Software Engineer Sustainable Cloud Native Platform️ DistributedSystems Up to £120,000 per annum plus 20% bonus plus stock London office ️Golang, Rust, C, Kubernetes, Linux, Cloud Native, Distributedsystems Are you a Lead Software Engineer with considerable cloud/infrastructure experience? Looking to join … experience working with Golang, Rust or C/C++ Experience working on software infrastructure or building cloud native software Experience working with high scale, distributedsystems Understanding of systems design and how different components interact If this sounds like you, feel free to apply and I can More ❯
Software Engineer, DistributedSystems (Go and/or Rust) Hybrid About Us At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging … members to help build high-growth products, help scale our expanding network, build tools to improve availability, performance, uptime, and response times, and make systems failure-resistant and ready-to-scale. As a DistributedSystems Engineer at Cloudflare, you'll work on the code that runs close … to the end user, runs at a massive scale, and is deployed to 250+ cities around the world. Examples of software systems at Cloudflare include: A TLS/HTTP proxy to make the connection from eyeball to edge as fast and secure as possible Filtering malicious traffic and handling More ❯
and help us make the world a better place. As an SRE on our team, you'll be responsible for architecting, optimizing, and scaling distributed storage and analytics systems. You'll collaborate closely with development teams to help them grasp the broader picture of distributedsystems, beyond … We firmly believe in ownership, with software engineers accountable for the code they write. Description The Apple Services Engineering (ASE) organization builds and provides systems and infrastructure that fuel Apple's services (such as iCloud, iTunes, Siri, and Maps). At ASE, we are building and scaling high-performance … resilient, and efficient storage and analytics platforms that power critical insights across the company. Our team sits at the heart of distributedsystems, big data, and large-scale infrastructure, ensuring that petabyte-scale workloads run smoothly, efficiently, and reliably. ASE runs the majority of its systems on More ❯
building complex relationship graphs with trillions of nodes and edges excite you? The AWS Resource Explorer team owns and operates highly scalable, highly available distributedsystems that are used by all AWS customers and internal service teams alike. We are pioneers in areas such as identity, access management … resource management, security and providing massively scaled, distributed platform capabilities to all of AWS. We enable our customers to control and secure the most sensitive data on the internet. We are building some of the largest and most complex distributedsystems in the world, and we need … Identity features such as access management and sign on, cryptography, console, builder & developer tools, and even projects like automating all of our contractual billing systems, AWS Platform is always innovating with the customer in mind. The AWS Platform team sustains over 750 million transactions per second. Security is the More ❯
worldwide impact in robotics while solving challenges at the edge of robotics research? Our team in Amazon Robotics builds high-performance, real-time robotic systems that can perceive, learn, and act intelligently alongside humans - at Amazon scale. Our mission is to enable robots to interact safely, efficiently, and fluently … are seeking an experienced Robotics Control Engineer to join our team. In this role, you will leverage your experience and insights gained from delivering systems to help lead us in developing products that are both innovative and scalable. Working with a world-class team of experts in motion planning … Work closely with hardware and software engineering team members to drive scalable, real-time implementations. Partner closely with hardware engineering team members on developing systems from prototyping to production level. Launch, test, and debug our systems both in the lab and in the field. Develop tools and best More ❯
with experienced cross-disciplinary Amazonians to conceive, design, and bring to market innovative products and services. Design and build innovative technologies in a large distributed computing environment and help lead fundamental changes in the industry. Create solutions to run predictions on distributedsystems with exposure to innovative … technologies at incredible scale and speed. Build distributed storage, index, and query systems that are scalable, fault-tolerant, low cost, and easy to manage/use. Work in an agile environment to deliver high quality software. BASIC QUALIFICATIONS Graduated less than 24 months ago or about to complete … fields at time of application. Knowledge of Computer Science fundamentals. Programming experience in C or Java/Rust. Knowledge in databases. Experience with Operating systems and Kernel Internals, Rust programming. PREFERRED QUALIFICATIONS Previous technical internship(s) if applicable. Experience with distributed, multi-tiered systems, algorithms, and relational More ❯
Amazon Selection and Catalog Systems (ASCS) builds the systems that host and run the world's largest e-Commerce products catalog. We power the online buying experience for customers worldwide so they can find, discover, and buy anything they want. Our massively scaled out distributedsystems … millions of updates on the billions of products across physical, digital, and services offerings. To achieve this, we develop the algorithms and build the systems and tools to ensure the safety of the selection and maximize their impact on the Amazon top-line, all the while continuously growing the … driving Amazon's business. You will be responsible for providing Service Level issue support to all ASCS Engineering teams who built tier 1-3 systems and keep their software packages/configurations free from vulnerability, setup and monitor new metrics and alarms for their systems and drive root More ❯
you will collaborate closely with product management and engineering, serving as a key advocate for Confluent's platform. This role demands expertise in Kafka, distributedsystems, and pre-sales engineering, while engaging with cross-functional teams to drive product success and customer outcomes. Ideal candidates may have backgrounds … Customers with Real-Time Architectures Understand customer challenges with traditional Data Warehouses, Data Lakes, and Batch Analytics workflows, and guide them toward real-time, distributed architectures using Kafka, Flink, Kafka Streams, and modern ETL/ELT frameworks. Help customers optimize their data platforms by focusing on early-stage data … and cost efficiency. Provide Technical Expertise Assist customers and sales teams in designing, deploying, and optimizing real-time data streaming platforms, integrating Kafka with distributed processing, and ensuring alignment with business goals. Architect solutions to unify operational and analytical workloads, enabling a data mesh or streaming-first architecture. Partner More ❯
Client's Security Team Overview The client's security team is the central engine driving data and systems security at the company, supporting it and all of its family of apps. The org is responsible for inhibiting malicious actors from compromising our environment, detecting, and responding to them before … teams or conducting Information Security consulting engagements Experience in writing custom scripts in Python/C++ and PHP/Hack Experience building large-scale distributedsystems or similar experience Experience in assessing security deficiencies in first-party/internal information systems and recommending mitigating controls Knowledge of … evaluating systems architectural designs, data-flow diagrams and technical security implementations, particularly in the context of access management in different geographical locations. Experience with developing security reporting and recommendations that are meaningful, defensible and actionable for a variety of audiences Ability to manage competing priorities and simultaneous projects in More ❯
Rate: £550 per day (umbrella) Our client, a forward-thinking organisation, is looking for a passionate and skilled DevOps Engineer to join their dynamic Systems Engineering team. Reporting to the Head of Systems Engineering, you will play a vital role in designing, implementing, and managing both cloud and … on-premise infrastructures, tooling, and systems that support our development teams. Key Responsibilities: Infrastructure Design & Deployment : Architect and deploy scalable, high-performance systems that align with the organisation's long-term operational and business objectives. Performance Monitoring : Continuously monitor, analyse, and enhance the reliability of large-scale distributedsystems to ensure seamless uptime and operational efficiency. Automation Solutions : Develop automation solutions for configuration management, system provisioning, and deployment, significantly streamlining operations and minimising manual processes. Security Compliance : Implement and maintain robust security protocols, ensuring that systems comply with regulatory standards and best practises, including data More ❯
in one or more programming languages. 5 years of experience with data structures or algorithms. 3 years of experience in designing, analyzing, and troubleshooting distributedsystems, and 2 years of experience leading projects and providing technical leadership. Experience in SRE or incident management/response environments. Preferred qualifications … Experience working in computing, distributedsystems, storage, or networking. Experience in telemetry systems, incident and risk management. Expertise in designing, analyzing, and troubleshooting large-scale distributed systems. Ability to debug, optimize code, and to automate routine tasks. Excellent problem-solving approach, with verbal and written communication … skills. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's More ❯
in one or more programming languages. 5 years of experience with data structures or algorithms. 3 years of experience in designing, analyzing, and troubleshooting distributedsystems, and 2 years of experience leading projects and providing technical leadership. Preferred Qualifications: Experience working in computing, distributedsystems, storage … or networking. Expertise in designing, analyzing, and troubleshooting large-scale distributed systems. Ability to debug, optimize code, and to automate routine tasks. Systematic problem-solving approach, coupled with effective verbal and written communication skills. About the Job Site Reliability Engineering (SRE) combines software and systems engineering to build … and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services-both our internally critical and our externally-visible systems-have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally, SREs will keep an ever-watchful eye on More ❯
Join Team Purple. Join IFS. Job Description We are looking for a technical leader who is passionate about building scalable, resilient, and high-performance distributed systems. As the Lead Software Engineer, you will play a pivotal role in the design, development, and deployment of microservices architectures running on Kubernetes … Lead the architecture, deployment, and management of containerized applications using Kubernetes, ensuring they are highly available, scalable, and resilient. System Scalability : Ensure that the systems are designed for horizontal scalability and can handle large volumes of traffic by effectively utilizing Kubernetes and container orchestration tools. Continuous Improvement : Drive the … adoption of best practices and improve existing microservices, systems, and workflows to enhance system performance, reliability, and efficiency. Mentorship & Leadership : Provide leadership and mentorship to engineering teams, helping to develop junior and mid-level engineers by fostering a culture of learning and knowledge sharing. Collaboration : Work closely with product More ❯
Apache Pulsar , Apache Airflow , Temporal , and Apache Flink , sharing knowledge and suggesting improvements. Documentation: Contribute to clear and concise documentation for software, processes, and systems to ensure team alignment and knowledge sharing. Your Qualifications: Experience: Professional experience in Python development or related software engineering roles. Python Proficiency: Strong knowledge … Django , Flask , or FastAPI . Database Management: Solid experience with relational databases like PostgreSQL or MySQL and familiarity with NoSQL databases like Redis . DistributedSystems: A basic understanding of distributedsystems and microservices architecture, as well as cloud-based infrastructure (AWS, GCP, Azure), is a … Tools: Familiarity with containerization ( Docker ) and infrastructure automation tools like Terraform or Ansible . Real-time Data Streaming: Experience with Apache Pulsar or similar systems for real-time messaging and stream processing is a plus. Data Engineering: Experience with Apache Spark , Databricks , or similar big data platforms for processing More ❯
Responsibilities: Collaborative engineering: Work within a larger team to rapidly develop proof-of-concept prototypes to validate research ideas and integrate them into production systems and infrastructure Performance Analysis: Conduct in-depth profiling and tuning of operating systems and large-scale distributedsystems, leveraging heterogeneous hardware … to do the job: Required: Bachelor's or Master's degree in Computer Science or a related technical field. A solid background in operating systems and/or distributedsystems and/or ML systems. Excellent programming skills, master of at least one language, such as C … communication and teamwork skills. Be comfortable with research methodology. Desired: Familiarity with current LLM architectures (e.g. Llama3, DeepSeek V3) Familiarity with production LLM serving systems and inference optimizations (e.g. VLLM) Experience with accelerator programming (e.g. CUDA, Triton) and communication libraries (e.g. NCCL More ❯