Senior ML Systems Engineer - Simulations

Oriole Networks
London

We are looking for a Senior ML Systems Engineer to build and validate simulation infrastructure for large-scale machine learning systems. This role focuses on modelling the compute and communication behaviour of systems used for ML training and inference, and using simulation to guide architecture, performance optimization, and capacity planning.

The ideal candidate combines strong systems experience with hands-on experience in measurement, benchmarking, and performance analysis of modern ML systems.

What You’ll Do:

  • Build simulation models for compute, memory, interconnect, and communication behavior in ML systems.

  • Develop tools to simulate performance for training and inference workloads.

  • Model distributed execution across accelerators, hosts, and network fabrics, including collectives, synchronization, and communication bottlenecks.

  • Use simulation and analytical modelling to evaluate tradeoffs, identify bottlenecks, and guide system design.

  • Run performance experiments and benchmarks on real ML systems to calibrate and validate simulation models.

  • Analyze end-to-end performance, including throughput, latency, scaling efficiency, utilization, and cost/performance tradeoffs.

  • Partner with hardware/software/Networking/ML teams to align simulation with real workloads and constraints.

  • Create reproducible benchmarking methodologies across models, system configurations, and compare against real system measurements to prove validity.

  • Communicate findings through technical reports and design recommendations.

Qualifications

Required:

  • Master’s, or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field.

  • Strong experience in ML systems, distributed systems, performance engineering, computer architecture, or simulation.

  • Understanding of systems used for machine learning training and inference.

  • Experience analyzing compute, communication, and memory behavior in large-scale ML systems.

  • Hands-on experience with performance benchmarking, profiling, and measurement of ML systems.

  • Experience with distributed training concepts such as data parallelism, tensor/model parallelism, pipeline parallelism, collectives, and synchronization overheads.

  • Proficiency in one of the following Python, C++, or Rust.

  • Strong analytical skills and the ability to connect simulation results to real system behavior.

Preferred:

  • Experience with system performance modelling, network simulation, or architecture evaluation tools. - this background is ideal

  • Familiarity with accelerator-based systems such as GPUs, TPUs, or custom ML hardware.

  • Experience with PyTorch, JAX, TensorFlow, NCCL, XLA, CUDA, or similar tools.

  • Knowledge of interconnect and networking technologies such as InfiniBand, Ethernet/RDMA, NVLink, PCIe, or equivalent.

  • Experience evaluating both training throughput and inference latency/serving efficiency.

  • Background in workload characterization, trace-driven simulation, or model calibration.

  • Ability to work across hardware and software boundaries in a cross-functional environment.

What Success Looks Like:

  • Build simulation models that accurately predict performance trends and inform architectural decisions.

  • Identify compute and communication bottlenecks in ML training and inference systems.

  • Correlate simulation outputs with real-world benchmark data.

  • Improve system efficiency, scalability, and cost effectiveness through data-driven insights.

Posted 2026-05-15

Recommended Jobs

Senior Medical Writer

ApotheCom
London

About ApotheCom ApotheCom is an exciting leader in the field of medical communications. We have been drawn together from several backgrounds, including the pharmaceutical industry, research, marke…

View Details
Posted 2026-06-21

Exams Officer - Outstanding Girls’ School in Sutton

Marchant Recruitment
Sutton, Greater London

Exams Officer – Outstanding Girls’ School in Sutton (January Start) Location: Sutton Start Date: January 2026 Contract Type: Full-time, Permanent Salary: Paid to scale An Outstandin…

View Details
Posted 2025-12-18

Year 4 Teacher — Good School — Merton — January 2026 start

Marchant Recruitment
Merton, Greater London

An ambitious Good primary in Merton is recruiting a reflective Year 4 Teacher to join the KS2 team on a Full-Time basis from January 2026 . The Year 4 Teacher will begin pre-term planning …

View Details
Posted 2025-10-16

Mid Market Account Executive - UKI (f/m/d)

Contentful
London

About the opportunity At Contentful, we are always searching for top candidates to join our global team of Account Executives. We are particularly interested in individuals with experience in the …

View Details
Posted 2026-04-18

Manager, Compensation Consulting

Capital One
London

White Collar Factory (95009), United Kingdom, London, London Manager, Compensation Consulting About this role Here at Capital One, Compensation is an important component of our Total Rew…

View Details
Posted 2026-06-24

Business Analyst / Product Manager - Equities

London

Business Analyst / Product Manager –Equities An exciting and varied role within an established and growing organisation predominantly working as a Business Analyst / Product Manager with some Proj…

View Details
Posted 2026-06-10

Occupational Therapist (Grade I)

Wax Recruitment Ltd
Colindale, Greater London

Job title: Occupational Therapist (Grade I) Job Category: Social Care Qualified Location: 2 Bristol Avenue, Colindale, London, London, NW9 4EW, Barnet Council Hours Per Week :36.00 Pay £21.1…

View Details
Posted 2026-06-10

Procurement Finance Manager

Michael Page
Uxbridge, Greater London

The Senior Commercial Finance Manager - Procurement will be responsible for: Act as finance business partner to Procurement teams Lead analysis of commodity movements, inflation and cost driver…

View Details
Posted 2026-06-03

Disputes & Valuations AD: international firm

HAYS
London

Job Description Work with young partners in a challenger Forensic function and international brand Your new company My client is a national advisory and accounting firm with a strong internati…

View Details
Posted 2026-06-25