Senior ML Systems Engineer - Simulations

Oriole Networks
London

We are looking for a Senior ML Systems Engineer to build and validate simulation infrastructure for large-scale machine learning systems. This role focuses on modelling the compute and communication behaviour of systems used for ML training and inference, and using simulation to guide architecture, performance optimization, and capacity planning.

The ideal candidate combines strong systems experience with hands-on experience in measurement, benchmarking, and performance analysis of modern ML systems.

What You’ll Do:

  • Build simulation models for compute, memory, interconnect, and communication behavior in ML systems.

  • Develop tools to simulate performance for training and inference workloads.

  • Model distributed execution across accelerators, hosts, and network fabrics, including collectives, synchronization, and communication bottlenecks.

  • Use simulation and analytical modelling to evaluate tradeoffs, identify bottlenecks, and guide system design.

  • Run performance experiments and benchmarks on real ML systems to calibrate and validate simulation models.

  • Analyze end-to-end performance, including throughput, latency, scaling efficiency, utilization, and cost/performance tradeoffs.

  • Partner with hardware/software/Networking/ML teams to align simulation with real workloads and constraints.

  • Create reproducible benchmarking methodologies across models, system configurations, and compare against real system measurements to prove validity.

  • Communicate findings through technical reports and design recommendations.

Qualifications

Required:

  • Master’s, or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field.

  • Strong experience in ML systems, distributed systems, performance engineering, computer architecture, or simulation.

  • Understanding of systems used for machine learning training and inference.

  • Experience analyzing compute, communication, and memory behavior in large-scale ML systems.

  • Hands-on experience with performance benchmarking, profiling, and measurement of ML systems.

  • Experience with distributed training concepts such as data parallelism, tensor/model parallelism, pipeline parallelism, collectives, and synchronization overheads.

  • Proficiency in one of the following Python, C++, or Rust.

  • Strong analytical skills and the ability to connect simulation results to real system behavior.

Preferred:

  • Experience with system performance modelling, network simulation, or architecture evaluation tools. - this background is ideal

  • Familiarity with accelerator-based systems such as GPUs, TPUs, or custom ML hardware.

  • Experience with PyTorch, JAX, TensorFlow, NCCL, XLA, CUDA, or similar tools.

  • Knowledge of interconnect and networking technologies such as InfiniBand, Ethernet/RDMA, NVLink, PCIe, or equivalent.

  • Experience evaluating both training throughput and inference latency/serving efficiency.

  • Background in workload characterization, trace-driven simulation, or model calibration.

  • Ability to work across hardware and software boundaries in a cross-functional environment.

What Success Looks Like:

  • Build simulation models that accurately predict performance trends and inform architectural decisions.

  • Identify compute and communication bottlenecks in ML training and inference systems.

  • Correlate simulation outputs with real-world benchmark data.

  • Improve system efficiency, scalability, and cost effectiveness through data-driven insights.

Posted 2026-05-15

Recommended Jobs

Duty Manager - London Marriott Hotel County Hall

Marriott
Westminster, Greater London

LONDON MARRIOTT COUNTY HALL Embrace history and luxury at London Marriott Hotel County Hall, located in bustling South Bank, steps away from Westminster Bridge. Occupying London’s former City Ha…

View Details
Posted 2026-05-12

Early Years Nursery Practitioner in Barking

Ethos Education
Barking, Greater London

Are you a dedicated Early Years Nursery Practitioner looking for your next exciting opportunity ?    We are looking for a full time, dedicated Nursery Practitioner to join the team at this wonderf…

View Details
Posted 2025-09-11

EYFS Room Leader

Smart Teachers
Hammersmith, Greater London

A well-established private nursery in the Hammersmith & City area is seeking an experienced and enthusiastic Room Leader to lead their 2–3 year old room on a full-time basis. About the role Roo…

View Details
Posted 2026-05-15

Interventions Teacher | Southwark

Marchant Recruitment
London

Are you an experienced Interventions Teacher looking to lead targeted support from January 2026? Do you want to work in a Southwark school that prioritises evidence-based catch-up and small-group tea…

View Details
Posted 2025-11-29

Religious Studies Teacher - Independent Mixed School -...

Marchant Recruitment
London

A highly established independent Mixed School in Highgate (Haringey), North London, with a strong focus on philosophy, ethics, and critical thinking, is seeking a meticulous Religious Studies Teacher…

View Details
Posted 2025-10-16

Site Assistant - Wembley

Marchant Recruitment
Harrow, Greater London

A high-performing secondary academy in Wembley is seeking a reliable and proactive Site Assistant to join their facilities team ASAP . This is a full-time, permanent role supporting the safe, secur…

View Details
Posted 2026-02-28

School Business Manager | Havering High Performing Academy

Marchant Recruitment
Havering, Greater London

We are recruiting for a talented School Business Manager for a high-performing academy in Upminster. This position starts in April 2026 and offers a brilliant opportunity for a professional to join a…

View Details
Posted 2026-03-13

IT Technician - Secondary School - London

Marchant Recruitment
London

A well-regarded secondary school in London is seeking a knowledgeable and proactive IT Technician to support its IT services. About the Role: This role involves providing first-line IT supp…

View Details
Posted 2026-01-09

SEN Teaching Assistant - Inclusive Primary School in...

Marchant Recruitment
London

A vibrant and forward-thinking primary school in Islington is seeking to appoint a Special Educational Needs Teaching Assistant to support pupils with a range of additional needs across Key Stage…

View Details
Posted 2026-01-10

Employment Tax Senior Manager

Brewer Morris
London

Job Details The Opportunity Join a market-leading tax advisory team that's shaping the future of employment tax. This is your chance to work on complex, multi-country projects in a dynamic, col…

View Details
Posted 2026-05-12