CUDA Kernel Optimizer ML Engineer
1) Role Overview
Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization performance profiling and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while maintaining correctness and reproducibility
2) Key Responsibilities
-
Develop tune and benchmark CUDA kernels for tensor and operator workloads.
-
Optimize for occupancy memory coalescing instruction-level parallelism and warp scheduling.
-
Profile and diagnose performance bottlenecks using Nsight Systems Nsight Compute and comparable tools.
-
Report performance metrics analyze speedups and propose architectural improvements.
-
Collaborate asynchronously with PyTorch Operator Specialists to integrate kernels into production frameworks.
-
Produce well-documented reproducible benchmarks and performance write-ups.
3) Ideal Qualifications
-
Deep expertise in CUDA programming GPU architecture and memory optimization.
-
Proven ability to achieve quantifiable performance improvements across hardware generations.
-
Proficiency with mixed precision Tensor Core usage and low-level numerical stability considerations.
-
Familiarity with frameworks like PyTorch TensorFlow or Triton (not required but beneficial).
-
Strong communication skills and independent problem-solving ability.
-
Demonstrated open-source research or performance benchmarking contributions.
4) More About the Opportunity
-
Ideal for independent contractors who thrive in performance-critical systems-level work.
-
Engagements focus on measurable high-impact kernel optimizations and scalability studies.
-
Work is fully remote and asynchronous; deliverables are outcome-driven.
-
Access to shared benchmarking infrastructure and reproducibility tooling via Mercor support resources.
5) Compensation & Contract Terms
-
Typical range: $120$250/hour depending on scope specialization and results achieved. Payments will be based on accepted task output over flat hourly.
-
Structured as a contract-based engagement not an employment relationship.
-
Compensation tied to measurable deliverables or agreed milestones.
-
Confidentiality IP and NDA terms as defined per engagement.
6) Application Process
-
Submit a brief overview of prior CUDA optimization experience profiling results or performance reports.
-
Include links to relevant GitHub repos papers or benchmarks if available.
-
Indicate your hourly rate time availability and preferred engagement length.
-
Selected experts may complete a small paid pilot kernel optimization project
7) About Mercor
-
Mercor connects domain experts with top AI research and technology organizations through project-based contracts.
-
Contractors operate independently with full flexibility over methods timelines and tools.
-
Our mission is to help top engineers and researchers access frontier technical work without rigid employment structures.
Recommended Jobs
Teacher of Mathematics - Enfield Independent School
School Status & Location Sector: Leading Independent School (with Sixth Form). Borough: Enfield (Outer London, England). Start Date: Permanent, full-time role commencing January 2026. T…
EYFS Practitioner | Enfield | January 2026
Are you a committed EYFS Practitioner seeking a rewarding Early Years role from January 2026? Do you want to join an Enfield school that champions high-quality EYFS provision, close family partners…
Hgv Technician
HGV Technician Day shifts | Pay DOE and Qualifications Are you an experienced HGV Mechanic / Technician looking for your next career opportunity? This role could be right for you if you are looking …
Mathematics Teacher - Inner London Opportunity
Teacher of Mathematics - Lambeth, Inner London Pay Zone &##128208; Exceptional Opportunity for a Dedicated Maths Teacher in Lambeth! Are you a highly motivated and engaging Maths Teacher ready …
Bar Staff
Bar Staff – Twickenham Stadium, London Join the action at Twickenham Stadium, the home of unforgettable sporting and entertainment events, and be part of the excitement right at the heart of the …
Research Engineer, Autonomous Agents
Snapshot Artificial Intelligence could be one of humanitys most useful inventions. At Google DeepMind were a team of scientists engineers machine learning experts and more working together to adva…
Bank Care Team Leader
Bank Care Team Leader Haven Residential Care Home, 36-38 Wellington Road, Hatch End, Pinner, Middlesex, HA5 4NL £15.18 per hour Hours as and when required Why work for us? We…
Ongoing Locum Rota available in North East London| Rates of £80.00 per hour in London NE
Ongoing Locum Rota available in North East London| Rates of £80.00 per hour Dream Medical are working in conjunction with a purpose built health centre In North East London, and are looking for …
Admin - Westminster
A central London school in Westminster is recruiting an experienced Admin professional to start January 2026. The successful Admin candidate will manage reception and front-of-house duties, maintain …
Operations Executive (Logistics)
THE ROLE & RESPONSIBILITIES Reports to: Supply Operations Manager Key internal relationships: Supply Operations Manager, Head of Operations, Operations Executives Key external relat…