CUDA Kernel Optimizer ML Engineer
1) Role Overview
Mercor is engaging advanced CUDA experts who specialize in GPU kernel optimization performance profiling and numerical efficiency. These professionals possess a deep mental model of how modern GPU architectures execute deep learning workloads. They are comfortable translating algorithmic concepts into finely tuned kernels that maximize throughput while maintaining correctness and reproducibility
2) Key Responsibilities
-
Develop tune and benchmark CUDA kernels for tensor and operator workloads.
-
Optimize for occupancy memory coalescing instruction-level parallelism and warp scheduling.
-
Profile and diagnose performance bottlenecks using Nsight Systems Nsight Compute and comparable tools.
-
Report performance metrics analyze speedups and propose architectural improvements.
-
Collaborate asynchronously with PyTorch Operator Specialists to integrate kernels into production frameworks.
-
Produce well-documented reproducible benchmarks and performance write-ups.
3) Ideal Qualifications
-
Deep expertise in CUDA programming GPU architecture and memory optimization.
-
Proven ability to achieve quantifiable performance improvements across hardware generations.
-
Proficiency with mixed precision Tensor Core usage and low-level numerical stability considerations.
-
Familiarity with frameworks like PyTorch TensorFlow or Triton (not required but beneficial).
-
Strong communication skills and independent problem-solving ability.
-
Demonstrated open-source research or performance benchmarking contributions.
4) More About the Opportunity
-
Ideal for independent contractors who thrive in performance-critical systems-level work.
-
Engagements focus on measurable high-impact kernel optimizations and scalability studies.
-
Work is fully remote and asynchronous; deliverables are outcome-driven.
-
Access to shared benchmarking infrastructure and reproducibility tooling via Mercor support resources.
5) Compensation & Contract Terms
-
Typical range: $120$250/hour depending on scope specialization and results achieved. Payments will be based on accepted task output over flat hourly.
-
Structured as a contract-based engagement not an employment relationship.
-
Compensation tied to measurable deliverables or agreed milestones.
-
Confidentiality IP and NDA terms as defined per engagement.
6) Application Process
-
Submit a brief overview of prior CUDA optimization experience profiling results or performance reports.
-
Include links to relevant GitHub repos papers or benchmarks if available.
-
Indicate your hourly rate time availability and preferred engagement length.
-
Selected experts may complete a small paid pilot kernel optimization project
7) About Mercor
-
Mercor connects domain experts with top AI research and technology organizations through project-based contracts.
-
Contractors operate independently with full flexibility over methods timelines and tools.
-
Our mission is to help top engineers and researchers access frontier technical work without rigid employment structures.
Recommended Jobs
Senior Java Developer
At U.S. Bank, we’re on a journey to do our best. Helping the customers and businesses we serve to make better and smarter financial decisions, enabling the communities we support to grow and succeed i…
Operations Performance Data Analyst
About IAG Cargo Looking for a challenge in one of the world’s largest airfreight logistics organisations? At IAG Cargo we are in the business of moving things. From antibiotics to rhinoceros, go…
Affiliate Account Manager
About Vervaunt Vervaunt is a London-based eCommerce and paid media consultancy agency, focused on driving growth for aspirational retail brands. Our team has worked with some amazing brands, inclu…
Year 4 Teacher — Good School — Richmond
Are you an ambitious Year 4 Teacher ready to take a Full-Time post with a January 2026 start? A forward-looking Good primary in Richmond is recruiting a creative Year 4 Teacher to join its KS2 team f…
HR Apprentice
HR Apprentice Length: 24 months FTC Start date: March 2026 Location: London Course: Level 5 CIPD Your Team: HR is a highly motivated and hardworking team that puts people at the core o…
Nursery Practitioner in Canning Town
Are you a Nursery Practitioner looking to make a difference to your community? This diverse nursery in the London Borough of Newham, specifically in Canning Town, is now looking for a Nursery Practi…
Art & DT Technician - Creative and Technical Support -...
Art & DT Technician – Provide Essential Technical and Practical Support for a Thriving Creative Arts Faculty – Newham An ambitious and structured secondary school in Newham requires a highly…
Office Manager - Media Organisation
We are looking for an organised and proactive Office Manager to support the day-to-day operations of a leading national media organisation. This role sits at the heart of a fast-paced, 24-hour editor…
Computer Science Teacher - Ofsted Good Grammar School,...
A respected Grammar School in Watford , rated Good by Ofsted , is looking to appoint a talented Computer Science Teacher to teach KS3–KS5 from January 2026 . The role offers the opportunity t…
Staff Nurse-Outpatients
Staff Nurse - Outpatients London: London Bridge Satellite sites ( 31 & 120 Old Broad Street Canary Wharf ) Full Time 37.5 hours per week 7.5 hour shifts between 08:00 and 20:00 plus occa…