AI Inference Engineer
- Develop APIs for AI inference that will be used by both internal and external customers
- Benchmark and address bottlenecks throughout our inference stack
- Improve the reliability and observability of our systems and respond to system outages
- Explore novel research and implement LLM inference optimizations
- Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
- Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization)
- Experience with deploying reliable, distributed, real-time model serving at scale
- (Optional) Understanding of GPU architectures or experience with GPU kernel programming using CUDA
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.
Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.
Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents, plus a 401(k) plan.