AI Inference Engineer

Perplexity AI
London
Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $1B in venture investment from some of the world's most visionary and successful leaders, including Elad Gil, Daniel Gross, Jeff Bezos, Accel, IVP, NEA, NVIDIA, Samsung, and many more. Our objective is to build accurate, trustworthy AI that powers decision-making for people and assistive AI wherever decisions are being made. Throughout human history, change and innovation have always been driven by curious people. Today, curious people use Perplexity to answer more than 780 million queries every month-a number that's growing rapidly for one simple reason: everyone can be curious.

We are looking for an AI Inference engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities
  • Develop APIs for AI inference that will be used by both internal and external customers
  • Benchmark and address bottlenecks throughout our inference stack
  • Improve the reliability and observability of our systems and respond to system outages
  • Explore novel research and implement LLM inference optimizations
Qualifications
  • Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
  • Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization, etc.)
  • Experience with deploying reliable, distributed, real-time model serving at scale
  • (Optional) Understanding of GPU architectures or experience with GPU kernel programming using CUDA
At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine just over a year ago. Our AI-powered search assistant has amassed 10 million monthly active users as of early 2024, with our mobile apps installed over 1 million times across iOS and Android devices. In 2023 alone, we served over 500 million queries from users around the globe.

To support our rapid expansion, we've raised significant funding from some of the most respected investors in technology. In January 2024, we raised $73.6 million in a Series B round led by IVP, with participation from NVIDIA, Jeff Bezos' investment fund, NEA, Databricks, and other prominent firms. We followed that up with a $62.7 million Series B1 round in April 2024 led by Daniel Gross, valuing Perplexity at over $1 billion.
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.

Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.

Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.
Posted 2025-10-15

Recommended Jobs

Financial advisors wanted in Portugal

Prestige IFA Jobs
London

Due to aggressive and ambitious expansion our client is now accepting job applications from financial advisors to work in Portugal. Our client stands as a beacon of sustainability, recognition, an…

View Details
Posted 2025-10-12

Care Assistant

Radfield Home Care
Hornchurch, Greater London

Job Title: Care Assistant Location: Havering & Romford Shifts: Weekday & Weekend: Morning Shifts: 7am - 2:30pm / Evening Shifts: 4pm - 10pm Contract: Full-Time & Part-Time  Salary: £…

View Details
Posted 2025-09-30

Team Leader Care Bank

New Addington, Greater London

Pay rate effective from 1st April 2025 Are you a passionate and caring individual looking for a rewarding career with excellent training and opportunities for development? Join Care UK, a multi awa…

View Details
Posted 2025-10-12

HVAC ENGINEER - LONDON

SRS Recruitment Solutions
Southwark, Greater London

Vacancy No 5378 Vacancy Title HVAC ENGINEER Location: CENTRAL LONDON (INNER M25) PLEASE NOTE: Candidates must be within a comfortable working remotely in the field with minimal supervision, ide…

View Details
Posted 2025-09-11

Graduate SEN Teaching Assistant

KPI Recruiting Ltd
Enfield, Greater London

Graduate SEN Teaching Assistant – Enfield, North London Immediate Start | Full-Time | Up to £600 per Week Fantastic Opportunity for Psychology or Education Graduates Are you a recent graduat…

View Details
Posted 2025-10-12

Real Estate Analyst 2/3 (Living)

Michael Page
London

Analyse RE investment opportunities, providing detailed financial modelling and market research. Lead on development reporting, budgeting, and fund-level analysis across a portfolio of Build-to-Re…

View Details
Posted 2025-09-13

Recruitment Specialist

Stratford, Greater London

As a result, we are seeking applications from experienced Recruitment Professionals to join our Glasgow operation with a view to helping us expand our footprint within the sector whilst also taking a…

View Details
Posted 2025-10-09

Corporate Finance Valuations

Brimstone Consulting
London

Corporate Finance Valuations London, United Kingdom The Role Our client, a leading global advisory firm, is looking for a Valuations Senior Consultant to lead on engagements through to the…

View Details
Posted 2025-10-09

Commercial, Rights & Business Affairs Manager EXTEND TALENT POOL

BBC
London

PACKAGE DESCRIPTION Band: D Salary between: £50,000 -£70,000 per annum based on knowledge and experience. London Weighting Allowance of £5319 also offered. The expected salary range for this ro…

View Details
Posted 2025-09-18