Applied Researcher (Monitoring)

Apollo Research

London

Application deadline: We accept submissions until 16 January 2026. We review applications on a rolling basis and encourage early submissions.

THE OPPORTUNITY

Join our new AGI safety monitoring team and help transform complex AI research into practical tools that reduce risks from AI. As an applied researcher, you'll work closely with our CEO, monitoring engineers and Evals team software engineers to build tools that make AI agent safety accessible at scale. We are building tools that monitor AI coding agents for safety and security failures. You will join a small team and will have significant ability to shape the team & tech, and have the ability to earn responsibility quickly.

You will like this opportunity if you're passionate about using empirical research to make AI systems safer in practice. You enjoy the challenge of translating theoretical AI risks into concrete detection mechanisms. You thrive on rapid iteration and learning from data. You want your research to directly impact real-world AI safety.

KEY RESPONSIBILITIES

Research & Development

- Systematically collect and catalog coding agent failure modes from real-world instances, public examples, research literature, and theoretical predictions

- Design and conduct experiments to test monitor effectiveness across different failure modes and agent behaviors

- Build and maintain evaluation frameworks to measure progress on monitoring capabilities

- Iterate on monitoring approaches based on empirical results, balancing detection accuracy with computational efficiency

- Stay current with research on AI safety, agent failures, and detection methodologies

- Stay current with research into coding security and safety vulnerabilities

Monitor Design & Optimization

- Develop a comprehensive library of monitoring prompts tailored to specific failure modes (e.g., security vulnerabilities, goal misalignment, deceptive behaviors)

- Experiment with different reasoning strategies and output formats to improve monitor reliability

- Design and test hierarchical monitoring architectures and ensemble approaches

- Optimize log pre-processing pipelines to extract relevant signals while minimizing latency and computational costs

- Implement and evaluate different scaffolding approaches for monitors, including chain-of-thought reasoning, structured outputs, and multi-step verification

Future projects (likely not in the first 6 months)

- Fine-tune smaller open-source models to create efficient, specialized monitors for high-volume production environments

- Design and build agentic monitoring systems that autonomously investigate logs to identify both known and novel failure modes

JOB REQUIREMENTS

2+ years of experience conducting empirical research with large language models or AI systems
Strong experience with AI coding agents. For example, having extensively used and compared frontier coding agents. For example, having designed / developed coding agents
Experience with LLM-as-a-judge setups
Experience designing and running experiments, analyzing results, and iterating based on empirical findingse.g. prompting, scaffolding, agent design, fine-tuning, or RL.
Strong Python programming skills
Demonstrated ability to work independently on open-ended research problems

Bonus:

Experience with AI evaluation frameworks, in particular Inspect (though other frameworks are relevant as well)
Familiarity with AI safety concepts, particularly agent-related risks
Familiarity with computer security, e.g. security testing and secure system design
Experience fine-tuning language models or working with smaller open-source models
Previous work building developer tools or monitoring systems
Publications or contributions to AI safety or ML research
Experience with production log systems or production log analysis

We want to emphasize that people who feel they don't fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.

> WHAT YOU'LL ACCOMPLISH IN YOUR FIRST YEAR

Build a comprehensive failure mode database : Systematically collect and categorize 100+ distinct AI agent failure modes across safety and security dimensions, creating the foundation for our monitoring library.
Develop and validate monitoring approaches : Create and empirically test monitoring prompts and strategies for key failure categories, establishing clear metrics for monitor performance and building evaluation frameworks to track progress.
Optimize the monitoring pipeline : Improve log preprocessing and monitor scaffolding to achieve measurable improvements in detection accuracy, false positive rates, and computational efficiency.
Advance monitoring capabilities : Begin work on advanced approaches such as fine-tuned specialized monitors or agentic investigation systems, moving our monitoring from reactive detection toward proactive risk identification.

REPRESENTATIVE PROJECTS

Hierarchical monitoring for coding agent security : Design a multi-layer monitoring system for detecting security vulnerabilities introduced by coding agents. Start by cataloging common security failure modes (e.g., hardcoded credentials, SQL injection vulnerabilities, insecure API calls). Build specialized monitors for each category, then create a hierarchical system where fast, efficient first-pass monitors flag potentially problematic code for deeper investigation by more sophisticated monitors. Validate the system on synthetic test cases and real agent outputs, iterating to optimize the tradeoff between detection rates and false positives while maintaining sub-second latency for most monitoring decisions.

BENEFITS

Salary: 100k - 180k GBP (~135k - 245k USD)
Flexible work hours and schedule
Unlimited vacation
Unlimited sick leave
Lunch, dinner, and snacks are provided for all employees on workdays
Paid work trips, including staff retreats, business trips, and relevant conferences
A yearly $1,000 (USD) professional development budget

LOGISTICS

Start Date: Target of 2-3 months after the first interview
Time Allocation: Full-time
Location: The office is in London, and the building is next to the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis
Work Visas: We can sponsor UK visas

ABOUT THE TEAM

The monitoring team is a new team. Especially early on, you will work closely with Marius Hobbhahn (CEO), Jeremy Neiman (engineer) and others on the monitoring team. You'll also sometimes work with our SWEs, Rusheb Shah, Andrei Matveiakin, Alex Kedrik, and Glen Rodgers to translate our internal tools into externally usable tools. Furthermore you will interact with our researchers, since we intend to be "our own customer" by using our tools internally for our research work. You can find our full team here.

ABOUT APOLLO RESEARCH

The rapid rise in AI capabilities offer tremendous opportunities, but also present significant risks. At Apollo Research, we're primarily concerned with risks from Loss of Control, i.e. risks coming from the model itself rather than e.g. humans misusing the AI. We're particularly concerned with deceptive alignment / scheming, a phenomenon where a model appears to be aligned but is, in fact, misaligned and capable of evading human oversight. We work on the detection of scheming (e.g., building evaluations), the science of scheming (e.g., model organisms), and scheming mitigations (e.g., anti-scheming and control). We closely work with multiple frontier AI companies, e.g. to test their models before deployment or collaborate on scheming mitigations.At Apollo, we aim for a culture that emphasizes truth-seeking, being goal-oriented, giving and receiving constructive feedback, and being friendly and helpful. If you're interested in more details about what it's like working at Apollo, you can find more information here.

Equality Statement : Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.

HOW TO APPLY

How to apply: Please complete the application form with your CV. The provision of a cover letter is neither required nor encouraged. Please also feel free to share links to relevant work samples.

About the interview process: Our multi-stage process includes a screening interview, a take-home test (approx. 3 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews. If you want to prepare for the interviews, we suggest getting familiar with the evaluations framework Inspect, or by building simple monitors for coding agents and running them on your own Claude Code / Cursor / Codex / etc. traffic.

Your Privacy and Fairness in Our Recruitment Process: We are committed to protecting your data, ensuring fairness, and adhering to workplace fairness principles in our recruitment process. To enhance hiring efficiency, we use AI-powered tools to assist with tasks such as resume screening. These tools are designed and deployed in compliance with internationally recognized AI governance frameworks. Your personal data is handled securely and transparently. We adopt a human-centred approach: all resumes are screened by a human and final hiring decisions are made by our team. If you have questions about how your data is processed or wish to report concerns about fairness, please contact us at [email protected].

Posted 2025-12-20

Recommended Jobs

Sales Consultant

coty

London

SALES CONSULTANT LONDON HEATHROW AIRPORT, WDF FULL TIME, 37.5 HOURS OVER THE WEEK Travel Retail is a division of Coty; we are the world leaders in Luxury fragrance and are proud to hold th…

View Details

Posted 2025-12-21

PPA Teacher | Wandsworth

Marchant Recruitment

London

A vibrant primary school in Wandsworth is seeking a creative and reliable PPA Teacher to join the staff team from January 2026. The successful PPA Teacher will plan and deliver high-quality PPA lesso…

View Details

Posted 2025-11-22

Managing Director

Stratford, Greater London

Strategic Leadership: Define and execute a growth strategy aligned with premium positioning in packaging Commercial Growth: Expand market share through new business development, strategic partners…

View Details

Posted 2025-12-20

Bank Catering Assistant

Sanctuary Group

London

Proud to be not-for-profit, at Sanctuary Care we provide high quality care homes where people are looked after with the utmost dignity and respect. At the very heart of everything we do is our miss…

View Details

Posted 2025-12-21

Accounts Payable Supervisor

9-2-3 Jobs Limited

Westminster, Greater London

We are seeking a detail-oriented and proactive Accounts Payable Supervisor to join our busy finance team 5-10 Minutes fro Marylebone Train Station, London. Hyrbid role (3 to 4 days in the office) Full…

View Details

Posted 2025-12-03

Sales Advisor (Bexley)

Wolseley UK Limited

Crayford, Greater London

Salary: Competitive Salary + Bonus + Excellent Benefits Sales Advisor Crayford - Climate Centre So, who are we? We are Climate Centre, part of the Wolseley Group - a leading specialist trade merchant…

View Details

Posted 2025-12-21

Global Mobility Relocation Consultant

JAM Recruitment Ltd

London

Global Mobility / Relocation Consultant - London Package: £Negotiable + Bonus + Benefits Location: North London, work from home flexibility also available Job Type: Global Mobility / Relocati…

View Details

Posted 2025-11-24

Night Veterinary Surgeon

GLG Vets

London

Night Veterinary Surgeon, 7 on 14 off - Central London An exciting opportunity has arisen for an experienced Veterinary Surgeon to join a progressive and friendly team at a prestigious practice lo…

View Details

Posted 2025-11-14

Seamstress / Tailor

Christian Dior

London

Christian Dior Couture is seeking a skilled Seamstress/Tailor in London to provide exceptional tailoring services. The role demands a minimum of 5 years of experience, a passion for luxury fashion, an…

View Details

Posted 2025-10-09

Deputy Manager

Simmons Bars

London

Simmons Bars London ~£33,500 ~ Full time What’s In It For You Basic salary £33,500 Industry leading staff discounts, including HAPPY HOUR ALL NIGHT! Flexible shift patterns Great …

View Details

Posted 2025-12-15