Applied Researcher (Monitoring)
- 2+ years of experience conducting empirical research with large language models or AI systems
- Strong experience with AI coding agents, for example having extensively used and compared frontier coding agents, or having designed or developed coding agents
- Experience with LLM-as-a-judge setups
- Experience designing and running experiments, analyzing results, and iterating based on empirical findings (e.g. prompting, scaffolding, agent design, fine-tuning, or RL)
- Strong Python programming skills
- Demonstrated ability to work independently on open-ended research problems
Bonus:
- Experience with AI evaluation frameworks, in particular Inspect (though other frameworks are relevant as well)
- Familiarity with AI safety concepts, particularly agent-related risks
- Familiarity with computer security, e.g. security testing and secure system design
- Experience fine-tuning language models or working with smaller open-source models
- Previous work building developer tools or monitoring systems
- Publications or contributions to AI safety or ML research
- Experience with production log systems or production log analysis
We want to emphasize that people who feel they don't fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.
- Build a comprehensive failure mode database: Systematically collect and categorize 100+ distinct AI agent failure modes across safety and security dimensions, creating the foundation for our monitoring library.
- Develop and validate monitoring approaches: Create and empirically test monitoring prompts and strategies for key failure categories, establishing clear metrics for monitor performance and building evaluation frameworks to track progress.
- Optimize the monitoring pipeline: Improve log preprocessing and monitor scaffolding to achieve measurable improvements in detection accuracy, false positive rates, and computational efficiency.
- Advance monitoring capabilities: Begin work on advanced approaches such as fine-tuned specialized monitors or agentic investigation systems, moving our monitoring from reactive detection toward proactive risk identification.
- Hierarchical monitoring for coding agent security: Design a multi-layer monitoring system for detecting security vulnerabilities introduced by coding agents. Start by cataloging common security failure modes (e.g., hardcoded credentials, SQL injection vulnerabilities, insecure API calls). Build specialized monitors for each category, then create a hierarchical system where fast, efficient first-pass monitors flag potentially problematic code for deeper investigation by more sophisticated monitors. Validate the system on synthetic test cases and real agent outputs, iterating to optimize the tradeoff between detection rates and false positives while maintaining sub-second latency for most monitoring decisions.
- Salary: 100k - 180k GBP (~135k - 245k USD)
- Flexible work hours and schedule
- Unlimited vacation
- Unlimited sick leave
- Lunch, dinner, and snacks are provided for all employees on workdays
- Paid work trips, including staff retreats, business trips, and relevant conferences
- A yearly $1,000 (USD) professional development budget
- Start Date: Target of 2-3 months after the first interview
- Time Allocation: Full-time
- Location: The office is in London, and the building is next to the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis
- Work Visas: We can sponsor UK visas