Applied Researcher (Monitoring)
- 2+ years of experience conducting empirical research with large language models or AI systems
- Strong experience with AI coding agents. For example, having extensively used and compared frontier coding agents. For example, having designed / developed coding agents
- Experience with LLM-as-a-judge setups
- Experience designing and running experiments, analyzing results, and iterating based on empirical findingse.g. prompting, scaffolding, agent design, fine-tuning, or RL.
- Strong Python programming skills
- Demonstrated ability to work independently on open-ended research problems Bonus:
- Experience with AI evaluation frameworks, in particular Inspect (though other frameworks are relevant as well)
- Familiarity with AI safety concepts, particularly agent-related risks
- Familiarity with computer security, e.g. security testing and secure system design
- Experience fine-tuning language models or working with smaller open-source models
- Previous work building developer tools or monitoring systems
- Publications or contributions to AI safety or ML research
- Experience with production log systems or production log analysis We want to emphasize that people who feel they don't fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.
- Build a comprehensive failure mode database : Systematically collect and categorize 100+ distinct AI agent failure modes across safety and security dimensions, creating the foundation for our monitoring library.
- Develop and validate monitoring approaches : Create and empirically test monitoring prompts and strategies for key failure categories, establishing clear metrics for monitor performance and building evaluation frameworks to track progress.
- Optimize the monitoring pipeline : Improve log preprocessing and monitor scaffolding to achieve measurable improvements in detection accuracy, false positive rates, and computational efficiency.
- Advance monitoring capabilities : Begin work on advanced approaches such as fine-tuned specialized monitors or agentic investigation systems, moving our monitoring from reactive detection toward proactive risk identification.
- Hierarchical monitoring for coding agent security : Design a multi-layer monitoring system for detecting security vulnerabilities introduced by coding agents. Start by cataloging common security failure modes (e.g., hardcoded credentials, SQL injection vulnerabilities, insecure API calls). Build specialized monitors for each category, then create a hierarchical system where fast, efficient first-pass monitors flag potentially problematic code for deeper investigation by more sophisticated monitors. Validate the system on synthetic test cases and real agent outputs, iterating to optimize the tradeoff between detection rates and false positives while maintaining sub-second latency for most monitoring decisions.
- Salary: 100k - 180k GBP (~135k - 245k USD)
- Flexible work hours and schedule
- Unlimited vacation
- Unlimited sick leave
- Lunch, dinner, and snacks are provided for all employees on workdays
- Paid work trips, including staff retreats, business trips, and relevant conferences
- A yearly $1,000 (USD) professional development budget
- Start Date: Target of 2-3 months after the first interview
- Time Allocation: Full-time
- Location: The office is in London, and the building is next to the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis
- Work Visas: We can sponsor UK visas
Recommended Jobs
Senior Temporary Works Design Engineer - RC Structures & Groundwork
Senior Temporary Works Design Engineer - RC Structures, Basements, Groundwork, Demolition £50,000 - £90,000 + Benefits London & Home Counties About the Company: This business is without doubt one …
Volunteer Support Manager
Description Location: MSSC NSC 200B Lambeth Road London SE1 7JY (Hybrid Working) Contract: Full time permanent Salary: 40000 to 42000 gross per annum depending on experience Closing Date…
Global Banking & Markets, FICC SMM Quantitative Researcher, Associate / VP, London
FICC Quantitative Researcher, Associate / VP, London We are a team of FICC Quantitative Researchers who work to transform the Fixed Income, Currencies, and Commodities (FICC) business through quan…
Senior Sales Development Representative
Role Overview The Sales Development Representative will be responsible for identifying, engaging, and qualifying prospective law firm clients. This is a critical, high-visibility role ideal for …
Nanny-Housekeeper, 40 hours per week, Job ID J1EC56
This lovely family based in Belsize Park, London, is seeking a Full-time Nanny-Housekeeper to keep their home clean and organised while caring primarily for their toddler and occasionally for their s…
School Administrator | Croydon
A busy primary school in Croydon is recruiting a friendly and organised Administrator to join the office team from January 2026. The Administrator will manage reception duties, handle enquiries, upda…
Senior Maximo Consultant
Senior Maximo Consultant (Security Clearance Required) Location: UK (Hybrid/ Office) Travel: 30% travel is expected This role requires UK Security Clearance (SC). Candidates who current…
Computer Science ECT - Barnet - January start
School Status & Location Sector: Leading Independent School, Outer London. Borough: Barnet. Start Date: Permanent, full-time role commencing January 2026. The Opportunity & School Profi…
Senior Compliance Associate
Were building a relationship-oriented bank for the modern world. We need talented passionate professionals who are dedicated to doing whats right for our clients. At CIBC we embrace your stren…
Part-time Housekeeper, Job ID J1AF51
A wonderful family based in Twickenham is looking for a child-friendly Part-time Housekeeper to help them maintain the cleanliness of their property. An ideal candidate will be someone organised and …