Research Scientist - CAST Propensity
- How can we clearly define when a change in one scenario is the "same" as another change in a different scenario, so we can determine whether they should be expected to have consistent, context-independent effects?
- How can we iterate on our scenario specifications to ensure they do not have bugs and obvious misunderstandings, without jeopardising the statistical validity of the data by overfitting to a particular outcome?
- What kinds of research questions transfer to the behaviour of future, more capable models which haven't yet been developed? What are the propensity analogues of the clear capability trends we've seen in large language models over time?
- A proven ability to identify and operationalise key uncertainties in a research area, and propose and improve on experimental approaches for collecting evidence on these uncertainties,
- Knowledge of and experience in selecting and applying statistical inference methods in order to draw risk-relevant and action-guiding conclusions from experimental evidence,
- Ability to engage critically with existing or proposed research methodology, assessing to what extent such critiques impact the central conclusions of the work, and how a proposal could be adapted to address them,
- Strong enough Python knowledge to get hands-on with developing and iterating on our Inspect tasks (though Inspect itself can be learned on the job),
- A sufficient understanding of transformer architecture and training dynamics to inform interpretations and predictions of their observable behaviour (how output is sampled, the loss function used for pre-training, the differences between pre-training and post-training, what inference-time compute scaling is, etc.) - hands-on experience with MLE tasks like fine-tuning or RL is not required.
- 3+ years of experience in a quantitative research discipline (e.g. as a PhD student or data scientist or researcher) involving experimental design and analysis,
- Experience writing Python code meeting quality standards, e.g. in production environments or in collaboration with others,
- Professional or educational (or significant hobbyist) contact with LLMs and transformer theory.
- Incredibly talented, mission-driven and supportive colleagues.
- Direct influence on how frontier AI is governed and deployed globally.
- Work with the Prime Minister's AI Advisor and leading AI companies.
- Opportunity to shape the first & best-resourced public-interest research team focused on AI security.
- Pre-release access to multiple frontier models and ample compute.
- Extensive operational support so you can focus on research and ship quickly.
- Work with experts across national security, policy, AI research and adjacent sciences.
- If you're talented and driven, you'll own important problems early.
- 5 days off learning and development, annual stipends for learning and development and funding for conferences and external collaborations.
- Freedom to pursue research bets without product pressure.
- Opportunities to publish and collaborate externally.
- Modern central London office (cafes, food court, gym).
- Hybrid working, flexibility for occasional remote work abroad and stipends for work-from-home equipment.
- At least 25 days' annual leave, 8 public holidays, extra team-wide breaks and 3 days off for volunteering.
- Generous paid parental leave (36 weeks of UK statutory leave shared between parents + 3 extra paid weeks + option for additional unpaid time).
- On top of your salary, we contribute 28.97% of your base salary to your pension.
- Discounts and benefits for cycling to work, donations and retail/gyms.
- Level 3: £65,000-£75,000 (Base £35,720 + Technical Allowance £29,280-£39,280)
- Level 4: £85,000-£95,000 (Base £42,495 + Technical Allowance £42,505-£52,505)
- Level 5: £105,000-£115,000 (Base £55,805 + Technical Allowance £49,195-£59,195)
- Level 6: £125,000-£135,000 (Base £68,770 + Technical Allowance £56,230-£66,230)
- Level 7: £145,000 (Base £68,770 + Technical Allowance £76,230)
- Initial assessment
- Initial screening call
- Research interview
- Technical assessment
- Take home test and interview
- Behavioural interview
- Final interview with members of the senior team
Security
Successful candidates must undergo a criminal record check and get baseline personnel security standard (BPSS) clearance before they can be appointed. Additionally, there is a strong preference for eligibility for counter-terrorist check (CTC) clearance. Some roles may require higher levels of clearance, and we will state this by exception in the job advertisement. See our vetting charter here. Nationality requirements We may be able to offer roles to applicant from any nationality or background . As such we encourage you to apply even if you do not meet the standard nationality requirements (opens in a new window). Working for the Civil Service The Civil Service Code (opens in a new window) sets out the standards of behaviour expected of civil servants. We recruit by merit on the basis of fair and open competition, as outlined in the Civil Service Commission's recruitment principles (opens in a new window). The Civil Service embraces diversity and promotes equal opportunities. As such, we run a Disability Confident Scheme (DCS) for candidates with disabilities who meet the minimum selection criteria. The Civil Service also offers a Redeployment Interview Scheme to civil servants who are at risk of redundancy, and who meet the minimum requirements for the advertised vacancy. Diversity and Inclusion The Civil Service is committed to attract, retain and invest in talent wherever it is found. To learn more please see the Civil Service People Plan (opens in a new window) and the Civil Service Diversity and Inclusion Strategy (opens in a new window).
Recommended Jobs
Sales Estimator
Sales Estimator £60,000 - £80,000 + Company Vehicle + Annual Bonus + Remote working + 33 Days' Holiday + Private Healthcare + Life Assurance + Benefits + Training + Career Progression Home based rol…
Insurer Relationship Executive
43635 Insurance Relationship Executive The Opportunity A leading London-based insurance firm is seeking an Insurance Relationship Executive to join its Market Management function. This is a…
Creche Assistant
Job Details Would you like to join Europe's leading premium health and wellness group? Our team members are the ambassadors of our business and the heart of what we do. W e are on the look out …
Supply Teacher
'Good' and 'Outstanding' Secondary schools in Barnet are seeking Supply Teachers for positions that guarantee work everyday for the school academic year. Supply Teacher - Barnet - Secondary …
Primary Supply Teacher
Supply Teacher – Primary School (Barking & Dagenham) – January Start We are currently seeking a dedicated and adaptable supply teacher to join a welcoming primary school in the Barking and Dagenham…
Business Development Manager - Liverpool / Chester
Location Required: Liverpool / Chester About us Since launching in 2018, DNA Payments has become one of the UK’s largest independent, fully integrated omnichannel payments providers. We enable …
Science (Chemistry) Teacher - Barnet Independent School
School Status & Location Sector: Prestigious Independent School Borough: Barnet. Start Date: Permanent, part-time (0.5 FTE) role commencing January 2026. The Opportunity & School Profil…
Drama Position - High-Achieving Independent School in...
A high-achieving independent Mixed School in Enfield, North London, known for its strong Arts provision and outstanding facilities, requires a Drama Teacher for a Permanent, Full-Time role starting J…
Senior IT & Systems Manager
About Us inforcer is a leading provider of innovative solutions in the cybersecurity sector and dedicated to enhancing efficiency, improving security and driving success for our clients. We focus…
UK Head of Sales - Denza Brand
Main location: Uxbridge – London About the role: As Sales Director for DENZA Brand in the UK, you will drive market expansion and sales growth, playing a pivotal role in establishing DENZA a…