Location: London, UK
Remote Policy: Hybrid
Compensation: £65,000 - £145,000
Posted: Apr 13, 2026
Employment Type: Full-time
Seniority: Mid

About the Role

This role focuses on building environments and challenges to benchmark the cyber capabilities of AI systems. The engineer will design cyber ranges, CTF-style tasks, and evaluation infrastructure to measure how well frontier AI models perform on real-world cybersecurity tasks.

Core Responsibilities

  • Evaluation Design & Development (60%): Design cyber ranges and CTF challenges; build agentic scaffolding with tools like packet capture utilities and penetration testing frameworks; design metrics for evaluations
  • Infrastructure Engineering (30%): Ensure evaluation environments are robust and scalable
  • Research & Communication (10%): Write reports and research papers; stay current with cybersecurity research

Essential Requirements

  • Strong Python skills for automation or security tooling
  • Proven experience in at least one red-teaming area: penetration testing, cyber range design, CTF competition/design, automated security testing, or vulnerability research
  • Strong interest in AI safety

Preferred

  • Experience with virtualization technologies (Proxmox VE) and infrastructure-as-code
  • Familiarity with security tools (packet capture, penetration testing frameworks, reverse engineering)
  • Active involvement in the cybersecurity community
  • Experience building automation tools for red-teaming workflows

Benefits

  • 28.97% employer pension contribution
  • 25+ days annual leave, 8 public holidays, 3 volunteering days
  • Generous parental leave
  • Hybrid working with remote flexibility

Apply via the AI Safety Institute.