
ExploitBench (Cap%)

Cybersecurity

ExploitBench measures offensive cybersecurity capability: discovering and exploiting software vulnerabilities end to end. Scores are reported as the capability percentage (Cap%) achieved across the exploit suite. The frontier scores on this benchmark are the reason Mythos-class access is gated.

Model scores

  • Fable 5: 78.0%
  • Opus 4.8: 40.0%
  • GPT-5.5: 34.0%
  • Opus 4.7
  • Gemini 3.1 Pro
  • Mythos Preview: 69.0%

Official source: Anthropic — Fable 5 / Mythos 5 announcement
