GDPpdf

Knowledge work vision

GDPpdf tests visual document reasoning over the messy artifacts of real knowledge work (PDFs, scanned reports, slide decks, and complex layouts), where the model must read figures, tables, and structure directly from the page. Scores are reported without tools.

Model scores

  • Fable 5: 29.8%
  • Opus 4.8: 22.5%
  • GPT-5.5: 24.9%
  • Opus 4.7
  • Gemini 3.1 Pro: 16.7%
  • Mythos Preview

Official source: Anthropic — Fable 5 / Mythos 5 announcement

Related reading