All benchmarks

GDPval-AA

Knowledge work

GDPval-AA is Artificial Analysis’s independent run of GDPval, which measures model performance on real, economically valuable knowledge-work deliverables across dozens of occupations. Results are reported as an Elo-style rating from pairwise grader preferences, not a percentage.

Model scores

  • Fable 51932
  • Opus 4.81890
  • GPT-5.51769
  • Opus 4.7
  • Gemini 3.1 Pro1314
  • Mythos Preview

Official source: GDPval (OpenAI) — AA-run variant

Related reading