All benchmarks

Finance Agent v2

Agentic financial analysis

Finance Agent evaluates models on realistic financial-analysis workflows — pulling figures from filings and market data, running calculations and producing well-reasoned analyses under real-world constraints.

Model scores

  • Opus 4.853.9%
  • Opus 4.751.5%
  • GPT-5.551.8%
  • Gemini 3.1 Pro43.0%
  • Mythos Preview

Official source: Finance Agent (Vals AI)

Related reading