
MMMLU

Multilingual Q&A

MMMLU is a massively multilingual version of MMLU. It measures general knowledge and reasoning across dozens of languages, from high-resource to low-resource ones, to gauge how evenly a model performs worldwide.

Model scores

  • Opus 4.8
  • Opus 4.7: 91.5%
  • GPT-5.5: 83.2%
  • Gemini 3.1 Pro: 92.6%
  • Mythos Preview

Official source: MMMLU dataset (OpenAI)

Related reading