Leaderboard · Reasoning & Math

AI Reasoning Models Leaderboard

Ranked comparison of Large Language Models on reasoning, math, and problem-solving. Sourced from official docs, sorted by reasoning score.

Data from models.json Data-driven and neutral
Rank Model Provider Reasoning Rating (0–10) GPQA Diamond MATH
Loading leaderboard rankings...

Methodology

This leaderboard ranks models based on their **Reasoning capability rating (0–10)** from the benchr index, which synthesizes human-evaluation and math/logic performance. We also display **GPQA Diamond** (graduate-level science reasoning) and **MATH** scores for reference.