Gemini 2.5 Pro Benchmark Scores & Performance

Benchmark analysis of Gemini 2.5 Pro by Google across 14 tests.

Creator

Google

Source Type

Proprietary

Reasoning

Non-Reasoning

Context Window

2M

Overall Score

65#33 of 88

Knowledge Benchmarks

MMLU
83
GPQA
83
SuperGPQA
81
OpenBookQA
79

Coding Benchmarks

HumanEval
75

Mathematics Benchmarks

AIME 2023
84
AIME 2024
86
AIME 2025
85
HMMT Feb 2023
80
HMMT Feb 2024
82
HMMT Feb 2025
81
BRUMO 2025
83

Reasoning Benchmarks

SimpleQA
81
MuSR
79

Frequently Asked Questions

How does Gemini 2.5 Pro perform overall in AI benchmarks?

Gemini 2.5 Pro ranks #33 out of 88 models with an overall score of 65. It is created by Google and features a 2M context window.

Is Gemini 2.5 Pro good for knowledge and understanding?

Gemini 2.5 Pro ranks #31 out of 88 models in knowledge and understanding benchmarks with an average score of 81.5. There are stronger options in this category.

Is Gemini 2.5 Pro good for coding and programming?

Gemini 2.5 Pro ranks #33 out of 88 models in coding and programming benchmarks with an average score of 75. There are stronger options in this category.

Is Gemini 2.5 Pro good for mathematics?

Gemini 2.5 Pro ranks #31 out of 88 models in mathematics benchmarks with an average score of 83. There are stronger options in this category.

Is Gemini 2.5 Pro good for reasoning and logic?

Gemini 2.5 Pro ranks #30 out of 88 models in reasoning and logic benchmarks with an average score of 80. There are stronger options in this category.

What is the context window size of Gemini 2.5 Pro?

Gemini 2.5 Pro has a context window of 2M tokens, which determines how much text it can process in a single interaction.