Gemini 2.5 Flash Benchmark Scores & Performance

Benchmark analysis of Gemini 2.5 Flash by Google across 14 tests.

Creator

Google

Source Type

Proprietary

Reasoning

Non-Reasoning

Context Window

1M

Overall Score

41#67 of 88

Knowledge Benchmarks

MMLU
50
GPQA
49
SuperGPQA
47
OpenBookQA
45

Coding Benchmarks

HumanEval
42

Mathematics Benchmarks

AIME 2023
50
AIME 2024
52
AIME 2025
51
HMMT Feb 2023
46
HMMT Feb 2024
48
HMMT Feb 2025
47
BRUMO 2025
49

Reasoning Benchmarks

SimpleQA
48
MuSR
46

Frequently Asked Questions

How does Gemini 2.5 Flash perform overall in AI benchmarks?

Gemini 2.5 Flash ranks #67 out of 88 models with an overall score of 41. It is created by Google and features a 1M context window.

Is Gemini 2.5 Flash good for knowledge and understanding?

Gemini 2.5 Flash ranks #67 out of 88 models in knowledge and understanding benchmarks with an average score of 47.8. There are stronger options in this category.

Is Gemini 2.5 Flash good for coding and programming?

Gemini 2.5 Flash ranks #67 out of 88 models in coding and programming benchmarks with an average score of 42. There are stronger options in this category.

Is Gemini 2.5 Flash good for mathematics?

Gemini 2.5 Flash ranks #67 out of 88 models in mathematics benchmarks with an average score of 49. There are stronger options in this category.

Is Gemini 2.5 Flash good for reasoning and logic?

Gemini 2.5 Flash ranks #67 out of 88 models in reasoning and logic benchmarks with an average score of 47. There are stronger options in this category.

What is the context window size of Gemini 2.5 Flash?

Gemini 2.5 Flash has a context window of 1M tokens, which determines how much text it can process in a single interaction.