Gemini 3.1 Flash-Lite Benchmark Scores & Performance

Benchmark analysis of Gemini 3.1 Flash-Lite by Google across 14 tests.

Creator

Google

Source Type

Proprietary

Reasoning

Non-Reasoning

Context Window

1M

Overall Score

53#55 of 88

Knowledge Benchmarks

MMLU
63
GPQA
62
SuperGPQA
60
OpenBookQA
58

Coding Benchmarks

HumanEval
55

Mathematics Benchmarks

AIME 2023
63
AIME 2024
65
AIME 2025
64
HMMT Feb 2023
59
HMMT Feb 2024
61
HMMT Feb 2025
60
BRUMO 2025
62

Reasoning Benchmarks

SimpleQA
60
MuSR
58

Frequently Asked Questions

How does Gemini 3.1 Flash-Lite perform overall in AI benchmarks?

Gemini 3.1 Flash-Lite ranks #55 out of 88 models with an overall score of 53. It is created by Google and features a 1M context window.

Is Gemini 3.1 Flash-Lite good for knowledge and understanding?

Gemini 3.1 Flash-Lite ranks #56 out of 88 models in knowledge and understanding benchmarks with an average score of 60.8. There are stronger options in this category.

Is Gemini 3.1 Flash-Lite good for coding and programming?

Gemini 3.1 Flash-Lite ranks #56 out of 88 models in coding and programming benchmarks with an average score of 55. There are stronger options in this category.

Is Gemini 3.1 Flash-Lite good for mathematics?

Gemini 3.1 Flash-Lite ranks #56 out of 88 models in mathematics benchmarks with an average score of 62. There are stronger options in this category.

Is Gemini 3.1 Flash-Lite good for reasoning and logic?

Gemini 3.1 Flash-Lite ranks #57 out of 88 models in reasoning and logic benchmarks with an average score of 59. There are stronger options in this category.

What is the context window size of Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite has a context window of 1M tokens, which determines how much text it can process in a single interaction.