Claude 4.1 Opus Thinking vs Gemini 3.1 Flash-Lite

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Gemini 3.1 Flash-Lite wins overall with a score of 53 vs 29 (24 point difference).Gemini 3.1 Flash-Lite wins 4 out of 4 categories.

Knowledge

Gemini 3.1 Flash-Lite

Claude 4.1 Opus Thinking

35.8

Gemini 3.1 Flash-Lite

60.8

38
MMLU
63
37
GPQA
62
35
SuperGPQA
60
33
OpenBookQA
58

Coding

Gemini 3.1 Flash-Lite

Claude 4.1 Opus Thinking

30

Gemini 3.1 Flash-Lite

55

30
HumanEval
55

Mathematics

Gemini 3.1 Flash-Lite

Claude 4.1 Opus Thinking

37

Gemini 3.1 Flash-Lite

62

38
AIME 2023
63
40
AIME 2024
65
39
AIME 2025
64
34
HMMT Feb 2023
59
36
HMMT Feb 2024
61
35
HMMT Feb 2025
60
37
BRUMO 2025
62

Reasoning

Gemini 3.1 Flash-Lite

Claude 4.1 Opus Thinking

35

Gemini 3.1 Flash-Lite

59

36
SimpleQA
60
34
MuSR
58

Frequently Asked Questions

Which is better, Claude 4.1 Opus Thinking or Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite scores higher overall with 53 vs 29, a difference of 24 points across all benchmarks.

Which is better for knowledge tasks, Claude 4.1 Opus Thinking or Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite leads in knowledge tasks with an average score of 60.8 vs 35.8.

Which is better for coding, Claude 4.1 Opus Thinking or Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite leads in coding with an average score of 55 vs 30.

Which is better for math, Claude 4.1 Opus Thinking or Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite leads in math with an average score of 62 vs 37.

Which is better for reasoning, Claude 4.1 Opus Thinking or Gemini 3.1 Flash-Lite?

Gemini 3.1 Flash-Lite leads in reasoning with an average score of 59 vs 35.