Gemini 3 Flash vs GPT-OSS 120B

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Gemini 3 Flash wins overall with a score of 58 vs 42 (16 point difference).Gemini 3 Flash wins 4 out of 4 categories.

Knowledge

Gemini 3 Flash

Gemini 3 Flash

67.8

GPT-OSS 120B

48.8

70
MMLU
51
69
GPQA
50
67
SuperGPQA
48
65
OpenBookQA
46

Coding

Gemini 3 Flash

Gemini 3 Flash

62

GPT-OSS 120B

43

62
HumanEval
43

Mathematics

Gemini 3 Flash

Gemini 3 Flash

69

GPT-OSS 120B

50

70
AIME 2023
51
72
AIME 2024
53
71
AIME 2025
52
66
HMMT Feb 2023
47
68
HMMT Feb 2024
49
67
HMMT Feb 2025
48
69
BRUMO 2025
50

Reasoning

Gemini 3 Flash

Gemini 3 Flash

66

GPT-OSS 120B

48

67
SimpleQA
49
65
MuSR
47

Frequently Asked Questions

Which is better, Gemini 3 Flash or GPT-OSS 120B?

Gemini 3 Flash scores higher overall with 58 vs 42, a difference of 16 points across all benchmarks.

Which is better for knowledge tasks, Gemini 3 Flash or GPT-OSS 120B?

Gemini 3 Flash leads in knowledge tasks with an average score of 67.8 vs 48.8.

Which is better for coding, Gemini 3 Flash or GPT-OSS 120B?

Gemini 3 Flash leads in coding with an average score of 62 vs 43.

Which is better for math, Gemini 3 Flash or GPT-OSS 120B?

Gemini 3 Flash leads in math with an average score of 69 vs 50.

Which is better for reasoning, Gemini 3 Flash or GPT-OSS 120B?

Gemini 3 Flash leads in reasoning with an average score of 66 vs 48.