GPT-OSS 20B vs Kimi K2

Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.

Quick Verdict

Kimi K2 wins overall with a score of 23 vs 22 (1 point difference). Kimi K2 wins 4 out of 4 categories.

Knowledge

Category winner: Kimi K2

Benchmark       GPT-OSS 20B    Kimi K2
Average         28.8           29.8
MMLU            31             32
GPQA            30             31
SuperGPQA       28             29
OpenBookQA      26             27

Coding

Category winner: Kimi K2

Benchmark       GPT-OSS 20B    Kimi K2
Average         23             24
HumanEval       23             24

Mathematics

Category winner: Kimi K2

Benchmark       GPT-OSS 20B    Kimi K2
Average         30             31
AIME 2023       31             32
AIME 2024       33             34
AIME 2025       32             33
HMMT Feb 2023   27             28
HMMT Feb 2024   29             30
HMMT Feb 2025   28             29
BRUMO 2025      30             31

Reasoning

Category winner: Kimi K2

Benchmark       GPT-OSS 20B    Kimi K2
Average         28             29
SimpleQA        29             30
MuSR            27             28
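
The category averages above appear to be plain unweighted means of the per-benchmark scores. That is an assumption, though it matches every published figure (for example, Knowledge comes out to 28.75 vs 29.75, shown as 28.8 vs 29.8). The sketch below recomputes them in Python and counts category wins. The names `SCORES` and `category_averages` are illustrative only, and the overall 23 vs 22 score is not reproduced because the page does not say how it is derived.

```python
from statistics import mean

# Per-benchmark scores copied from the category sections above,
# stored as (GPT-OSS 20B, Kimi K2) pairs.
SCORES = {
    "Knowledge": {
        "MMLU": (31, 32),
        "GPQA": (30, 31),
        "SuperGPQA": (28, 29),
        "OpenBookQA": (26, 27),
    },
    "Coding": {
        "HumanEval": (23, 24),
    },
    "Mathematics": {
        "AIME 2023": (31, 32),
        "AIME 2024": (33, 34),
        "AIME 2025": (32, 33),
        "HMMT Feb 2023": (27, 28),
        "HMMT Feb 2024": (29, 30),
        "HMMT Feb 2025": (28, 29),
        "BRUMO 2025": (30, 31),
    },
    "Reasoning": {
        "SimpleQA": (29, 30),
        "MuSR": (27, 28),
    },
}


def category_averages(scores):
    """Return {category: (gpt_oss_mean, kimi_mean)}.

    Assumption: each category average is the unweighted mean of its benchmarks.
    """
    return {
        category: (
            mean(gpt for gpt, _ in rows.values()),
            mean(kimi for _, kimi in rows.values()),
        )
        for category, rows in scores.items()
    }


averages = category_averages(SCORES)

# Count the categories where Kimi K2's mean is strictly higher.
kimi_wins = sum(1 for gpt, kimi in averages.values() if kimi > gpt)

for category, (gpt, kimi) in averages.items():
    print(f"{category}: GPT-OSS 20B {gpt:.2f} vs Kimi K2 {kimi:.2f}")
print(f"Kimi K2 wins {kimi_wins} of {len(averages)} categories")
```

Kimi K2 is exactly one point ahead on every individual benchmark, so the one-point gap in each category average follows directly from the per-benchmark scores.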

Frequently Asked Questions

Which is better, GPT-OSS 20B or Kimi K2?

Kimi K2 scores higher overall with 23 vs 22, a difference of 1 point across all benchmarks.

Which is better for knowledge tasks, GPT-OSS 20B or Kimi K2?

Kimi K2 leads in knowledge tasks with an average score of 29.8 vs 28.8.

Which is better for coding, GPT-OSS 20B or Kimi K2?

Kimi K2 leads in coding with an average score of 24 vs 23.

Which is better for math, GPT-OSS 20B or Kimi K2?

Kimi K2 leads in math with an average score of 31 vs 30.

Which is better for reasoning, GPT-OSS 20B or Kimi K2?

Kimi K2 leads in reasoning with an average score of 29 vs 28.