Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
MiniMax M2.5 wins overall with a score of 59 vs 37 (22 point difference).MiniMax M2.5 wins 4 out of 4 categories.
Llama 4 Maverick
43.8
MiniMax M2.5
70.8
Llama 4 Maverick
38
MiniMax M2.5
65
Llama 4 Maverick
45
MiniMax M2.5
72
Llama 4 Maverick
43
MiniMax M2.5
69
MiniMax M2.5 scores higher overall with 59 vs 37, a difference of 22 points across all benchmarks.
MiniMax M2.5 leads in knowledge tasks with an average score of 70.8 vs 43.8.
MiniMax M2.5 leads in coding with an average score of 65 vs 38.
MiniMax M2.5 leads in math with an average score of 72 vs 45.
MiniMax M2.5 leads in reasoning with an average score of 69 vs 43.