Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
MiniMax M2.5 wins overall with a score of 59 vs 38 (21 point difference).MiniMax M2.5 wins 4 out of 4 categories.
Llama 4 Scout
44.8
MiniMax M2.5
70.8
Llama 4 Scout
39
MiniMax M2.5
65
Llama 4 Scout
46
MiniMax M2.5
72
Llama 4 Scout
44
MiniMax M2.5
69
MiniMax M2.5 scores higher overall with 59 vs 38, a difference of 21 points across all benchmarks.
MiniMax M2.5 leads in knowledge tasks with an average score of 70.8 vs 44.8.
MiniMax M2.5 leads in coding with an average score of 65 vs 39.
MiniMax M2.5 leads in math with an average score of 72 vs 46.
MiniMax M2.5 leads in reasoning with an average score of 69 vs 44.