Side-by-side benchmark comparison across knowledge, coding, math, and reasoning.
MiniMax M2.5 wins overall with a score of 59 vs 58 (1 point difference).MiniMax M2.5 wins 4 out of 4 categories.
Llama 3.1 405B
68.5
MiniMax M2.5
70.8
Llama 3.1 405B
62
MiniMax M2.5
65
Llama 3.1 405B
69
MiniMax M2.5
72
Llama 3.1 405B
67
MiniMax M2.5
69
MiniMax M2.5 scores higher overall with 59 vs 58, a difference of 1 points across all benchmarks.
MiniMax M2.5 leads in knowledge tasks with an average score of 70.8 vs 68.5.
MiniMax M2.5 leads in coding with an average score of 65 vs 62.
MiniMax M2.5 leads in math with an average score of 72 vs 69.
MiniMax M2.5 leads in reasoning with an average score of 69 vs 67.