A multilingual software-engineering benchmark for real-world code issue resolution across multiple programming languages.
As of March 2026, MiniMax M2.7 leads the SWE Multilingual leaderboard with 76.5%, followed by MiMo-V2-Flash (71.7%).
MiniMax M2.7 (MiniMax)
MiMo-V2-Flash (Xiaomi)
Year: 2026
Tasks: Multilingual software-engineering tasks
Format: Repository task completion
Difficulty: Professional software engineering
MiniMax reports SWE Multilingual as a coding benchmark focused on multilingual software-engineering tasks beyond single-language Python issue fixing.
MiniMax M2.7: Early Echoes of Self-Evolution
MiniMax M2.7 by MiniMax currently leads with a score of 76.5% on SWE Multilingual.
BenchLM has evaluated two AI models on SWE Multilingual.