A repo-level code generation and full-project delivery benchmark spanning web, mobile, and simulation-style implementation tasks.
As of March 2026, MiniMax M2.7 leads the VIBE-Pro leaderboard with 55.6%.
Year
2026
Tasks
Full project delivery tasks
Format
Repository-level implementation benchmark
Difficulty
End-to-end software delivery
MiniMax describes VIBE-Pro as an end-to-end project delivery benchmark that tests whether a model can complete substantial product requirements rather than single-file snippets.
MiniMax M2.7: Early Echoes of Self-EvolutionA repo-level code generation and full-project delivery benchmark spanning web, mobile, and simulation-style implementation tasks.
MiniMax M2.7 by MiniMax currently leads with a score of 55.6% on VIBE-Pro.
1 AI models have been evaluated on VIBE-Pro on BenchLM.
Get notified when new models drop, benchmark scores change, or the leaderboard shifts. One email per week.
Free. No spam. Unsubscribe anytime. We only store derived location metadata for consent routing.