MAXIFE is a multilingual instruction-following and understanding benchmark that appears as a row in Qwen's launch comparisons.
BenchLM mirrors the published score view for MAXIFE. Qwen3.7 Max leads the public snapshot at 89.2%. BenchLM does not use these results to rank models overall.
Year: 2026
Tasks: Multilingual instruction following
Format: Cross-lingual benchmark
Difficulty: Advanced multilingual instruction following
MAXIFE appears as a high-level multilingual benchmark intended to capture both instruction compliance and language transfer. BenchLM tracks it as a display-only multilingual signal pending a fuller public benchmark specification.
Version: MAXIFE 2026
Refresh cadence: Quarterly
Staleness state: Current
Question availability: Public benchmark set
BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.
Qwen3.7 Max by Alibaba currently leads with a score of 89.2% on MAXIFE.
1 AI model has been evaluated on MAXIFE on BenchLM.