BenchLM recommendation

Best Alibaba Qwen Models in 2026

Data verified July 20, 2026

As of July 20, 2026, the top model in best alibaba qwen models on the BenchLM leaderboard is Qwen3.7 Max with a score of 72.8.

Last verified: July 20, 2026

All Alibaba Qwen models ranked by benchmark performance.

Unless noted otherwise, ranking surfaces on this page use BenchLM's provisional leaderboard lane rather than the stricter sourced-only verified leaderboard.

Bottom line: Alibaba's Qwen series has improved significantly. Qwen3.5 397B (Reasoning) leads with strong math (92) and coding (85). The MoE variants offer efficient alternatives.

Qwen3.7 Max leads this ranking with a score of 72.8, followed by Qwen3.7 Plus (67.2) and Qwen3.6 Plus (65.2). There is meaningful separation between the top models, suggesting genuine performance differences.

The best open-weight option is Qwen3.5-27B (ranked #4 with a score of 60.7). While proprietary models lead, open-weight options are within striking distance for teams willing to trade a few points of performance for full model control.

This ranking is based on provisional overall weighted scores across BenchLM.ai's scoring formula tracked by BenchLM.ai. For detailed model profiles, click any model name below. To compare two specific models head-to-head, use the "vs #" links.

1Closed

Qwen3.7 Max

Alibaba · 1M

72.8BenchAlign v5

2Closed

Qwen3.7 Plus

Alibaba · 1M

67.2BenchAlign v5

3Closed

Qwen3.6 Plus

Alibaba · 1M

65.2BenchAlign v5

Newest Qwen. 1M context window with strong instruction following.

What changed

Qwen3.5 397B (Reasoning) leads Alibaba's lineup — best math (92) and coding (85).

Qwen3.6 Plus newest generation with 1M context and strong instruction following (90).

Qwen3.5-122B-A10B efficient MoE variant — good performance per compute.

How to choose

Best Alibaba model overall?

Qwen3.5 397B (Reasoning) — strongest across categories

Large context window?

Qwen3.6 Plus — 1M context with strong performance

Efficient self-hosting?

Qwen3.5-122B-A10B — MoE architecture

Open-weight alternative?

All Qwen models are open-weight

Full Rankings (20 models)

Qwen3.7 Max

Alibaba·Proprietary·1M

72.8

BenchAlign v5

vs #2

Qwen3.7 Plus

Alibaba·Proprietary·1M

67.2

BenchAlign v5

vs #3

Qwen3.6 Plus

Alibaba·Proprietary·1M

65.2

BenchAlign v5

vs #4

Qwen3.5-27B

Alibaba·Open Weight·262K

60.7

BenchAlign v5

vs #5

Qwen3.5-122B-A10B

Alibaba·Open Weight·262K

60.6

BenchAlign v5

vs #6

Qwen 3.6 Max (preview)

Alibaba·Proprietary·256K

59.7

BenchAlign v5

vs #7

Qwen3.5 397B (Reasoning)

Alibaba·Open Weight·128K

59.5

BenchAlign v5

vs #8

Qwen3 235B 2507 (Reasoning)

Alibaba·Open Weight·128K

BenchAlign v5

vs #9

Qwen3.5 397B

Alibaba·Open Weight·128K

BenchAlign v5

vs #10

Qwen3.5-35B-A3B

Alibaba·Open Weight·262K

BenchAlign v5

vs #11

Qwen3 235B 2507

Alibaba·Open Weight·128K

BenchAlign v5

vs #12

Qwen3.6-27B

Alibaba·Open Weight·262K

53.8

BenchAlign v5

vs #13

Qwen2.5-72B

Alibaba·Open Weight·128K

52.2

BenchAlign v5

vs #14

Qwen3.6-35B-A3B

Alibaba·Open Weight·262K

51.5

BenchAlign v5

vs #15

Qwen2.5-1M

Alibaba·Open Weight·1M

49.9

BenchAlign v5

vs #16

Qwen3 Max

Alibaba·Proprietary·1M

48.2

BenchAlign v5

vs #17

Qwen3.5 Flash

Alibaba·Proprietary·1M

47.7

BenchAlign v5

vs #18

Qwen3.5 Plus

Alibaba·Proprietary·1M

47.2

BenchAlign v5

vs #19

Qwen2.5-VL-32B

Alibaba·Open Weight·32K

39.9

BenchAlign v5

vs #20

Qwen2.5 Coder 32B Instruct

Alibaba·Open Weight·128K

34.7

BenchAlign v5

Key Takeaways

The top model is Qwen3.7 Max by Alibaba with a BenchAlign v5 score of 72.8 and Supported evidence.

The best open-weight model is Qwen3.5-27B at position #4.

20 models are included in this ranking.

Score in Context

What these scores mean

Models are ranked by the same overall BenchLM score used across all leaderboards. Comparing within Alibaba's lineup helps identify which model fits your use case and budget.

Known limitations

This page only shows Alibaba models. Cross-provider comparison requires the overall or category-specific leaderboards. Newer models may have limited benchmark coverage initially.

Explore More

Last updated: July 20, 2026

Choose a model with this week’s evidence

Join 2,000+ readers for ranking moves, pricing changes, and the claims that still need proof.

One email each week. Unsubscribe anytime.