VWT2k-lite (VWT2k-lite)

A lighter multilingual benchmark slice published in provider tables for broad cross-lingual transfer and understanding.

Top Models on VWT2k-lite — March 2026

As of March 2026, Qwen3.6 Plus leads the VWT2k-lite leaderboard with 84.3% , followed by GLM-5 (82.1%) and Claude Opus 4.5 (79.7%).

5 modelsMultilingualCurrentDisplay onlyUpdated April 2, 2026

According to BenchLM.ai, Qwen3.6 Plus leads the VWT2k-lite benchmark with a score of 84.3%, followed by GLM-5 (82.1%) and Claude Opus 4.5 (79.7%). The scores show moderate spread, with meaningful differences between the top tier and mid-tier models.

5 models have been evaluated on VWT2k-lite. The benchmark falls in the Multilingual category. This category carries a 7% weight in BenchLM.ai's overall scoring system. VWT2k-lite is currently displayed for reference but excluded from the scoring formula, so it does not directly affect overall rankings.

About VWT2k-lite

Year

2026

Tasks

Multilingual transfer tasks

Format

Cross-lingual benchmark

Difficulty

Broad multilingual capability

VWT2k-lite acts as a compact multilingual stress test. BenchLM tracks it separately because providers often publish it as a standalone row without enough public detail to merge it into existing multilingual benchmark families.

Qwen3.6 launch benchmarks

BenchLM freshness & provenance

Version

VWT2k-lite 2026

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

CurrentDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Leaderboard (5 models)

#1Qwen3.6 Plus
84.3%
#2GLM-5
82.1%
#3Claude Opus 4.5
79.7%
#4Qwen3.5 397B
78.9%
#5Kimi K2.5
77.6%

FAQ

What does VWT2k-lite measure?

A lighter multilingual benchmark slice published in provider tables for broad cross-lingual transfer and understanding.

Which model scores highest on VWT2k-lite?

Qwen3.6 Plus by Alibaba currently leads with a score of 84.3% on VWT2k-lite.

How many models are evaluated on VWT2k-lite?

5 AI models have been evaluated on VWT2k-lite on BenchLM.

Last updated: April 2, 2026 · BenchLM version VWT2k-lite 2026

Weekly LLM Benchmark Digest

Get notified when new models drop, benchmark scores change, or the leaderboard shifts. One email per week.

Free. No spam. Unsubscribe anytime. We only store derived location metadata for consent routing.