Providers

Provider leaderboard surfaces

BenchLM groups canonical model families by creator so you can compare labs, not just single SKUs. Each provider page shows its ranked model depth, current releases, and top-performing families.
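As a rough illustration of how per-provider figures like these could be derived, the sketch below groups models by creator and averages the best three scores. It is not BenchLM's actual method: the function name `summarize_providers`, the input shape, the sample labs and scores, and the rule that unscored models count as unranked are all assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean


def summarize_providers(models):
    """Group (provider, model, score) rows by provider.

    A score of None marks a model that is listed but not ranked
    (assumption: this is what produces an "N/A" average).
    """
    by_provider = defaultdict(list)
    for provider, name, score in models:
        by_provider[provider].append((name, score))

    summary = {}
    for provider, entries in by_provider.items():
        # Keep only scored models and sort best-first.
        ranked = sorted(
            ((n, s) for n, s in entries if s is not None),
            key=lambda e: e[1],
            reverse=True,
        )
        # Average over at most three models; fewer if fewer are ranked.
        top3 = [s for _, s in ranked[:3]]
        summary[provider] = {
            # Assumption: with no ranked model, fall back to the first listed.
            "top_model": ranked[0][0] if ranked else entries[0][0],
            "avg_top3": round(mean(top3), 1) if top3 else None,
            "ranked_models": len(ranked),
        }
    return summary


# Hypothetical data, not taken from the leaderboard.
sample = [
    ("Lab A", "a-large", 80.0),
    ("Lab A", "a-medium", 70.0),
    ("Lab A", "a-small", 60.0),
    ("Lab A", "a-tiny", 50.0),
    ("Lab B", "b-preview", None),
]
result = summarize_providers(sample)
```

Under these assumptions, a provider with one or two ranked models is averaged over just those models, and a provider with none reports no average.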

OpenAI

Top model: GPT-5.4
Avg. top 3 score: 80.3
Ranked models: 9
Current releases: 7

Anthropic

Top model: Claude Opus 4.6
Avg. top 3 score: 73.7
Ranked models: 7
Current releases: 4

Google

Top model: Gemini 3.1 Pro
Avg. top 3 score: 73.5
Ranked models: 2
Current releases: 4

DeepSeek

Top model: DeepSeek Coder 2.0
Avg. top 3 score: 60.7
Ranked models: 5
Current releases: 1

Mistral

Top model: Mistral Large 2
Avg. top 3 score: 59
Ranked models: 1
Current releases: 4

Alibaba

Top model: Qwen2.5-1M
Avg. top 3 score: 58
Ranked models: 3
Current releases: 0

NVIDIA

Top model: Nemotron 3 Ultra 500B
Avg. top 3 score: 57
Ranked models: 4
Current releases: 0

Zhipu AI

Top model: GLM-4.7
Avg. top 3 score: 56.3
Ranked models: 3
Current releases: 0

Moonshot AI

Top model: Kimi K2
Avg. top 3 score: 56
Ranked models: 2
Current releases: 0

xAI

Top model: Grok 4
Avg. top 3 score: 55
Ranked models: 2
Current releases: 2

Meta

Top model: Llama 3.1 405B
Avg. top 3 score: 49.5
Ranked models: 2
Current releases: 0

Z

Top model: Z-1
Avg. top 3 score: 47
Ranked models: 1
Current releases: 0

Microsoft

Top model: Phi-4
Avg. top 3 score: 46
Ranked models: 1
Current releases: 0

Databricks

Top model: DBRX Instruct
Avg. top 3 score: 41
Ranked models: 1
Current releases: 0

Xiaomi

Top model: MiMo-V2-Pro
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 2

LG AI Research

Top model: Exaone 4.0 32B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Cursor

Top model: Composer 2
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 1

MiniMax

Top model: MiniMax M2.7
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

StepFun

Top model: Step 3.5 Flash
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Inception

Top model: Mercury 2
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

ByteDance

Top model: Seed 1.6
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Aion Labs

Top model: Aion-2.0
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Amazon

Top model: Nova Pro
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

LiquidAI

Top model: LFM2-24B-A2B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

SK Telecom

Top model: A.X series
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Community

Top model: DNA 1.0 8B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

H Company

Top model: Holo2-235B-A22B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Naver Cloud

Top model: HyperClova X Think 32B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Kakao

Top model: Kanana Flag
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

LightOn

Top model: OriOn-Qwen-32B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Aleph Alpha

Top model: Pharia-1-LLM-7B-control
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Upstage

Top model: Solar Pro 2
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

Academic

Top model: Thunder-LLM 8B
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0

NC AI

Top model: Varco
Avg. top 3 score: N/A
Ranked models: 0
Current releases: 0