Skip to main content
Providers

Provider leaderboard surfaces

BenchLM groups canonical model families by creator so you can compare labs, not just single SKUs. Each provider page shows provisional-ranked depth, verified-ranked depth, current releases, and top-performing families.

Showing 47 of 47 providers

Anthropic

Top modelClaude Mythos 5
Avg. top 3 score95.7
Provisional-ranked13
Verified-ranked5
Current releases5

OpenAI

Top modelGPT-5.4
Avg. top 3 score87
Provisional-ranked15
Verified-ranked2
Current releases3

Google

Top modelGemini 3.1 Pro
Avg. top 3 score85
Provisional-ranked10
Verified-ranked1
Current releases4

Alibaba

Top modelQwen3.7 Max
Avg. top 3 score82.3
Provisional-ranked12
Verified-ranked9
Current releases12

Z.AI

Top modelGLM-5.2
Avg. top 3 score77.7
Provisional-ranked6
Verified-ranked3
Current releases1

xAI

Top modelGrok 4.1
Avg. top 3 score76
Provisional-ranked6
Verified-ranked0
Current releases3

MiniMax

Top modelMiniMax M3
Avg. top 3 score65.5
Provisional-ranked2
Verified-ranked1
Current releases1

DeepSeek

Top modelDeepSeek V4 Pro (Max)
Avg. top 3 score65
Provisional-ranked8
Verified-ranked1
Current releases1

Moonshot AI

Top modelKimi K2.6
Avg. top 3 score61.7
Provisional-ranked4
Verified-ranked2
Current releases1

Xiaomi

Top modelMiMo-V2-Flash
Avg. top 3 score59
Provisional-ranked1
Verified-ranked0
Current releases2

Microsoft

Top modelMAI-Thinking-1
Avg. top 3 score46.5
Provisional-ranked2
Verified-ranked1
Current releases1

NVIDIA

Top modelNemotron 3 Ultra
Avg. top 3 score45
Provisional-ranked5
Verified-ranked1
Current releases6

Sarvam

Top modelSarvam 105B
Avg. top 3 score39
Provisional-ranked1
Verified-ranked0
Current releases2

Mistral

Top modelMistral Large 3
Avg. top 3 score36.3
Provisional-ranked5
Verified-ranked0
Current releases4

OpenBMB

Top modelMiniCPM5-1B
Avg. top 3 score34
Provisional-ranked1
Verified-ranked1
Current releases1

Databricks

Top modelDBRX Instruct
Avg. top 3 score32
Provisional-ranked1
Verified-ranked0
Current releases0

Meta

Top modelLlama 3.1 405B
Avg. top 3 score30.3
Provisional-ranked5
Verified-ranked0
Current releases4

Z

Top modelZ-1
Avg. top 3 score24
Provisional-ranked1
Verified-ranked0
Current releases0

Amazon

Top modelNova Pro
Avg. top 3 score10
Provisional-ranked1
Verified-ranked0
Current releases0

H Company

Top modelHolo3-122B-A10B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Cursor

Top modelComposer 2.5
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Interfaze

Top modelInterfaze Beta
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

StepFun

Top modelStep 3.7 Flash
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

LG AI Research

Top modelExaone 4.0 32B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Tencent

Top modelHy3 Preview
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Zyphra

Top modelZAYA1-8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases2

Poolside

Top modelLaguna M.1
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Prism ML

Top modelTernary Bonsai 8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases2

LiquidAI

Top modelLFM2.5-8B-A1B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases7

Cohere

Top modelCommand A+
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

InclusionAI

Top modelLing 2.6 Flash
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

JetBrains

Top modelMellum2-12B-A2.5B-Instruct
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Arcee AI

Top modelTrinity-Large-Thinking
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Inception

Top modelMercury 2
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

IBM

Top modelGranite-4.0-H-1B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

ByteDance

Top modelSeed 1.6
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

Aion Labs

Top modelAion-2.0
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Upstage

Top modelSolar Pro 2
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Tencent Hunyuan

Top modelHy-MT1.5-1.8B-1.25bit
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases1

SK Telecom

Top modelA.X series
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Community

Top modelDNA 1.0 8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Naver Cloud

Top modelHyperClova X Think 32B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Kakao

Top modelKanana Flag
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

LightOn

Top modelOriOn-Qwen-32B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Aleph Alpha

Top modelPharia-1-LLM-7B-control
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

Academic

Top modelThunder-LLM 8B
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0

NC AI

Top modelVarco
Avg. top 3 scoreN/A
Provisional-ranked0
Verified-ranked0
Current releases0