Best AI model rankings

Browse BenchLM ranking surfaces by benchmark category, workflow, provider, license, and value.

Core leaderboards

Use cases

Model groups

Best Open Source LLMs

Top open-weight AI models you can download and run locally, ranked by benchmark performance.

Best Proprietary LLMs

Top proprietary/closed-source AI models ranked by benchmark performance.

Best Reasoning AI Models

Top AI models with dedicated reasoning capabilities, ranked by benchmark performance.

Best OpenAI Models

All OpenAI models ranked by benchmark performance — GPT-5, GPT-4o, o1, o3, and more.

Best Anthropic Models

All Anthropic Claude models ranked by benchmark performance.

Best Google AI Models

All Google Gemini and Gemma models ranked by benchmark performance.

Best Meta AI Models

All Meta Llama models ranked by benchmark performance.

Best DeepSeek Models

All DeepSeek models ranked by benchmark performance.

Best AI Models Overall

The top AI models ranked by overall benchmark performance across all categories.

Best Large Context Window LLMs

AI models with the largest context windows (200K+ tokens), ranked by benchmark performance.

Best Chinese AI Models

Top AI models from Chinese labs — DeepSeek, Alibaba Qwen, Zhipu GLM, Moonshot Kimi, and more — ranked by benchmark performance.

European AI Models

European AI models from Mistral, H Company, LightOn, and Aleph Alpha, with ranked models listed first, followed by tracked models that have only sparse benchmark data.

Best Non-Reasoning LLMs

Top standard AI models (no chain-of-thought reasoning), ranked by benchmark performance. These are faster and cheaper than reasoning models.

Best Mistral Models

All Mistral AI models ranked by benchmark performance — Mistral Large, Mixtral, and more.

Best xAI Grok Models

All xAI Grok models ranked by benchmark performance.

Best Alibaba Qwen Models

All Alibaba Qwen models ranked by benchmark performance.

Value rankings