Trends

Release, provider, and benchmark trends

BenchLM trend pages are derived from the same normalized model and benchmark dataset as the leaderboard. This page focuses on release cadence, provider depth, and whether the benchmark mix is staying fresh enough to separate current models.

Recent release momentum

All models

Mar 2024

2 releases

49

Top score

Claude 3 Haiku

May 2024

1 releases

52

Top score

GPT-4o

Jun 2024

1 releases

60

Top score

Claude 3.5 Sonnet

Jul 2024

1 releases

59

Top score

Mistral Large 2

Dec 2024

3 releases

67

Top score

o1

Jan 2025

3 releases

70

Top score

o3-mini

Feb 2025

1 releases

43

Top score

Grok 3 [Beta]

Apr 2025

5 releases

68

Top score

o3

May 2025

1 releases

65

Top score

Claude 4 Sonnet

Jul 2025

1 releases

67

Top score

Grok 4

Aug 2025

3 releases

49

Top score

GPT-OSS 120B

Oct 2025

1 releases

62

Top score

Claude Haiku 4.5

Dec 2025

4 releases

77

Top score

GPT-5.2

Feb 2026

5 releases

83

Top score

Gemini 3.1 Pro

Mar 2026

2 releases

87

Top score

GPT-5.4 Pro

Provider progression snapshot

Benchmark freshness snapshot