LiveCodeBench Pro

Name: LiveCodeBench Pro
Creator: BenchLM

A harder competitive-programming benchmark family built from Codeforces, ICPC, and IOI problems, with quarter-specific public leaderboards and difficulty-aware reporting.

Benchmark score on LiveCodeBench Pro — July 7, 2026

BenchLM mirrors the published score view for LiveCodeBench Pro. Sakana Fugu-Ultra leads the public snapshot at 90.8% , followed by Sakana Fugu (87.8%) and GPT-5.4 (87.5%). BenchLM does not use these results to rank models overall.

1Closed

Sakana Fugu-Ultra

Sakana AI

90.8%

Overall —Context 1M

2Closed

Sakana Fugu

Sakana AI

87.8%

Overall —Context 1M

3Closed

GPT-5.4

OpenAI

87.5%

Overall 86Context 1.05M

8 modelsCodingCurrentDisplay onlyUpdated July 7, 2026

The published LiveCodeBench Pro snapshot is tightly clustered at the top: Sakana Fugu-Ultra sits at 90.8%, while the third row is only 3.3 points behind. The broader top-10 spread is 68.1 points, so the benchmark still separates strong models even when the leaders cluster.

8 models have been evaluated on LiveCodeBench Pro. The benchmark falls in the Coding category. This category carries a 20% weight in BenchLM.ai's overall scoring system. LiveCodeBench Pro is currently displayed for reference but excluded from the scoring formula, so it does not directly affect overall rankings.

About LiveCodeBench Pro

Year

2025

Tasks

Quarter-specific contest programming sets

Format

Competitive programming

Difficulty

High-end contest programming

LiveCodeBench Pro is distinct from the original LiveCodeBench family. It excludes LeetCode, emphasizes stronger contest difficulty, and the official site publishes quarter-specific leaderboards such as 25Q2 with hard, medium, and easy pass rates.

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

BenchLM freshness & provenance

Version

LiveCodeBench Pro 2025

Refresh cadence

Quarterly

Staleness state

Current

Question availability

Public benchmark set

CurrentDisplay only

BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.

Benchmark score table (8 models)

Sakana Fugu-Ultra

Sakana AIClosed

90.8%

Sakana Fugu

Sakana AIClosed

87.8%

GPT-5.4

OpenAIClosed

87.5%

Gemini 3.1 Pro

GoogleClosed

82.9%

Muse Spark

MetaClosed

80.0%

Grok 4.20

xAIClosed

74.2%

Claude Opus 4.6

AnthropicClosed

70.7%

MiniCPM5-1B

OpenBMBOpen

22.7%

FAQ

What does LiveCodeBench Pro measure?

A harder competitive-programming benchmark family built from Codeforces, ICPC, and IOI problems, with quarter-specific public leaderboards and difficulty-aware reporting.

Which model scores highest on LiveCodeBench Pro?

Sakana Fugu-Ultra by Sakana AI currently leads with a score of 90.8% on LiveCodeBench Pro.

How many models are evaluated on LiveCodeBench Pro?

8 AI models have been evaluated on LiveCodeBench Pro on BenchLM.

Compare Top Models on LiveCodeBench Pro

Sakana Fugu-Ultra vs Sakana Fugu Sakana Fugu vs GPT-5.4 GPT-5.4 vs Gemini 3.1 Pro Gemini 3.1 Pro vs Muse Spark

Last updated: July 7, 2026 · BenchLM version LiveCodeBench Pro 2025

The AI models change fast. We track them for you.

For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.

Free. No spam. Unsubscribe anytime.