Vals AI private benchmark for understanding long-context credit agreements.
BenchLM mirrors the public Vals AI CorpFin v2 leaderboard captured from vals.ai and updated by Vals on Thu, 18 Jun 2026 20:17:36 GMT. The snapshot preserves overall scores, uncertainty, latency, cost-per-test metadata, and task-level scores where Vals publishes them.
CorpFin v2 is display only on BenchLM. Vals proprietary or Vals-hosted aggregate views are useful context, but BenchLM does not use them as weighted ranking inputs or as a replacement for benchmark-native source records.
Year
2026
Tasks
Credit-agreement understanding tasks
Format
Accuracy score
Difficulty
Professional finance document reasoning
The Vals CorpFin v2 page reports overall, exact-page, max-fitting-context, and shared-max-context task views. BenchLM keeps it display only.
Version
CorpFin v2 2026
Refresh cadence
Quarterly
Staleness state
Current
Question availability
Public benchmark set
BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.
Vals AI private benchmark for understanding long-context credit agreements.
No models have been evaluated on CorpFin v2 yet.
0 AI models are included in BenchLM's mirrored CorpFin v2 snapshot, based on the public leaderboard captured on Thu, 18 Jun 2026 20:17:36 GMT.
For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.
Free. No spam. Unsubscribe anytime.