A display-only Liquid AI extraction metric measuring judged agreement between extracted values and the source image.
BenchLM mirrors the published score view for Liquid Extract VLM Judge. LFM2.5-VL-1.6B-Extract leads the public snapshot at 90.6% , followed by LFM2.5-VL-450M-Extract (84.5%). BenchLM does not use these results to rank models overall.
LFM2.5-VL-1.6B-Extract
LiquidAI
LFM2.5-VL-450M-Extract
LiquidAI
Year
2026
Tasks
Image-to-JSON extraction
Format
VLM-judged extraction accuracy
Difficulty
Structured visual extraction
Liquid reports VLM Judge Score using a separate vision model to compare extracted JSON content against the image. BenchLM stores it as a specialized visual extraction quality signal.
Version
Liquid Extract VLM Judge 2026
Refresh cadence
Quarterly
Staleness state
Current
Question availability
Public benchmark set
BenchLM uses freshness metadata to decide whether a benchmark should still be treated as a strong differentiator, a benchmark to watch, or a display-only reference. For the full scoring policy, see the BenchLM methodology page.
A display-only Liquid AI extraction metric measuring judged agreement between extracted values and the source image.
LFM2.5-VL-1.6B-Extract by LiquidAI currently leads with a score of 90.6% on Liquid Extract VLM Judge.
2 AI models have been evaluated on Liquid Extract VLM Judge on BenchLM.
For engineers, researchers, and the plain curious — a weekly brief on new models, ranking shifts, and pricing changes.
Free. No spam. Unsubscribe anytime.