GLM-5V-Turbo Benchmark Scores & Performance

BenchLM is tracking GLM-5V-Turbo by Zhipu AI. The profile is currently excluded from the public leaderboard because it does not yet have enough verified benchmark coverage to rank reliably. Only verified public benchmark rows appear below.

GLM-5V-Turbo is a proprietary model with a 200K-token context window. It answers queries without explicit chain-of-thought reasoning, which generally means faster responses and lower token usage than a reasoning variant.

GLM-5V-Turbo sits inside the GLM-5 family alongside GLM-5, GLM-5 (Reasoning), GLM-5-Turbo, and GLM-5.1. This profile currently has verified scores for 15 of the 83 benchmarks BenchLM tracks. BenchLM only exposes verified benchmark rows publicly, so missing categories stay blank until a sourced evaluation is available.

Its strongest category is Agentic (#41 of 97 models), the category most relevant to coding agents, browser research, and computer-use workflows.

Provider

Zhipu AI

Source Type

Proprietary

Reasoning

Non-Reasoning

Context Window

200K

Model Status

Tracked

Overall Score

Unranked

Pricing

$1.20 / $4.00

Input / output per 1M

Runtime

N/A

Latency unavailable
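
As a rough illustration of the pricing listed above, the sketch below estimates the cost of a single request. It assumes the $1.20 / $4.00 figures are USD per 1M input and output tokens respectively; the function name and the example token counts are hypothetical, and actual billing may differ.

```python
# Listed prices from this profile, assumed to be USD per 1M tokens.
INPUT_USD_PER_MILLION = 1.20
OUTPUT_USD_PER_MILLION = 4.00


def estimate_request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate one request's cost (hypothetical helper, not a BenchLM or Zhipu AI API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 150,000 input tokens and 2,000 output tokens.
    cost = estimate_request_cost_usd(150_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, a long-context request is dominated by its input tokens: the 150,000 input tokens in the example account for $0.18 of the estimate, while the 2,000 output tokens add less than one cent.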

Family & Lineage

Family

GLM-5

Vision-turbo

Canonical Entry

GLM-5

Rankings Overview

BenchLM does not yet have enough verified benchmark coverage to rank this model on the public leaderboard. Only verified public benchmark rows are shown below.

Agentic Benchmarks

BrowseComp · Current
51.9%

BrowseComp 2026 · Quarterly refresh · updated April 1, 2026

OSWorld-Verified · Current
62.3%

OSWorld Verified · Quarterly refresh · updated April 1, 2026

BrowseComp-VL · Current · Display only
51.9%

BrowseComp-VL 2026 · Quarterly refresh · updated April 1, 2026

OSWorld · Current · Display only
62.3%

OSWorld 2026 · Quarterly refresh · updated April 1, 2026

AndroidWorld · Current · Display only
75.7%

AndroidWorld 2026 · Quarterly refresh · updated April 1, 2026

WebVoyager · Current · Display only
88.5%

WebVoyager 2026 · Quarterly refresh · updated April 1, 2026

Multimodal & Grounded Benchmarks

Design2Code · Current · Display only
94.8%

Design2Code 2026 · Quarterly refresh · updated April 1, 2026

Flame-VLM-Code · Current · Display only
93.8%

Flame-VLM-Code 2026 · Quarterly refresh · updated April 1, 2026

Vision2Web · Current · Display only
31.0%

Vision2Web 2026 · Quarterly refresh · updated April 1, 2026

ImageMining · Current · Display only
30.7%

ImageMining 2026 · Quarterly refresh · updated April 1, 2026

MMSearch · Current · Display only
72.9%

MMSearch 2026 · Quarterly refresh · updated April 1, 2026

MMSearch-Plus · Current · Display only
30.0%

MMSearch-Plus 2026 · Quarterly refresh · updated April 1, 2026

SimpleVQA · Current · Display only
78.2%

SimpleVQA 2026 · Quarterly refresh · updated April 1, 2026

Facts-VLM · Current · Display only
58.6%

Facts-VLM 2026 · Quarterly refresh · updated April 1, 2026

V* · Current · Display only
89.0%

V* 2026 · Quarterly refresh · updated April 1, 2026

Frequently Asked Questions

How does GLM-5V-Turbo perform overall in AI benchmarks?

GLM-5V-Turbo has 15 verified benchmark scores on BenchLM, but it does not yet have enough coverage to receive a global overall rank.

Is GLM-5V-Turbo good for agentic tool use and computer tasks?

GLM-5V-Turbo ranks #41 out of 97 models in agentic tool use and computer tasks benchmarks with an average score of 58. There are stronger options in this category.

Is GLM-5V-Turbo good for multimodal and grounded tasks?

GLM-5V-Turbo has visible benchmark coverage in multimodal and grounded tasks, but BenchLM does not currently assign it a global category rank there.

Which sibling models are related to GLM-5V-Turbo?

GLM-5V-Turbo belongs to the GLM-5 family. Related variants on BenchLM include GLM-5, GLM-5 (Reasoning), GLM-5-Turbo, and GLM-5.1.

Does GLM-5V-Turbo have full benchmark coverage on BenchLM?

Not yet. GLM-5V-Turbo currently has 15 verified benchmark scores out of the 83 benchmarks BenchLM tracks. BenchLM only exposes verified public benchmark rows, so missing categories stay blank until a sourced evaluation is available.
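
To put the coverage figure in proportion, the short sketch below converts the 15-of-83 count into a percentage. The helper name is hypothetical and the calculation is plain arithmetic, not a metric that BenchLM itself publishes.

```python
VERIFIED_BENCHMARKS = 15  # verified scores on this profile
TRACKED_BENCHMARKS = 83   # benchmarks BenchLM tracks


def coverage_percent(verified: int, tracked: int) -> float:
    """Return verified coverage as a percentage (hypothetical helper)."""
    if tracked <= 0:
        raise ValueError("tracked must be positive")
    if not 0 <= verified <= tracked:
        raise ValueError("verified must be between 0 and tracked")
    return 100.0 * verified / tracked


if __name__ == "__main__":
    pct = coverage_percent(VERIFIED_BENCHMARKS, TRACKED_BENCHMARKS)
    missing = TRACKED_BENCHMARKS - VERIFIED_BENCHMARKS
    print(f"Coverage: {pct:.1f}% ({missing} benchmarks without a verified score)")
```

That works out to roughly 18% coverage, with 68 tracked benchmarks still lacking a verified score.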

What is the context window size of GLM-5V-Turbo?

GLM-5V-Turbo has a context window of 200K tokens, which determines how much text it can process in a single interaction.

Last updated: April 1, 2026 · Runtime metrics stay blank until BenchLM has a sourced snapshot.
