Name: GLM-5.1
Rating: 74 (34 reviews)
Author: Z.AI

Question 1

How does GLM-5.1 perform overall in AI benchmarks?

Accepted Answer

GLM-5.1 currently ranks #34 out of 124 models on BenchLM's provisional leaderboard with an overall score of 74, and #30 out of 33 on the verified leaderboard. It was created by Z.AI and has a 203K-token context window.

Question 2

Is GLM-5.1 good for knowledge and understanding?

Accepted Answer

GLM-5.1 ranks #12 out of 124 models in knowledge and understanding benchmarks with an average score of 83. That is a strong result, though 11 models rank higher in this category.

Question 3

Is GLM-5.1 good for coding and programming?

Accepted Answer

GLM-5.1 ranks #20 out of 124 models in coding and programming benchmarks with an average score of 81.3. That is a solid result, though 19 models rank higher in this category.

Question 4

Is GLM-5.1 good for mathematics?

Accepted Answer

GLM-5.1 ranks #13 out of 124 models in mathematics benchmarks with an average score of 91.3. That is a strong result, though 12 models rank higher in this category.

Question 5

Is GLM-5.1 good for reasoning and logic?

Accepted Answer

GLM-5.1 ranks #32 out of 124 models in reasoning and logic benchmarks with an average score of 60.5. This is its weakest ranked category, and 31 models rank higher.

Question 6

Is GLM-5.1 good for agentic tool use and computer tasks?

Accepted Answer

GLM-5.1 has visible benchmark coverage in agentic tool use and computer tasks, but BenchLM does not currently assign it a global category rank there.

Question 7

Is GLM-5.1 good for multimodal and grounded tasks?

Accepted Answer

GLM-5.1 has visible benchmark coverage in multimodal and grounded tasks, but BenchLM does not currently assign it a global category rank there.

Question 8

Is GLM-5.1 good for instruction following?

Accepted Answer

GLM-5.1 ranks #9 out of 124 models in instruction following benchmarks with an average score of 93.8. It is among the top performers in this category.
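
The category ranks quoted in the answers above are easier to compare as a share of the field. The following is a minimal sketch of that arithmetic, assuming all five ranks are out of the same 124 models and that there are no ties; the function name `top_share` and the dictionary layout are illustrative and not part of BenchLM.

```python
def top_share(rank: int, total: int) -> float:
    """Return the fraction of the field at or above the given rank."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return rank / total


# Category ranks as stated in the answers above (all out of 124 models).
TOTAL_MODELS = 124
category_ranks = {
    "Knowledge": 12,
    "Coding": 20,
    "Math": 13,
    "Reasoning": 32,
    "Instruction following": 9,
}

# Print categories from best to worst rank.
for name, rank in sorted(category_ranks.items(), key=lambda item: item[1]):
    print(f"{name}: #{rank} of {TOTAL_MODELS} (top {top_share(rank, TOTAL_MODELS):.1%})")
```

Read this way, instruction following (rank #9) falls in roughly the top 7% of ranked models, while reasoning (rank #32) falls in roughly the top 26%.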

Question 9

Is GLM-5.1 open source?

Accepted Answer

Yes, in the open-weight sense. GLM-5.1 is an open-weight model created by Z.AI, meaning its weights can be downloaded and run locally or fine-tuned for specific use cases.

Question 10

Which sibling models are related to GLM-5.1?

Accepted Answer

GLM-5.1 belongs to the GLM-5 family. Related variants on BenchLM include GLM-5, GLM-5.2, GLM-5 (Reasoning), GLM-5V-Turbo, and GLM-5-Turbo.

Question 11

Does GLM-5.1 have full benchmark coverage on BenchLM?

Accepted Answer

Not yet. GLM-5.1 currently has 34 published benchmark scores out of the 249 benchmarks BenchLM tracks. BenchLM only exposes non-generated public benchmark rows, so missing categories stay blank until a sourced evaluation is available.
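
As a quick check on how partial that coverage is, the two counts above can be turned into a ratio. This is a minimal sketch using only the figures stated in the answer; the function name `coverage` is illustrative and not a BenchLM API.

```python
def coverage(published: int, tracked: int) -> float:
    """Return the fraction of tracked benchmarks that have a published score."""
    if tracked <= 0:
        raise ValueError("tracked must be positive")
    if not 0 <= published <= tracked:
        raise ValueError("published must be between 0 and tracked")
    return published / tracked


# Figures from the answer above: 34 published scores out of 249 tracked benchmarks.
ratio = coverage(34, 249)
print(f"Coverage: {ratio:.1%}")           # prints "Coverage: 13.7%"
print(f"Missing benchmarks: {249 - 34}")  # prints "Missing benchmarks: 215"
```

So roughly one in seven tracked benchmarks currently has a sourced score for GLM-5.1.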

Question 12

What is the context window size of GLM-5.1?

Accepted Answer

GLM-5.1 has a context window of 203K tokens, which is the maximum amount of text (prompt plus generated output) it can handle in a single interaction.
