GLM-5-Turbo Benchmark Update
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
View sourceZ.ai
Lower-latency GLM-5 variant focused on efficient coding and long-context agent runs.
---
Quality Score
---
Arena ELO
Undisclosed
Parameters
128K
Context
Sign in to join the discussion
0
Downloads
0
Likes
Mar 2026
Released
Benchmarks
20
General
3
Recent launch, pricing, benchmark, and API signals linked to this model or its provider.
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
View sourceQuality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Navigation Language Models GLM-5.2 Guides API Reference Coding Plan Released Notes Terms and Policy Help Center Get Started Quick Start Overview Pricing Core Parameters SDKs Guide Migrate to GLM-5.2 Language Models GLM-5.2 HOT GLM-5.1 GLM-5 GLM-5-Turbo GLM-4.7 GLM-4.6 GLM-4.5 GLM-4-32B-0414-128K Vision Language Models GLM-5V-Turbo GLM-4.6V GLM-OCR AutoGLM-Phone-Multilingual GLM-4.5V Image Generation Models GLM-Image CogView-4 Video Generation Models CogVideoX-3 Vidu Q1 Vidu 2
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
View sourceQuality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
View sourceQuality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
View sourceAs models, contexts, and workloads grow, hidden assumptions in inference infrastructure can surface as output anomalies. Reliability requires more than throughput, latency, and availability. It also requires preserving the correctness of model state behind every generation.

After fixing correctness issues, we turned to the next bottleneck: Prefill throughput and GPU memory pressure in long-context Coding Agent serving. To address this, we introduced LayerSplit, a layer-wise KV Cache storage scheme. Instead of duplicating all layers on every GPU, https://t.co/OGptVovbtf
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Quality: 38.1/100 | Price: $0/M tokens | Output: 0 tok/s | HumanEval: 0.436%
Navigation Language Models GLM-5 Guides API Reference Coding Plan Released Notes Terms and Policy Help Center Get Started Quick Start Overview Pricing Core Parameters SDKs Guide Migrate to GLM-5.2 Language Models GLM-5.2 HOT GLM-5.1 GLM-5 GLM-5-Turbo GLM-4.7 GLM-4.6 GLM-4.5 GLM-4-32B-0414-128K Vision Language Models GLM-5V-Turbo GLM-4.6V GLM-OCR AutoGLM-Phone-Multilingual GLM-4.5V Image Generation Models GLM-Image CogView-4 Video Generation Models CogVideoX-3 Vidu Q1 Vidu 2 I