glm-5.1 - GAIA
GAIA score 42.5 from M&M
Z.ai
Z.ai's updated flagship reasoning and coding model family with stronger agentic performance and longer-context production workflows.
Quality Score: 41.2
Arena ELO: not listed
Parameters: Undisclosed
Context: 203K
Downloads: 0
Likes: 0
Released: Apr 2026
Benchmarks: 5
API: 2
General: 5
Recent launch, pricing, benchmark, and API signals linked to this model or its provider.
Introducing GLM-5.1: The Next Level of Open Source
- Top-Tier Performance: #1 in open source and #3 globally across SWE-Bench Pro, Terminal-Bench, and NL2Repo.
- Built for Long-Horizon Tasks: Runs autonomously for 8 hours, refining strategies through thousands of iterations.
https://t.co/YQZLhKVwik
Z.ai published benchmark or leaderboard evidence for GLM-5.1-FP8.
Z.ai published benchmark or leaderboard evidence for GLM-5.1.
As models, contexts, and workloads grow, hidden assumptions in inference infrastructure can surface as output anomalies. Reliability requires more than throughput, latency, and availability. It also requires preserving the correctness of model state behind every generation.

After fixing correctness issues, we turned to the next bottleneck: Prefill throughput and GPU memory pressure in long-context Coding Agent serving. To address this, we introduced LayerSplit, a layer-wise KV Cache storage scheme. Instead of duplicating all layers on every GPU, https://t.co/OGptVovbtf
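The post above is cut off, so the actual LayerSplit design is not shown here. As a rough illustration only, the following sketch compares per-GPU KV cache memory when every GPU holds all layers against a layout where each GPU holds a contiguous subset of layers. The model dimensions, the standard per-head K/V layout, and all function names are hypothetical assumptions, not GLM-5.1's architecture or Z.ai's implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KVShape:
    """Hypothetical model dimensions, chosen only for illustration."""
    num_layers: int
    num_kv_heads: int
    head_dim: int
    bytes_per_value: int  # e.g. 2 for fp16/bf16


def kv_bytes_per_token(shape: KVShape, layers: int) -> int:
    # Factor 2 accounts for storing both keys and values.
    return 2 * layers * shape.num_kv_heads * shape.head_dim * shape.bytes_per_value


def layers_per_gpu(num_layers: int, num_gpus: int) -> list[int]:
    # Spread layers as evenly as possible; earlier GPUs take any remainder.
    base, extra = divmod(num_layers, num_gpus)
    return [base + (1 if i < extra else 0) for i in range(num_gpus)]


def per_gpu_kv_bytes(shape: KVShape, context_tokens: int,
                     num_gpus: int, split_layers: bool) -> list[int]:
    # split_layers=False models "all layers on every GPU";
    # split_layers=True models an assumed layer-wise split.
    if split_layers:
        counts = layers_per_gpu(shape.num_layers, num_gpus)
    else:
        counts = [shape.num_layers] * num_gpus
    return [kv_bytes_per_token(shape, n) * context_tokens for n in counts]


if __name__ == "__main__":
    shape = KVShape(num_layers=64, num_kv_heads=8, head_dim=128, bytes_per_value=2)
    for split in (False, True):
        usage = per_gpu_kv_bytes(shape, context_tokens=200_000, num_gpus=8,
                                 split_layers=split)
        label = "layer-wise split" if split else "all layers per GPU"
        print(f"{label}: {max(usage) / 2**30:.1f} GiB on the busiest GPU")
```

Under these assumptions the per-GPU footprint shrinks roughly in proportion to the number of GPUs. The sketch ignores communication cost and scheduling, which a real serving system would have to handle.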
GLM-5.1 Tool Calling Issue Fix & Chat Template Update
If you are running GLM-5.1 with vLLM/SGLang and using tool calling, please update your chat template. https://t.co/YNi99exkB1
Issue: When using tool calling, frameworks including vLLM automatically convert plain-text tool

GLM-5.1 is now available through Ollama Cloud. 198K context window listed. GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.
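The 198K figure in this listing and the 203K context shown in the stats near the top of the page may describe the same window in different units. This is only a guess and is not stated by either source: if "198K" counts units of 1,024 tokens, the total rounds to 203K in units of 1,000. The function names below are hypothetical.

```python
# Assumption (not confirmed by Z.ai or Ollama): "198K" means 198 * 1024 tokens.
BINARY_K = 1024
DECIMAL_K = 1000


def tokens_from_binary_k(k: int) -> int:
    # Interpret "K" as 1,024 tokens.
    return k * BINARY_K


def to_decimal_k(tokens: int) -> int:
    # Express a token count in units of 1,000, rounded to the nearest unit.
    return round(tokens / DECIMAL_K)


tokens = tokens_from_binary_k(198)
print(tokens, to_decimal_k(tokens))
```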