zai-org/GLM-OCR · Hugging Face
Z.ai published benchmark or leaderboard evidence for GLM-OCR, GLM OCR.
View sourcelucataco
Compact 0.9B multimodal OCR model from Z.ai. State-of-the-art on OmniDocBench V1.5 (94.62, 1 overall). Four modes: text recognition, formula (LaTeX), table parsing, and JSON-schema information extraction. Fits on a single T4.
Running this yourself: can likely run on your own machine.
State-of-the-art on OmniDocBench V1.5 (94.62, #1 overall).
This model is still tracked for research and discovery, but it is excluded from default public rankings until it returns to active status.
---
Quality Score
---
Arena ELO
900M
Parameters
---
Context
Sign in to join the discussion
0
Downloads
0
Likes
May 2026
Released
Benchmarks
1
Open Source
1
General
1
Recent launch, pricing, benchmark, and API signals linked to this model or its provider.
Z.ai published benchmark or leaderboard evidence for GLM-OCR, GLM OCR.
View sourceglm-ocr is now available through local Ollama runtime. 128K context window listed. GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.
glm-ocr is now available through local Ollama runtime. 128K context window listed. GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.
Z.ai published benchmark or leaderboard evidence for GLM-OCR, GLM OCR.