Zixuan Li

Zixuan Li

Z.aiZ.ai
CMU & MIT | Z.ai Project Director
102 points
All activity
A 744B MoE model (40B active) built for complex systems & agentic tasks. #1 open-source on Vending Bench 2, narrowing the gap with Claude Opus 4.5. Features DeepSeek Sparse Attention and "slime" RL infra.
GLM-5
GLM-5Open-weights model for long-horizon agentic engineering
A lightweight (0.9B) professional OCR model. Achieves SOTA (94.6 on OmniDocBench) on complex layouts, tables, and handwriting. Supports vLLM/SGLang for ultra-fast inference.
GLM-OCR
GLM-OCRSOTA document parsing & OCR in just 0.9B parameters
GLM-Image combines a 9B Auto-regressive model with a 7B Diffusion decoder. This hybrid architecture excels at knowledge-dense generation, perfect for posters, diagrams, and precise text rendering. Open-source and ready for T2I & I2I tasks.
GLM-Image
GLM-ImageAuto-regressive for dense-knowledge & high-fidelity images
GLM-4.7 is a SOTA open-weight model optimized for coding and reasoning. It features "Preserved Thinking" to maintain reasoning context across multi-turn agentic tasks. Compatible with tools like Cline and Claude Code.
GLM-4.7
GLM-4.7Advanced coding & reasoning with multi-turn thinking
GLM-4.6V is GLM's newest open-source multimodal model with a 128k context window. It features native function calling, bridging visual perception with executable actions for complex agentic workflows like web search and coding.
GLM-4.6V
GLM-4.6VOpen-source multimodal model with native tool use
GLM-4.6 is the new flagship model from Z.ai, with a 200K context window and major upgrades to its coding, reasoning, and agentic skills. It achieves near-parity with Claude Sonnet 4 in real-world tests and is available now via API and in popular coding agents.
GLM-4.6
GLM-4.6Advanced Agentic, Reasoning and Coding Capabilities
GLM-4.5 is a new 355B parameter open-weight MoE model (32B active). It delivers state-of-the-art performance on reasoning, code, and agentic tasks. Both the 355B flagship and a 106B Air version are now available, featuring dual-mode inference.
GLM-4.5
GLM-4.5Unifying agentic capabilities in one open model