Zhipu AI Launches GLM-5.2 Language Model

Zhipu AI has launched its GLM-5.2 language model, featuring a 1 million token context and designed for advanced programming tasks.

The Chinese startup Zhipu AI has released its flagship language model, GLM-5.2, designed for long-agent tasks and programming. This open-source solution features a context window of 1 million tokens, an MIT license, and supports local deployment.

On its Hugging Face page, the model is described as capable of generating text in both English and Chinese, with a size of 753 billion parameters.

GLM-5.2 offers multiple levels of "reasoning intensity" to balance quality and latency. Its architecture also incorporates IndexShare and an updated MTP layer for speculative decoding.

According to the developers, IndexShare reuses a single indexer for every four layers of sparse attention, reducing the number of operations per token by 2.9 times. The MTP update increases confirmation length by up to 20%.

In three key benchmarks—FrontierSWE, PostTrainBench, and SWE-Marathon—GLM-5.2 outperformed other open-source models.

Source: Zhipu AI.

In standard programming performance tests, GLM-5.2 also emerged as the most powerful open-source model.

Source: Zhipu AI.

GLM-5.2 is distributed under the open MIT license. It supports local deployment with SGLang, vLLM, Transformers, KTransformers, and Docker Model Runner. Quantization options are available for llama.cpp, Ollama, and LM Studio.

As a reminder, in June, the Rio de Janeiro IT company IplanRIO introduced Rio 3.5 Open 397B as an open AI model funded by public resources. However, a day later, the Nex team claimed that the tool appeared to be a direct merger of Nex-N2-Pro and Qwen3.5-397B-A17B.

Zhipu AI Launches GLM-5.2 with 1 Million Token Context