A Chinese artificial intelligence startup is making waves in the global AI race after claiming its latest model can outperform OpenAI's GPT-5.5 on key benchmarks, drawing attention across the tech industry.
According to a Moneycontrol report, Beijing-based Z.ai (formerly Zhipu AI) has unveiled its new large language model, GLM-5.2, positioning it as a significant leap forward in reasoning and software engineering capabilities. The company says the model is specifically designed for “long-horizon†tasks, complex assignments that require sustained reasoning, planning and execution over extended periods.
Benchmark results shared by Z.ai suggest that GLM-5.2 edges past GPT-5.5 on several critical tests used to evaluate advanced AI systems. On Humanity's Last Exam — a rigorous benchmark featuring thousands of expert-level questions across disciplines — the model scored 54.7, compared to GPT-5.5's 52.2.
The model also showed gains on FrontierSWE, a benchmark that measures an AI system's ability to complete complex, open-ended software engineering projects. Here too, GLM-5.2 reportedly outperformed GPT-5.5 by a small margin, reinforcing its strength in coding and engineering workflows.
Industry observers say such benchmarks are increasingly important as AI systems move beyond chat-based interactions and evolve into autonomous tools capable of handling real-world tasks like debugging, software development and multi-step problem-solving.
One of GLM-5.2's standout features is its ability to process extremely large volumes of data at once. The model supports a context window of up to one million tokens, enabling it to work with large codebases, long documents and extended conversations without losing track of context.
Z.ai says the model has been trained to handle full-scale development workflows, including analysing requirements, writing code, debugging and managing complex engineering processes — all within a single continuous task.
The launch highlights intensifying competition between Chinese and American AI firms, particularly as open-source and hybrid models begin to rival proprietary systems in performance. Analysts note that China's rapid advancements in AI development are narrowing the gap with global leaders, especially in areas like coding and applied engineering.
However, experts also caution that benchmark performance alone may not fully capture real-world reliability or usability. Independent validation and broader deployment will be critical in determining whether GLM-5.2 can truly challenge established models.
Also read: OpenAI's India partnerships, explained
First Published on June 19, 2026, 18:17:02 IST






