AI News AI Platforms Open-source LLMs Tech

Z.ai releases 754-billion-parameter open-source model built for hours-long autonomous work

GLM-5.1 is designed to sustain complex engineering tasks across thousands of tool calls without human intervention.

by Defused News Writer

Updated April 08, 2026

Z.ai releases 754-billion-parameter open-source model built for hours-long autonomous work — Photo by Aerps.com / Unsplash

Z.ai, the Chinese AI laboratory, has released GLM-5.1, a 754-billion-parameter open-source large language model designed to operate autonomously on a single task for up to eight hours, pushing the boundaries of what AI agents can accomplish without human input.

The model's weights are publicly available on Hugging Face and ModelScope, the AI model hosting platforms, and it can be deployed locally via vLLM, SGLang and xLLM.

The release follows a proprietary GLM-5 Turbo variant and reflects Z.ai's strategy of pairing open-source flagship models with commercial, optimised releases designed to generate subscription and API revenue.

Z.ai describes the model's core technical advance as a staircase pattern of optimisation, in which the system makes sustained, incremental progress across long task sequences rather than plateauing early.

In one benchmark scenario using VectorDBBench, a database performance testing suite, the model ran 655 iterations and more than 6,000 tool calls over the course of a single task, improving throughput from an initial 3,547 queries per second to 21,500 queries per second after six rounds of structural improvement.

On KernelBench Level 3, a test of low-level code optimisation, GLM-5.1 achieved a 3.6x geometric mean speedup across 50 problems, ahead of the earlier GLM-5's plateau of 2.6x, though behind Claude Opus 4.6's 4.2x result on the same test.

The model posted a score of 58.4 on SWE-Bench Pro, a benchmark measuring AI performance on real-world software engineering tasks, narrowly ahead of GPT-5.4 at 57.7 and Claude Opus 4.6 at 57.3.

Z.ai said GLM-5.1 also recorded gains on Terminal-Bench 2.0, CyberGym, MCP-Atlas and T3-Bench, though independent verification of those figures was not immediately available.

Z.ai's leader, known on X as Lou, said autonomous work duration may prove as significant a development curve for AI as scaling laws, the relationship between model size and capability that has driven progress in the field for years.

Lou noted that AI agents capable of around 20 steps at the end of last year can now, with GLM-5.1, execute up to 1,700 steps, adding that the open-source community would be able to verify the claim directly.

Z.ai has published subscription tiers and API pricing alongside the release, including quarter-based coding plans and per-token rates, with a promotional period offering reduced off-peak billing.

The recap

Z.ai releases GLM-5.1 under MIT License, weights on Hugging Face
Model has 754 billion parameters and 202,752 token context window
Commercial tiers and API pricing announced, local deployment supported

by Defused News Writer

Updated April 08, 2026

Explore stories

Microsoft developer division head Julia Liuson to leave after 34 years post image

Tech Giants

Microsoft developer division head Julia Liuson to leave after 34 years

by Defused News Writer

April 08, 2026

Claude suffers repeated outages as Anthropic battles Sonnet 4.6 errors across multiple days post image

AI News Tech

Claude suffers repeated outages as Anthropic battles Sonnet 4.6 errors across multiple days

by Defused News Writer

April 08, 2026

Memento-Skills lets AI agents rewrite their own capabilities without retraining underlying models post image

AI News agentic AI Tech AI Models & Research

Memento-Skills lets AI agents rewrite their own capabilities without retraining underlying models

by Defused News Writer

April 08, 2026

X rolls out Grok-powered translation and AI image editing tools post image

XAi Grok Elon Musk Artificial Intelligence AI News

X rolls out Grok-powered translation and AI image editing tools

by Defused News Writer

April 08, 2026

Google Colab adds Learn Mode and Custom Instructions post image

AI News AI Platforms Tech Tech Giants

Google Colab adds Learn Mode and Custom Instructions

by Defused News Writer

April 08, 2026

Alibaba opens chip-powered data centre as China pushes for AI self-sufficiency post image

AI News Artificial Intelligence Big Tech

Alibaba opens chip-powered data centre as China pushes for AI self-sufficiency

by Defused News Writer

April 08, 2026

Meta launches Muse Spark as Zuckerberg bets on proprietary AI post image

Meta Platforms Facebook AI News Big Tech

Meta launches Muse Spark as Zuckerberg bets on proprietary AI

by Defused News Writer

April 08, 2026

Google Finance expands to 100+ countries post image

AI News Markets Tech

Google Finance expands to 100+ countries

by Defused News Writer

April 08, 2026

British Bitcoin developer denies he is Satoshi Nakamoto after New York Times investigation post image

Markets Cryptocurrency Bitcoin

British Bitcoin developer denies he is Satoshi Nakamoto after New York Times investigation

by Defused News Writer

April 08, 2026

Anthropic's most powerful AI model yet scans for security flaws, but could also create them post image

Anthropic Claude Artificial Intelligence AI News

Anthropic's most powerful AI model yet scans for security flaws, but could also create them

by Defused News Writer

April 08, 2026

Meta urges Europe to scale artificial intelligence leadership post image

AI News Tech

Meta urges Europe to scale artificial intelligence leadership

Meta hosted a Brussels symposium to press European leaders to move faster on artificial intelligence investment and regulation.

by Defused News Writer

April 08, 2026

Google: Retailers must clean product data for artificial intelligence post image

AI News eCommerce & Retail Tech Tech

Google: Retailers must clean product data for artificial intelligence

by Defused News Writer

April 08, 2026

Android XR adds five features, including experimental 3D conversion post image

Consumer Electronics Tech Giants Gaming

Android XR adds five features, including experimental 3D conversion

by Defused News Writer

April 07, 2026

Gmail says Gemini won’t train on personal email post image

AI News Privacy & Data Tech

Gmail says Gemini won’t train on personal email

by Defused News Writer

April 07, 2026