Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
47 changes: 47 additions & 0 deletions docs/llmservice/models/glm-5-2.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
# GLM-5.2

## Overview

GLM-5.2 is a GLM-family text foundation model developed by Z.AI and released on June 16, 2026. It is positioned for long-horizon coding and engineering tasks, with a 1M-token context window, 128K maximum output, and a `reasoning_effort` control for adjusting reasoning depth.

## Key Features

* **1M Context Window**: Supports up to 1M tokens of context for project-scale codebases, long documents, and multi-step engineering workflows.
* **Long-Horizon Coding**: Z.AI reports 81.0 on Terminal-Bench 2.1 and 62.1 on SWE-bench Pro, with emphasis on project-level code understanding and sustained task execution.
* **Configurable Reasoning**: Supports deep-thinking mode and the GLM-5.2-specific `reasoning_effort` parameter, with `high` and `max` reasoning levels for complex tasks.
* **Agent and Tool Integration**: Supports function calling, streaming tool calls, structured output, context caching, and MCP-based tool/data-source integration.

## Best Use Cases

* **Project-Scale Codebase Work**: Reviewing, refactoring, migrating, or extending repositories where the model needs to retain architecture, module boundaries, API contracts, and engineering conventions.
* **Long-Horizon Engineering Tasks**: Multi-file implementation, dependency-aware refactoring, SDK adaptation, debugging loops, and test-fix-verify cycles that require sustained progress.
* **Tool-Using Agent Workflows**: Coding agents, internal automation, MCP-connected workflows, and structured-output systems that need reliable tool invocation and streamed tool-call arguments.

## Capabilities and Limitations

| Capability | Description |
| :------------------- | :-------------------------------------------------------------------------------------------------------------------------------------------- |
| **Reasoning** | Supports deep-thinking mode and `reasoning_effort`; Z.AI positions it for complex engineering, debugging, and long-chain reasoning workflows. |
| **Creative Writing** | Supports general text generation through the chat completion API, but official GLM-5.2 materials emphasize coding and engineering use cases. |
| **Coding** | Z.AI reports Terminal-Bench 2.1 score of 81.0 and SWE-bench Pro score of 62.1, with focus on long-horizon coding-agent scenarios. |
| **Multimodal** | Text input and text output. Vision and multimodal workflows are handled by separate Z.AI models such as GLM-5V-Turbo. |
| **Response Speed** | Official docs do not publish latency or tokens-per-second figures; streaming responses and streaming tool calls are supported. |
| **Context Window** | 1M tokens. |
| **Max Output** | 128K tokens. |
| **Tool Use** | Function calling, streaming tool calls, structured output, context caching, and MCP integration. |
| **Multilingual** | Suitable for Chinese and English developer workflows; official docs do not publish a GLM-5.2 language coverage benchmark. |

### Known Limitations

* Text-only model; image, video, and GUI-understanding tasks require a separate vision-language model such as GLM-5V-Turbo.
* Very long contexts and 128K outputs can increase latency and cost; cap `max_tokens` and use context caching where applicable.

## Credits Usage

| Model | Input (Credits/Token) | Cache Write (Credits/Token) | Cache Read (Credits/Token) | Output (Credits/Token) | Web Search (Credits/Use) | Billing Notes |
| :--- | --------------------: | --------------------------: | -------------------------: | ---------------------: | -----------------------: | :--- |
| **GLM-5.2** | `1.40` | `1.40` | `0.28` | `4.40` | `-` | - |

:::info Pricing note
Prices shown in the documentation are B.AI standard reference prices for base billing purposes. B.AI may provide lower actual usage costs through top-up bonuses and account benefits. Specific prices, bonus Credits, and account benefits are subject to the platform display and final billing records.
:::
1 change: 1 addition & 0 deletions docs/llmservice/pricing-and-usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,7 @@ The platform uses a unified Credits system to measure and settle usage across al
| MiniMax M2.7 | 0.30 | 0.375 | 0.06 | 1.20 | - |
| Kimi K2.5 | 0.59 | 0.59 | 0.177 | 3.00 | - |
| Qwen3.6-27B | 0.19 | 0.19 | 0.19 | 2.99 | - |
| GLM-5.2 | 1.40 | 1.40 | 0.28 | 4.40 | - |
| GLM-5.1 | 1.40 | 1.40 | 0.26 | 4.40 | - |
| GLM-5 | 1.00 | 1.00 | 0.20 | 3.20 | - |
| DeepSeek V3.2 | 0.29 | 0.29 | 0.145 | 0.44 | - |
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
# GLM-5.2

## 概述

GLM-5.2 是由 Z.AI 开发的 GLM 系列文本基础模型,于 2026 年 6 月 16 日发布。该模型面向长周期代码和工程任务,支持 1M tokens 上下文窗口、128K 最大输出,并提供 `reasoning_effort` 参数用于调整推理深度。

## 核心特性

* **1M 上下文窗口**:支持最高 1M tokens 上下文,适合项目级代码库、长文档和多步骤工程工作流。
* **长周期代码能力**:Z.AI 报告其 Terminal-Bench 2.1 得分为 81.0,SWE-bench Pro 得分为 62.1,重点面向项目级代码理解和持续任务执行。
* **可配置推理深度**:支持 deep-thinking 模式,以及 GLM-5.2 专用的 `reasoning_effort` 参数;复杂任务可使用 `high` 和 `max` 推理等级。
* **Agent 与工具集成**:支持函数调用、流式工具调用、结构化输出、上下文缓存,以及基于 MCP 的工具和数据源集成。

## 适用场景

* **项目级代码库工作**:适合代码审查、重构、迁移或扩展仓库,需要模型保持架构、模块边界、API 合约和工程约定的场景。
* **长周期工程任务**:适合多文件实现、依赖感知重构、SDK 适配、调试循环,以及测试、修复、验证一体化流程。
* **工具调用型 Agent 工作流**:适合 Coding Agent、内部自动化、MCP 连接工作流,以及需要可靠工具调用和流式工具参数的结构化输出系统。

## 能力与限制

| 能力维度 | 说明 |
| :--- | :--- |
| **推理能力** | 支持 deep-thinking 模式和 `reasoning_effort`;Z.AI 将其定位于复杂工程、调试和长链路推理工作流 |
| **创意写作** | 支持通过 chat completion API 进行通用文本生成,但官方 GLM-5.2 材料更强调代码和工程场景 |
| **编程能力** | Z.AI 报告 Terminal-Bench 2.1 得分为 81.0,SWE-bench Pro 得分为 62.1,重点面向长周期 Coding Agent 场景 |
| **多模态能力** | 文本输入和文本输出;视觉和多模态工作流由 GLM-5V-Turbo 等独立 Z.AI 模型处理 |
| **响应速度** | 官方文档未公布延迟或 tokens-per-second 数据;支持流式响应和流式工具调用 |
| **上下文窗口** | 1M tokens |
| **最大输出** | 128K tokens |
| **工具调用** | 函数调用、流式工具调用、结构化输出、上下文缓存和 MCP 集成 |
| **多语言能力** | 适合中文和英文开发者工作流;官方文档未公布 GLM-5.2 语言覆盖基准 |

### 已知限制

* 该模型为文本模型;图像、视频和 GUI 理解任务需要使用 GLM-5V-Turbo 等独立视觉语言模型。
* 超长上下文和 128K 输出可能增加延迟和成本;建议按需限制 `max_tokens`,并在适用场景使用上下文缓存。

## 积分消耗

| 模型名称 | 输入 (Credits/Token) | Cache Write (Credits/Token) | Cache Read (Credits/Token) | 输出 (Credits/Token) | 网页搜索(Credits/次) | 计费说明 |
| :--- | --------------------: | --------------------------: | -------------------------: | -------------------: | ---------------------: | :--- |
| **GLM-5.2** | `1.40` | `1.40` | `0.28` | `4.40` | `-` | - |

:::info 价格说明
文档价格为 B.AI 平台模型标准参考价,仅供基础计费说明使用。B.AI 可能会通过充值赠送及账户权益等方式,为用户提供更低的实际使用成本。具体价格、赠送积分及账户权益请以平台页面展示及最终账单为准。
:::
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,7 @@
| MiniMax M2.7 | 0.30 | 0.375 | 0.06 | 1.20 | - |
| Kimi K2.5 | 0.59 | 0.59 | 0.177 | 3.00 | - |
| Qwen3.6-27B | 0.19 | 0.19 | 0.19 | 2.99 | - |
| GLM-5.2 | 1.40 | 1.40 | 0.28 | 4.40 | - |
| GLM-5.1 | 1.40 | 1.40 | 0.26 | 4.40 | - |
| GLM-5 | 1.00 | 1.00 | 0.20 | 3.20 | - |
| DeepSeek V3.2 | 0.29 | 0.29 | 0.145 | 0.44 | - |
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -237,6 +237,7 @@ const sidebars = {
'llmservice/models/gemini-3-1-pro',
'llmservice/models/gemini-3-5-flash',
'llmservice/models/gemini-3-flash',
'llmservice/models/glm-5-2',
'llmservice/models/glm-5-1',
'llmservice/models/glm-5',
'llmservice/models/kimi-k2.5',
Expand Down
2 changes: 1 addition & 1 deletion package.json
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
{
"name": "@x402-tron/docs",
"version": "1.2.27",
"version": "1.2.28",
"description": "x402-tron documentation",
"license": "MIT",
"scripts": {
Expand Down
1 change: 1 addition & 0 deletions sidebars.js
Original file line number Diff line number Diff line change
Expand Up @@ -234,6 +234,7 @@ const sidebars = {
'llmservice/models/gemini-3-1-pro',
'llmservice/models/gemini-3-5-flash',
'llmservice/models/gemini-3-flash',
'llmservice/models/glm-5-2',
'llmservice/models/glm-5-1',
'llmservice/models/glm-5',
'llmservice/models/kimi-k2.5',
Expand Down
Loading