GLM 4.5

glm-4.5
by z-ai | Created Sep 20, 2025

GLM-4.5 is the latest flagship foundation model from Z.AI, designed for agent-based applications. It uses a Mixture-of-Experts (MoE) architecture and supports context lengths of up to 128k tokens. GLM-4.5 offers significantly improved capabilities in reasoning, code generation, and agent alignment. It has a hybrid inference mode with two options: a "thinking mode" for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses.

Pricing

Pay-as-you-go rates for this model.

Input Tokens (per 1M): $0.17
Output Tokens (per 1M): $0.78
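
To make the per-million-token rates concrete, here is a minimal sketch of a cost estimate for a single request. It assumes the listed amounts are US dollars and that billing is strictly linear in token count with no minimums, caching discounts, or rounding rules; the function name `estimate_cost_usd` and the constants are illustrative and not part of any official SDK.

```python
from decimal import Decimal

# Rates copied from the pricing listed above (assumed to be USD per 1M tokens).
INPUT_USD_PER_MILLION = Decimal("0.17")
OUTPUT_USD_PER_MILLION = Decimal("0.78")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the pay-as-you-go cost of one request.

    Hypothetical helper: assumes purely linear per-token billing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example: a 12,000-token prompt that produces a 1,500-token reply.
    print(estimate_cost_usd(12_000, 1_500))
```

For a 12,000-token prompt and a 1,500-token reply, the arithmetic is 12,000 × $0.17 / 1M + 1,500 × $0.78 / 1M = $0.00204 + $0.00117 = $0.00321. `Decimal` is used rather than `float` so that small per-request amounts add up without binary rounding drift.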

Capabilities

Input Modalities: Text
Output Modalities: Text

Usage Analytics

Charts: token usage across the last 30 active days, throughput, and time-to-first-token (TTFT).