glm-4.5

by z-ai

GLM-4.5 is Z.AI's latest flagship foundation model, designed for agent-based applications. It uses a Mixture-of-Experts (MoE) architecture and supports context lengths of up to 128k tokens. GLM-4.5 offers significantly improved reasoning, code generation, and agent alignment. It has a hybrid inference mode with two options: a "thinking mode" for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses.

Pricing

Pay-as-you-go rates for this model.

Input tokens: $0.20 per 1M tokens

Output tokens: $0.83 per 1M tokens
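The per-million rates above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost_usd` and the constants are illustrative, and it assumes input and output tokens are billed linearly at the listed rates with no minimums, rounding, or discounts.

```python
from decimal import Decimal

# Listed pay-as-you-go rates, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.20")
OUTPUT_USD_PER_MILLION = Decimal("0.83")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming purely linear billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # 100,000 prompt tokens and 20,000 completion tokens.
    print(estimate_cost_usd(100_000, 20_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many requests.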

Capabilities

Input modalities: Text

Output modalities: Text

Rate Limits

Requests per minute (RPM) and requests per day (RPD) by tier.

Tier      RPM    RPD
Free      -      -
Tier 1    10     -
Tier 2    15     -
Tier 3    25     -
Tier 4    50     -
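
A client can stay under a per-minute limit by throttling itself before sending. The following is a minimal sliding-window sketch, not part of any official SDK: the class name `RpmThrottle` and the `TIER_RPM` mapping are hypothetical, and the mapping assumes the single number listed for each tier is its RPM value.

```python
import time
from collections import deque

# Assumption: the number listed for each tier is its requests-per-minute limit.
TIER_RPM = {"Tier 1": 10, "Tier 2": 15, "Tier 3": 25, "Tier 4": 50}


class RpmThrottle:
    """Blocks so that at most `rpm` requests start in any 60-second window."""

    def __init__(self, rpm, clock=time.monotonic, sleep=time.sleep):
        if rpm <= 0:
            raise ValueError("rpm must be positive")
        self.rpm = rpm
        self._clock = clock
        self._sleep = sleep
        self._sent = deque()  # start times of requests in the current window

    def wait_time(self):
        """Return how many seconds to wait before the next request may start."""
        now = self._clock()
        # Drop requests that have left the 60-second window.
        while self._sent and now - self._sent[0] >= 60.0:
            self._sent.popleft()
        if len(self._sent) < self.rpm:
            return 0.0
        return 60.0 - (now - self._sent[0])

    def acquire(self):
        """Wait until a slot is free, then record the request start time."""
        while True:
            delay = self.wait_time()
            if delay <= 0:
                break
            self._sleep(delay)
        self._sent.append(self._clock())


if __name__ == "__main__":
    throttle = RpmThrottle(TIER_RPM["Tier 1"])
    throttle.acquire()  # call once before each API request
```

The injectable `clock` and `sleep` parameters exist only so the throttle can be tested without real waiting. A server-side limit may be counted differently (for example, fixed windows), so a client should still handle rate-limit errors.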

Usage Analytics

Token usage across the last 30 active days