GLM-4.5 is the latest flagship foundation model from Z.AI, specifically designed for agent-based applications. It utilizes a Mixture-of-Experts (MoE) architecture and supports context lengths of up to 128k tokens. GLM-4.5 offers significantly improved capabilities in reasoning, code generation, and agent alignment. It features a hybrid inference mode with two options: a "thinking mode," tailored for complex reasoning and tool usage, and a "non-thinking mode," optimized for instant responses.