Kandinsky 3.1

kandinsky-3.1
by nvidia|Created May 31, 2025

Kandinsky-3.1 is a large text-to-image diffusion model developed by Sber and AIRI, featuring 11.9 billion parameters. The model consists of a text encoder, U-Net, and decoder, enabling high-quality, detailed image generation from text prompts. It is trained on extensive datasets and is designed for both creative and scientific applications.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Image

$0.0050

Capabilities

Input Modalities

Text

Output Modalities

Image

Usage Analytics

Token usage across the last 30 active days

Throughput