Models

Explore a wide range of AI models available through the NagaAI platform.

flux-1-krea-dev

Flux-1-Krea-Dev is a 12B parameter rectified flow transformer developed by Black Forest Labs and Krea, focused on aesthetic photography and efficient, open-weight image generation. It leverages guidance distillation for efficient inference and is released with open weights for research and creative workflows.

byblack-forest-labs

~$0.01/image

qwen-image

Qwen-Image is a foundation image generation model from the Qwen team, excelling at high-fidelity text rendering, complex text integration (including English and Chinese), and diverse artistic styles. It supports advanced editing features such as style transfer, object manipulation, and human pose editing, and is suitable for both image generation and understanding tasks.

byqwen

~$0.01/image

flux-1-kontext-max

Flux-1-Kontext-Max is a premium text-based image editing model from Black Forest Labs, delivering maximum performance and advanced typography generation for transforming images through natural language prompts. It is designed for high-end creative and professional use.

byblack-forest-labs

~$0.04/image

flux-1-kontext-pro

Flux-1-Kontext-Pro is a state-of-the-art text-based image editing model from Black Forest Labs, providing high-quality, prompt-adherent output for transforming images using natural language. It is optimized for consistent results and advanced editing tasks.

byblack-forest-labs

~$0.02/image

imagen-4

Imagen-4 is Google’s latest text-to-image model, engineered for photorealistic quality, improved fine details, advanced spelling and typography rendering, and high accuracy across diverse art styles. It includes SynthID watermarking for AI-generated content identification and is benchmarked as a leader in human preference evaluations.

bygoogle

~$0.03/image

kandinsky-3.1

Kandinsky-3.1 is a large text-to-image diffusion model developed by Sber and AIRI, featuring 11.9 billion parameters. The model consists of a text encoder, U-Net, and decoder, enabling high-quality, detailed image generation from text prompts. It is trained on extensive datasets and is designed for both creative and scientific applications.

bynvidia

~$0.0050/image

sdxl

Stable Diffusion XL (SDXL) is a powerful text-to-image generation model from Stability AI, featuring a 3x larger UNet, dual text encoders (OpenCLIP ViT-bigG/14 and the original), and a two-stage process for generating highly detailed, controllable images. It introduces size and crop-conditioning for greater control and quality in image generation.

bystabilityai

~$0.0025/image

dall-e-3

DALL-E 3 is OpenAI’s third-generation text-to-image model, offering enhanced detail, accuracy, and the ability to understand complex prompts. It excels at generating realistic and creative images, handling intricate details like text and human anatomy, and supports various aspect ratios for flexible output.

byopenai

~$0.04/image

midjourney

Midjourney is a generative AI model developed by Midjourney, Inc., designed to create images from text descriptions (prompts). It is widely used for creative and design purposes, offering high-quality, imaginative visuals for a variety of applications.

bymidjourney

~$0.0080/image

stable-diffusion-3-large

Stable Diffusion 3 Large is the latest and most advanced addition to the Stable Diffusion family, featuring 8 billion parameters for intricate text understanding, typography, and highly detailed image generation. It is designed for creative and professional use cases requiring high fidelity and control.

bystabilityai

~$0.04/image

stable-diffusion-3.5-large

Stable Diffusion 3.5 Large is a powerful, text-to-image AI model from Stability AI, utilizing a Multimodal Diffusion Transformer (MMDiT) architecture with 8.1 billion parameters. It excels at generating high-resolution images (up to 1 megapixel) in diverse styles, with strong prompt adherence and advanced detail rendering.

bystabilityai

~$0.04/image

gpt-image-1

OpenAI’s new state-of-the-art image generation model. This is a natively multimodal language model that accepts both text and image inputs and produces image outputs. It powers image generation in ChatGPT, offering exceptional prompt adherence, a high level of detail, and quality.

byopenai

~$0.04/image

flux-1-schnell

Flux-1-Schnell is a high-speed, open-source text-to-image model from Black Forest Labs, optimized for rapid, high-quality image generation in just a few steps. It is ideal for applications where speed and efficiency are critical.

byblack-forest-labs

~$0.0015/image

flux-1-dev

Flux-1-Dev is an open-weight, non-commercial text-to-image model from Black Forest Labs, designed for high-quality image generation with a 12B parameter rectified flow transformer. It is optimized for research and creative experimentation.

byblack-forest-labs

~$0.01/image

flux-1-pro

Flux-1-Pro is an advanced text-to-image model from Black Forest Labs, generating high-quality, realistic images and clear text. It is suitable for a wide range of applications, including commercial and creative projects.

byblack-forest-labs

~$0.02/image

flux-1.1-pro

Flux-1.1-Pro is an enhanced version of Flux 1.0 Pro from Black Forest Labs, offering faster generation speeds, improved image quality, and better prompt adherence. It is optimized for both developer and commercial use.

byblack-forest-labs

~$0.04/image

flux-1.1-pro-ultra

Flux-1.1-Pro-Ultra is a high-resolution, high-speed image generation model from Black Forest Labs, capable of producing images up to 4 million pixels (4MP). It is designed for professional printing, fine art, and applications requiring exceptional detail and speed.

byblack-forest-labs

~$0.03/image

ideogram-v2-turbo

Ideogram-v2-turbo is the latest image generation model from Ideogram, designed for fast production of realistic visuals, graphic designs, and typography. It combines rapid image generation with high quality, making it ideal for posters, logos, and creative content.

byideogram

~$0.03/image

recraft-v3

Recraft-v3 is a state-of-the-art text-to-image model from Recraft, capable of generating images from long textual inputs in a wide range of styles. It is benchmarked as a leader in image generation and is designed for creative and professional applications.

byrecraft

~$0.02/image

imagen-3

Imagen-3 is Google’s high-quality text-to-image model, producing highly detailed images with rich lighting and minimal visual distractions. It is optimized for overall image quality and creative visual generation.

bygoogle

~$0.03/image

grok-2-aurora

Grok-2-Aurora is an autoregressive, mixture-of-experts model from xAI, trained on billions of text and image examples. It excels at photorealistic rendering, accurately following text instructions, and complex scene generation, leveraging deep world understanding built during training.

byx-ai

~$0.04/image

Models

flux-1-krea-dev

qwen-image

flux-1-kontext-max

flux-1-kontext-pro

imagen-4

kandinsky-3.1

sdxl

dall-e-3

midjourney

stable-diffusion-3-large

stable-diffusion-3.5-large

gpt-image-1

flux-1-schnell

flux-1-dev

flux-1-pro

flux-1.1-pro

flux-1.1-pro-ultra

ideogram-v2-turbo

recraft-v3

imagen-3

grok-2-aurora

Models

flux-1-krea-dev

qwen-image

flux-1-kontext-max

flux-1-kontext-pro

imagen-4

kandinsky-3.1

sdxl

dall-e-3

midjourney

stable-diffusion-3-large

stable-diffusion-3.5-large

gpt-image-1

flux-1-schnell

flux-1-dev

flux-1-pro

flux-1.1-pro

flux-1.1-pro-ultra

ideogram-v2-turbo

recraft-v3

imagen-3

grok-2-aurora

Categories

Input Modalities

Output Modalities

Tiers