Models

Explore AI models available through NagaAI.

18 models

Sort by

18 models

Nano Banana 2 (Gemini 3.1 Flash Image Preview)

302M Tokens

Nano Banana 2 (Gemini 3.1 Flash Image) is Google DeepMind’s flagship Flash image model for high-fidelity generation and fast, advanced editing at scale, optimized for price–performance. It follows complex prompts more reliably and adds configurable thinking levels (Minimal vs High/Dynamic) to balance latency and quality. Nano Banana 2 improves in-image text rendering and supports in-image localization (generate/translate text across languages directly in the image), while leveraging stronger world knowledge and web image search for more grounded, realistic outputs. It supports native aspect ratios (including 4:1, 1:4, 8:1, 1:8) and 512px/1K/2K/4K resolutions.

Google

$0.13/1M input tokens$0.75/1M output tokens

-50%

Seedream 5 Lite

10.2M Tokens

Seedream 5.0 lite is ByteDance’s latest proprietary image generation model. Compared to earlier Seedream versions, it extends beyond standard text-to-image generation by integrating multi-step logical reasoning, example-based editing, and deep domain knowledge into the creation workflow. These upgrades enable more accurate transformations, stronger structural correctness, and more reliable results in professional and technical scenarios. The model introduces example-based editing: instead of describing a complex change in words, users can provide a before/after reference pair and then a new image—the model infers the edit and applies the same transformation for tasks like material swaps, style transfers, and scene modifications. Seedream 5.0 lite also improves logical reasoning over spatial relationships, physics, and sequential processes, helping it place objects correctly, depict mechanisms faithfully, and illustrate multi-stage transformations with consistent details. In addition, its deep domain knowledge supports convention-aware outputs in fields such as architecture, science, health, and design, enabling workflows like turning rough floor plan sketches into photorealistic interior renders or producing labeled, accurate scientific diagrams.

ByteDance

~$0.02/image

-50%

Flux 2 Pro

1.61M Tokens

Ideal for high-quality image manipulation, style transfer, and sequential editing workflows

Black Forest Labs

~$0.01/image

-50%

Flux 2 Max

362K Tokens

FLUX.2 [max] delivers state-of-the-art image generation and advanced image editing with exceptional realism, precision, and consistency.

Black Forest Labs

~$0.04/image

-50%

Flux 2 Flex

79K Tokens

FLUX.2 [flex] excels at rendering complex text, typography, and fine details, and supports multi-reference editing within the same unified architecture.

Black Forest Labs

~$0.03/image

-50%

Flux 2 Klein 4B

2.39M Tokens

FLUX.2 [klein] 4B is the quickest and most budget-friendly model in the FLUX.2 family, designed for high-throughput workloads while still delivering excellent image quality.

Black Forest Labs

~$0.007/image

-50%

Qwen Image Edit 2511

891K Tokens

Qwen-Image-Edit-2511 is the latest proprietary image editing model from Qwen, delivering substantial upgrades over its predecessor, Qwen-Image-Edit-2509. The new version features notable improvements in editing consistency, especially in multi-subject scenarios and character preservation, allowing for more faithful subject representation across edited images. Integrated support for popular community LoRAs now enables advanced lighting control and novel viewpoint generation natively. In addition, Qwen-Image-Edit-2511 offers enhanced industrial design capabilities, robust geometric reasoning for technical annotations, and improved fusion of multiple images. These advances result in more reliable, visually coherent, and creative image editing—making Qwen-Image-Edit-2511 a powerful and versatile tool for both imaginative and practical visual applications.

Qwen

~$0.01/image

-50%

Seedream 4.5

1.29B Tokens

Seedream 4.5 is the newest proprietary image generation model from ByteDance. Compared to Seedream 4.0, it offers substantial overall improvements—particularly in editing consistency, where it better maintains subject details, lighting, and color tones. The model also delivers enhanced portrait clarity and improved small-text rendering. Its ability to compose multiple images has been significantly upgraded, and advances in both inference performance and visual aesthetics allow for more accurate and artistically expressive image creation.

ByteDance

~$0.02/image

-50%

GPT Image 1.5

12.6M Tokens

GPT-Image-1.5 is the flagship image generation and editing model from OpenAI, designed for precise, natural, and fast creation. It reliably follows user instructions down to fine details, preserving critical elements like lighting, composition, and facial likeness across edits and generations. GPT-Image-1.5 excels at a wide range of editing tasks—including addition, removal, stylization, combination, and advanced text rendering—producing images that closely match user intent. With up to 4× faster generation speeds compared to previous versions, it streamlines creative workflows, enabling quick iterations whether you need a simple fix or a total visual transformation. Enhanced integration and lower API costs make GPT-Image-1.5 ideal for marketing, product visualization, ecommerce, and creative tools scenarios, while its dedicated editor and presets provide a delightful, accessible creative space for both practical and expressive image work.

OpenAI

~$0.03/image

-50%

Gemini 3 Pro Image Preview (Nano Banana Pro)

95.4M Tokens

Gemini 3 Pro Image Preview (Nano Banana Pro) is Google’s most advanced image generation and editing model, built on Gemini 3 Pro. Building on the original Nano Banana, it offers much improved multimodal reasoning, real-world grounding, and high-fidelity visual synthesis. The model produces context-rich visuals—from infographics and diagrams to cinematic composites—and can incorporate up-to-the-minute information through Search grounding. It leads the industry with sophisticated text rendering in images, handles consistent multi-image blending, and maintains accurate identity preservation for up to five subjects. Nano Banana Pro gives users fine-grained creative controls like localized edits, lighting and focus adjustments, camera transformations, 2K/4K output, and flexible aspect ratios. Tailored for professional design, product visualization, storyboarding, and complex compositions, it remains efficient for everyday image creation needs.

Google

$1.00/1M input tokens$6.00/1M output tokens

-50%

Seedream 4

281M Tokens

Seedream 4.0 is ByteDance’s advanced text-to-image and image editing model, designed for high-speed, high-resolution image generation and robust contextual understanding. It unifies generation and editing in a single architecture, supports complex visual tasks with natural-language instructions, and excels at multi-reference batches and diverse style transfers. Seedream 4.0 stands out for its ability to handle both content creation and modification, offering creative professionals and enterprises an all-in-one, efficient solution for imaginative and knowledge-driven visual tasks.

ByteDance

~$0.01/image

-50%

Gemini 2.5 Flash Image

32.7M Tokens

Gemini 2.5 Flash Image, also known as "Nano Banana" is a state-of-the-art image generation model with strong contextual understanding. It supports image generation, editing, and multi-turn conversational interactions.

Google

$0.15/1M input tokens$1.25/1M output tokens

-50%

Qwen Image Edit 2509

254K Tokens

Qwen-Image-Edit-2509 is the latest iteration of the Qwen-Image-Edit model, released in September. It introduces multi-image editing capabilities by building on the original architecture and further training with image concatenation, supporting combinations like “person + person,” “person + product,” and “person + scene,” with optimal performance for 1 to 3 images. For single-image editing, Qwen-Image-Edit-2509 delivers improved consistency, particularly in person editing (better facial identity preservation and support for various portrait styles), product editing (enhanced product identity retention), and text editing (support for modifying fonts, colors, and materials in addition to content). The model also natively supports ControlNet features, such as depth maps, edge maps, and keypoint maps.

Qwen

~$0.01/image

-50%

Gemini 2.5 Flash Image Preview

30.3M Tokens

Gemini 2.5 Flash Image Preview is a state of the art image generation model with contextual understanding. It is capable of image generation, edits, and multi-turn conversations.

Google

$0.15/1M input tokens$1.25/1M output tokens

-50%

Qwen Image

883K Tokens

Qwen-Image is a foundation image generation model from the Qwen team, excelling at high-fidelity text rendering, complex text integration (including English and Chinese), and diverse artistic styles. It supports advanced editing features such as style transfer, object manipulation, and human pose editing, and is suitable for both image generation and understanding tasks.

Qwen

~$0.01/image

-50%

Flux 1 Kontext Max

516K Tokens

Flux-1-Kontext-Max is a premium text-based image editing model from Black Forest Labs, delivering maximum performance and advanced typography generation for transforming images through natural language prompts. It is designed for high-end creative and professional use.

Black Forest Labs

~$0.04/image

-50%

Flux 1 Kontext Pro

4.64M Tokens

Flux-1-Kontext-Pro is a state-of-the-art text-based image editing model from Black Forest Labs, providing high-quality, prompt-adherent output for transforming images using natural language. It is optimized for consistent results and advanced editing tasks.

Black Forest Labs

~$0.02/image

-50%

GPT Image 1

3.1M Tokens

OpenAI’s new state-of-the-art image generation model. This is a natively multimodal language model that accepts both text and image inputs and produces image outputs. It powers image generation in ChatGPT, offering exceptional prompt adherence, a high level of detail, and quality.

OpenAI

~$0.04/image

-50%