Models
Explore AI models available through NagaAI.
Step 3.5 Flash is our most capable open-source foundation model, designed to deliver frontier-level reasoning and agentic performance with standout efficiency. It uses a sparse Mixture of Experts (MoE) architecture that activates only 11B of its 196B parameters per token, concentrating “intelligence density” to approach top proprietary models while staying fast enough for real-time interaction. Built for rapid, deep reasoning, it is powered by 3-way Multi-Token Prediction (MTP-3), which enables typical generation speeds of 100–300 tok/s (up to ~350 tok/s in single-stream coding). For coding and long-horizon agent work, it was trained with a scalable RL framework that supports stable autonomous execution, reaching 74.4% on SWE-bench Verified and 51.0% on Terminal-Bench 2.0. For long-context workloads, Step 3.5 Flash offers a cost-efficient 256K context window via a hybrid attention design with a 3:1 Sliding Window Attention ratio (three SWA layers per full-attention layer), maintaining performance on large codebases and massive documents while cutting the compute burden typical of long-context models.
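A minimal sketch of calling the model, assuming an OpenAI-compatible chat endpoint; the base URL, API key placeholder, and the "step-3.5-flash" model ID are assumptions, so check the live model list for the exact values:

```python
from openai import OpenAI

# Base URL and model ID are assumptions; substitute the values from your
# NagaAI dashboard and the live model listing.
client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")

resp = client.chat.completions.create(
    model="step-3.5-flash",  # hypothetical ID for Step 3.5 Flash
    messages=[
        {"role": "user", "content": "Summarize the trade-offs of sparse MoE models."},
    ],
)
print(resp.choices[0].message.content)
```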
Trinity-Large-Preview is a frontier-scale open-weight language model from Arcee, built as a 400B-parameter sparse Mixture-of-Experts with 13B active parameters per token using 4-of-256 expert routing. It delivers exceptional performance in creative writing, storytelling, role-play, and conversational scenarios, surpassing typical reasoning models, particularly in real-time voice assistance. The model is also optimized for agentic work: it navigates agent frameworks such as OpenCode, Cline, and Kilo Code while managing intricate toolchains and long, constraint-heavy prompts. The architecture natively supports context windows of up to 512k tokens, though the current Preview API is served at a 128k context using 8-bit quantization for efficient deployment. Trinity-Large-Preview reflects Arcee’s efficiency-first design: a production-grade frontier model with open weights and permissive licensing, suited to both real-world deployment and rigorous experimentation.
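Because the model is tuned for tool use, a hedged sketch of OpenAI-style function calling may help; the "trinity-large-preview" ID and the get_weather tool are illustrative assumptions, not values from the source:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

# A toy tool definition; real agent frameworks register many of these.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration only
        "description": "Look up current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

resp = client.chat.completions.create(
    model="trinity-large-preview",  # hypothetical ID
    messages=[{"role": "user", "content": "What's the weather in Lisbon?"}],
    tools=tools,
)
# If the model decides to call the tool, the request shows up here.
for call in resp.choices[0].message.tool_calls or []:
    print(call.function.name, call.function.arguments)
```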
GPT-5 Mini is a compact variant of GPT-5, designed for efficient handling of lighter-weight reasoning and conversational tasks. It retains the instruction-following and safety features of its larger counterpart at reduced latency and cost. As the direct successor to OpenAI’s o4-mini, it is well suited to scalable, cost-sensitive deployments.
Gemini 2.5 Flash is Google’s high-performance workhorse model, designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in “thinking” capabilities and can be tuned through a “max tokens for reasoning” parameter to trade response quality against latency and cost.
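The exact spelling of the reasoning-budget parameter varies by gateway; the sketch below passes it via the OpenAI client’s extra_body, and both that field name and the "gemini-2.5-flash" ID are assumptions to verify against the NagaAI docs:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

resp = client.chat.completions.create(
    model="gemini-2.5-flash",  # hypothetical ID
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
    # Field name is an assumption; consult the NagaAI docs for the exact
    # spelling of the reasoning-budget parameter.
    extra_body={"reasoning": {"max_tokens": 2048}},
)
print(resp.choices[0].message.content)
```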
Eleven-Multilingual-v2 is ElevenLabs’ most advanced multilingual text-to-speech model, delivering high-quality voice synthesis across a wide range of languages with improved realism and expressiveness. It is optimized for both accuracy and naturalness in multilingual scenarios.
DALL-E 3 is OpenAI’s third-generation text-to-image model, offering enhanced detail, accuracy, and the ability to understand complex prompts. It excels at generating realistic and creative images, handling intricate details like text and human anatomy, and supports various aspect ratios for flexible output.
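A minimal sketch against an OpenAI-style images endpoint; the base URL is an assumption, while "dall-e-3" and the listed sizes follow OpenAI’s own API:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

img = client.images.generate(
    model="dall-e-3",
    prompt="A watercolor lighthouse on a cliff at dawn, with a legible sign reading 'North Point'",
    size="1792x1024",  # DALL-E 3 also accepts 1024x1024 and 1024x1792
)
print(img.data[0].url)  # URL of the generated image
```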
A text-to-speech model built on GPT-4o mini, a fast and powerful language model. Use it to convert text into natural-sounding spoken audio.
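A hedged sketch of a speech request using the OpenAI audio API shape; the base URL is an assumption, and "gpt-4o-mini-tts" with the stock "alloy" voice follows OpenAI’s naming:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

speech = client.audio.speech.create(
    model="gpt-4o-mini-tts",
    voice="alloy",  # one of the stock OpenAI voices
    input="Thanks for calling; how can I help you today?",
)
speech.write_to_file("greeting.mp3")  # writes the returned audio bytes to disk
```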
Flux-1-Schnell is a high-speed, open-source text-to-image model from Black Forest Labs, optimized for rapid, high-quality image generation in just a few steps. It is ideal for applications where speed and efficiency are critical.
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters. Optimized for multilingual dialogue, it outperforms many open-source and closed chat models on industry benchmarks. Supported languages include English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
Whisper Large v3 is OpenAI’s state-of-the-art model for automatic speech recognition (ASR) and speech translation. Trained on over 1 million hours of weakly labeled audio and 4 million hours of pseudo-labeled audio, it generalizes strongly across datasets and domains, excelling at zero-shot transcription and translation.
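A minimal transcription sketch using the OpenAI-style audio endpoint; the base URL and the "whisper-large-v3" model ID are assumptions (OpenAI’s own API names its hosted Whisper "whisper-1"), and "meeting.wav" is a placeholder local file:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

# "meeting.wav" is a placeholder; any supported audio format works.
with open("meeting.wav", "rb") as audio:
    transcript = client.audio.transcriptions.create(
        model="whisper-large-v3",  # hypothetical ID for this listing
        file=audio,
    )
print(transcript.text)
```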
Sonar is Perplexity’s lightweight, affordable, and fast question-answering model, now featuring citations and customizable sources. It is designed for companies seeking to integrate rapid, citation-enabled Q&A features optimized for speed and simplicity.
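A hedged sketch of a citation-enabled query; the "sonar" ID and base URL are assumptions, and while Perplexity’s native API returns a top-level "citations" list of URLs, whether the gateway forwards it is also an assumption, so the sketch reads it defensively:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

resp = client.chat.completions.create(
    model="sonar",  # hypothetical ID
    messages=[{"role": "user", "content": "What are the latest findings on sea-level rise?"}],
)
print(resp.choices[0].message.content)

# "citations" is Perplexity's native field; it may not be forwarded here,
# so fall back to an empty list rather than assuming it exists.
for url in getattr(resp, "citations", None) or []:
    print(url)
```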
Kandinsky-3.1 is a large text-to-image diffusion model developed by Sber and AIRI, featuring 11.9 billion parameters. The model consists of a text encoder, U-Net, and decoder, enabling high-quality, detailed image generation from text prompts. It is trained on extensive datasets and is designed for both creative and scientific applications.
Stable Diffusion XL (SDXL) is a powerful text-to-image generation model from Stability AI, featuring a 3x larger UNet, dual text encoders (OpenCLIP ViT-bigG/14 alongside the original CLIP ViT-L), and a two-stage base-plus-refiner pipeline for generating highly detailed, controllable images. It introduces size- and crop-conditioning for greater control and quality in image generation.
Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model from Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout routes each token among 16 experts (the “16E” in its name) and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, it is instruction-tuned for multilingual chat, captioning, and image understanding.
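Since Scout accepts image input, a sketch using the standard OpenAI-style multimodal message format; the base URL, the "llama-4-scout-17b-16e-instruct" ID, and the image URL are assumptions:

```python
from openai import OpenAI

client = OpenAI(base_url="https://api.naga.ac/v1", api_key="YOUR_NAGAAI_KEY")  # URL is an assumption

resp = client.chat.completions.create(
    model="llama-4-scout-17b-16e-instruct",  # hypothetical ID
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this chart in two sentences."},
            # Standard OpenAI-style image content part; a data: URL with
            # base64-encoded image bytes also works.
            {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
        ],
    }],
)
print(resp.choices[0].message.content)
```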
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains the 1 million token context window and demonstrates strong coding ability and vision understanding, making it suitable for interactive applications with tight performance constraints.