Token usage over time
Browse models from Google
Gemini 2.5 Flash (Free)
Gemini 2.5 Flash is Google’s high-performance workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. Includes built-in "thinking" capabilities and is configurable through a "max tokens for reasoning" parameter for fine-tuned performance.
Gemini 2.5 Pro
Gemini 2.5 Pro is Google’s state-of-the-art AI model, designed for advanced reasoning, coding, mathematics, and scientific tasks. Employs “thinking” capabilities for nuanced context handling and achieves top-tier performance on multiple benchmarks, including first-place on the LMArena leaderboard.
Gemini 2.0 Flash
Gemini Flash 2.0 offers significantly faster time to first token (TTFT) compared to previous versions, while maintaining quality on par with larger models. Introduces enhancements in multimodal understanding, coding, complex instruction following, and function calling for robust agentic experiences.
Gemini 2.5 Flash
Gemini 2.5 Flash is Google’s high-performance workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. Includes built-in "thinking" capabilities and is configurable through a "max tokens for reasoning" parameter for fine-tuned performance.
Gemini Embedding 001
Gemini-Embedding-001 is Google’s top-ranked multilingual embedding model, supporting over 100 languages and flexible output dimensions (3072, 1536, or 768). It is optimized for semantic search, clustering, and recommendations, and leverages Matryoshka Representation Learning for efficient, high-quality embeddings.
Gemini 2.5 Flash Lite Preview 09-2025
Gemini 2.5 Flash-Lite Preview September 2025 Checkpoint is a lightweight, high-throughput model from the Gemini 2.5 family, focused on ultra-low latency and cost efficiency. It delivers even faster token generation, concise output, and improved performance on standard benchmarks compared to earlier Flash-Lite models, making it ideal for large-scale, real-time applications.
Gemini 2.5 Flash Preview 09-2025
Gemini 2.5 Flash Preview September 2025 Checkpoint is Google’s high-performance model, built for advanced reasoning, code generation, mathematical tasks, and scientific applications. This version introduces faster, more efficient output and smarter tool use for complex, multi-step workflows.
Gemini 2.5 Flash Image
Gemini 2.5 Flash Image, also known as "Nano Banana," is a state-of-the-art image generation model with strong contextual understanding. It supports image generation, editing, and multi-turn conversational interactions.
Gemini 2.0 Flash (Free)
Gemini Flash 2.0 offers significantly faster time to first token (TTFT) compared to previous versions, while maintaining quality on par with larger models. Introduces enhancements in multimodal understanding, coding, complex instruction following, and function calling for robust agentic experiences.
Gemini 2.0 Flash Lite
Gemini 2.0 Flash Lite is optimized for extremely fast response times and low cost, while maintaining the quality of larger models. Ideal for real-time and large-scale applications.
Gemini 2.5 Flash Image Preview
Gemini 2.5 Flash Image Preview is a state of the art image generation model with contextual understanding. It is capable of image generation, edits, and multi-turn conversations.
Imagen 4
Imagen-4 is Google’s latest text-to-image model, engineered for photorealistic quality, improved fine details, advanced spelling and typography rendering, and high accuracy across diverse art styles. It includes SynthID watermarking for AI-generated content identification and is benchmarked as a leader in human preference evaluations.
Imagen 3
Imagen-3 is Google’s high-quality text-to-image model, producing highly detailed images with rich lighting and minimal visual distractions. It is optimized for overall image quality and creative visual generation.
Gemma 3 27B IT
Google’s latest open-source multimodal model, Gemma 3 27B, supports vision-language input and text outputs, handles context windows up to 128k tokens, and understands over 140 languages. Offers improved math, reasoning, and chat capabilities, including structured outputs and function calling.