Token usage over time
Browse models from Google
Gemini 2.5 Flash (Free)
Gemini 2.5 Flash is Google’s high-performance workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. Includes built-in "thinking" capabilities and is configurable through a "max tokens for reasoning" parameter for fine-tuned performance.
Gemini 3 Pro Preview
Gemini 3 Pro Preview is Google’s most advanced AI model, setting new records on leading benchmarks like LMArena (1501 Elo), GPQA Diamond (91.9%), and MathArena Apex (23.4%). With powerful reasoning and deep multimodal understanding across text, images, code, video, and audio, Gemini 3 Pro Preview delivers nuanced, context-aware responses and excels at complex problem-solving, scientific analysis, and creative coding tasks.
Gemini 2.0 Flash (Free)
Gemini Flash 2.0 offers significantly faster time to first token (TTFT) compared to previous versions, while maintaining quality on par with larger models. Introduces enhancements in multimodal understanding, coding, complex instruction following, and function calling for robust agentic experiences.
Gemini 2.0 Flash
Gemini Flash 2.0 offers significantly faster time to first token (TTFT) compared to previous versions, while maintaining quality on par with larger models. Introduces enhancements in multimodal understanding, coding, complex instruction following, and function calling for robust agentic experiences.
Gemini 2.5 Pro
Gemini 2.5 Pro is Google’s state-of-the-art AI model, designed for advanced reasoning, coding, mathematics, and scientific tasks. Employs “thinking” capabilities for nuanced context handling and achieves top-tier performance on multiple benchmarks, including first-place on the LMArena leaderboard.
Gemini 2.5 Flash
Gemini 2.5 Flash is Google’s high-performance workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. Includes built-in "thinking" capabilities and is configurable through a "max tokens for reasoning" parameter for fine-tuned performance.
Gemini 2.5 Flash Image
Gemini 2.5 Flash Image, also known as "Nano Banana" is a state-of-the-art image generation model with strong contextual understanding. It supports image generation, editing, and multi-turn conversational interactions.
Gemini 3 Pro Image Preview (Nano Banana Pro)
Gemini 3 Pro Image Preview (Nano Banana Pro) is Google’s most advanced image generation and editing model, built on Gemini 3 Pro. Building on the original Nano Banana, it offers much improved multimodal reasoning, real-world grounding, and high-fidelity visual synthesis. The model produces context-rich visuals—from infographics and diagrams to cinematic composites—and can incorporate up-to-the-minute information through Search grounding. It leads the industry with sophisticated text rendering in images, handles consistent multi-image blending, and maintains accurate identity preservation for up to five subjects. Nano Banana Pro gives users fine-grained creative controls like localized edits, lighting and focus adjustments, camera transformations, 2K/4K output, and flexible aspect ratios. Tailored for professional design, product visualization, storyboarding, and complex compositions, it remains efficient for everyday image creation needs.
Gemini Embedding 001
Gemini-Embedding-001 is Google’s top-ranked multilingual embedding model, supporting over 100 languages and flexible output dimensions (3072, 1536, or 768). It is optimized for semantic search, clustering, and recommendations, and leverages Matryoshka Representation Learning for efficient, high-quality embeddings.
Gemini 2.5 Flash Preview 09-2025
Gemini 2.5 Flash Preview September 2025 Checkpoint is Google’s high-performance model, built for advanced reasoning, code generation, mathematical tasks, and scientific applications. This version introduces faster, more efficient output and smarter tool use for complex, multi-step workflows.
Gemini 2.5 Flash Lite Preview 09-2025
Gemini 2.5 Flash-Lite Preview September 2025 Checkpoint is a lightweight, high-throughput model from the Gemini 2.5 family, focused on ultra-low latency and cost efficiency. It delivers even faster token generation, concise output, and improved performance on standard benchmarks compared to earlier Flash-Lite models, making it ideal for large-scale, real-time applications.
Gemini 2.0 Flash Lite
Gemini 2.0 Flash Lite is optimized for extremely fast response times and low cost, while maintaining the quality of larger models. Ideal for real-time and large-scale applications.
Imagen 4
Imagen-4 is Google’s latest text-to-image model, engineered for photorealistic quality, improved fine details, advanced spelling and typography rendering, and high accuracy across diverse art styles. It includes SynthID watermarking for AI-generated content identification and is benchmarked as a leader in human preference evaluations.
Gemini 2.5 Flash Image Preview
Gemini 2.5 Flash Image Preview is a state of the art image generation model with contextual understanding. It is capable of image generation, edits, and multi-turn conversations.
Gemma 3 27B IT
Google’s latest open-source multimodal model, Gemma 3 27B, supports vision-language input and text outputs, handles context windows up to 128k tokens, and understands over 140 languages. Offers improved math, reasoning, and chat capabilities, including structured outputs and function calling.