Gemini 2.0 Flash Lite is optimized for very fast response times and low cost while aiming to stay close to the quality of larger models. It is well suited to real-time and large-scale applications.
$0.04
$0.15
Time to first token (TTFT): not enough data
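from openai import OpenAI

# Point the OpenAI client at the provider's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.naga.ac/v1",
    api_key="YOUR_API_KEY",
)

# Send a single-turn chat request to the model.
resp = client.chat.completions.create(
    model="gemini-2.0-flash-lite-001",
    messages=[
        {"role": "user", "content": "What's 2+2?"}
    ],
    temperature=0.2,
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)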
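Since the listing has no time-to-first-token figure yet, one way to get a rough number yourself is to stream a response and time the arrival of the first text chunk. The following is a minimal sketch, not the provider's own measurement method: `measure_ttft` and `stream_text` are hypothetical helper names, and it assumes the endpoint accepts `stream=True` in the same way as the OpenAI Python SDK, which the listing does not confirm.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def measure_ttft(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], str]:
    """Return (seconds until the first non-empty chunk, full text).

    Hypothetical helper: the timer starts before iteration begins, so a lazy
    generator that sends the request on first iteration is timed end to end.
    """
    start = clock()
    ttft: Optional[float] = None
    parts = []
    for text in chunks:
        if not text:
            continue  # skip empty deltas (e.g. role-only chunks)
        if ttft is None:
            ttft = clock() - start
        parts.append(text)
    return ttft, "".join(parts)


def stream_text(client, model: str, prompt: str) -> Iterator[str]:
    """Yield text deltas from a streaming chat completion.

    Assumes an OpenAI SDK v1 style client and that the endpoint supports
    stream=True (an assumption, not stated in the listing).
    """
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    )
    for chunk in stream:
        if chunk.choices and chunk.choices[0].delta.content:
            yield chunk.choices[0].delta.content


# Example usage with the client from the snippet above (requires network):
# ttft, text = measure_ttft(
#     stream_text(client, "gemini-2.0-flash-lite-001", "What's 2+2?")
# )
# print(f"TTFT: {ttft:.3f}s, reply: {text}")
```

A single measurement is noisy, so averaging over several requests gives a more useful figure. The value also includes network latency from wherever the script runs, not only the model's own response time.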