llama-3.2-90b-vision-instruct
by meta-llama
The Llama 90B Vision model is a top-tier, 90-billion-parameter multimodal model from Meta, designed for challenging visual reasoning and language tasks. It excels at image captioning, visual question answering, and advanced image-text comprehension, and is pre-trained on vast mul...
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Token type | Price per 1M tokens |
---|---|
Input | $0.45 |
Output | $0.45 |
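As a rough illustration of how the per-million-token rates above translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost_usd` is hypothetical, and the page does not say how image inputs are converted to tokens, so the function assumes you already know the token counts.

```python
# Rates copied from the pricing listed above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.45
OUTPUT_USD_PER_MILLION = 0.45


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate the pay-as-you-go cost of one request.

    Assumes token counts are already known; how images are tokenized
    is not stated on this page.
    """
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a request with 2,000 input tokens and 500 output tokens.
    print(f"${estimate_cost_usd(2_000, 500):.6f}")
```

Because input and output are priced identically here, the cost depends only on the total token count; the two rates are kept separate so the sketch still works if they diverge.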
Capabilities
Input Modalities
Text
Image
Output Modalities
Text
Rate Limits
Requests per minute (RPM) and per day (RPD) by tier. More about tiers here
Tier | RPM | RPD |
---|---|---|
Free | — | — |
Tier 1 | 10 | 1000 |
Tier 2 | 15 | 1500 |
Tier 3 | 25 | 2500 |
Tier 4 | 50 | 5000 |
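To stay under the limits in the table above, a client can track its own request timestamps before sending. The following is a minimal sketch, not the provider's enforcement logic: the class name `RequestBudget` is hypothetical, and it assumes sliding 60-second and 24-hour windows, which the page does not confirm. The Free tier is omitted because no limits are listed for it.

```python
import time
from collections import deque

# Limits copied from the table above: tier -> (RPM, RPD).
TIER_LIMITS = {
    "Tier 1": (10, 1000),
    "Tier 2": (15, 1500),
    "Tier 3": (25, 2500),
    "Tier 4": (50, 5000),
}


class RequestBudget:
    """Hypothetical client-side tracker using sliding windows (an assumption)."""

    def __init__(self, tier: str, clock=time.monotonic):
        self.rpm, self.rpd = TIER_LIMITS[tier]
        self._clock = clock
        self._minute = deque()  # timestamps within the last 60 seconds
        self._day = deque()     # timestamps within the last 24 hours

    def try_acquire(self) -> bool:
        """Record a request and return True if it fits both limits."""
        now = self._clock()
        # Drop timestamps that have aged out of each window.
        while self._minute and now - self._minute[0] >= 60:
            self._minute.popleft()
        while self._day and now - self._day[0] >= 86_400:
            self._day.popleft()
        if len(self._minute) >= self.rpm or len(self._day) >= self.rpd:
            return False
        self._minute.append(now)
        self._day.append(now)
        return True
```

A caller would check `budget.try_acquire()` before each request and wait or queue the request when it returns `False`. The injectable `clock` argument exists only to make the tracker easy to test without real delays.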