llama-3.2-90b-vision-instruct

by meta-llama

Try In Playground

The Llama 90B Vision model is a top-tier, 90-billion-parameter multimodal model from Meta, designed for challenging visual reasoning and language tasks. It excels at image captioning, visual question answering, and advanced image-text comprehension, and is pre-trained on vast mul...

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.45

Output Tokens (1M)

$0.45

Capabilities

Input Modalities

Text

Image

Output Modalities

Text

Rate Limits

Requests per minute (RPM) and per day (RPD) by tier. More about tiers here

Tier	RPM	RPD
Free	—	—
Tier 1	10	1000
Tier 2	15	1500
Tier 3	25	2500
Tier 4	50	5000

Usage Analytics

Token usage across the last 30 active days

Not enough activity data to display a chart

Try In Playground