phi-4-multimodal-instruct
by microsoftA versatile 5.6B-parameter foundation model from Microsoft, combining advanced reasoning and instruction-following across both text and visual inputs. Supports multiple languages and delivers strong performance on multimodal tasks involving mathematical, scientific, and document ...
Pricing
Pay-as-you-go rates for this model. More details can be found here.
Input Tokens (1M)
$0.02
Output Tokens (1M)
$0.05
Capabilities
Input Modalities
Text
Image
Output Modalities
Text
Rate Limits
Requests per minute (RPM) and per day (RPD) by tier. More about tiers here
Tier | RPM | RPD |
---|---|---|
Free | — | — |
Tier 1 | 10 | 1000 |
Tier 2 | 15 | 1500 |
Tier 3 | 25 | 2500 |
Tier 4 | 50 | 5000 |
Usage Analytics
Token usage across the last 30 active days