Llama 3.3 70B Instruct

llama-3.3-70b-instruct
by meta-llama|Created May 25, 2025

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters. Optimized for multilingual dialogue, it outperforms many open-source and closed chat models on industry benchmarks. Supported languages include English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.29

Output Tokens (1M)

$0.39

Capabilities

Input Modalities

Text

Output Modalities

Text

Usage Analytics

Token usage across the last 30 active days

Throughput

Time-To-First-Token (TTFT)