Llama 3.1 405B Instruct

llama-3.1-405b-instruct
by meta-llama|Created May 25, 2025

The highly anticipated 400B class of Llama3 is here, offering a 128k context window and impressive evaluation scores. This 405B instruct-tuned version is optimized for high-quality dialogue and demonstrates strong performance compared to leading closed-source models, including GPT-4o and Claude 3.5 Sonnet.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$1.50

Output Tokens (1M)

$1.50

Capabilities

Input Modalities

Text

Output Modalities

Text

Usage Analytics

Token usage across the last 30 active days

Throughput

Time-To-First-Token (TTFT)