Llama 3.1 8B Instruct

llama-3.1-8b-instruct
by meta-llama|Created May 26, 2025

Meta’s Llama 3.1 8B instruct-tuned model, designed for fast and efficient dialogue. It performs strongly in human evaluations and is ideal for applications requiring a balance of speed and quality.

Pricing

Pay-as-you-go rates for this model. More details can be found here.

Input Tokens (1M)

$0.05

Output Tokens (1M)

$0.05

Capabilities

Input Modalities

Text

Output Modalities

Text

Usage Analytics

Token usage across the last 30 active days

Throughput

Time-To-First-Token (TTFT)