DeepSeek V3 is a 685B-parameter, mixture-of-experts model and the latest iteration of the flagship chat model family from the DeepSeek team. Succeeds the previous DeepSeek V3 model and demonstrates strong performance across a variety of tasks.
Free
Not enough activity data to display a chart
Not enough throughput data
Not enough TTFT data
Not enough uptime data to display a chart