Qwen3.7 Plus vs MiniMax M3

Compare Qwen3.7 Plus and MiniMax M3 on key metrics including price, context length, throughput, and other model features.

AuthorQwen

Context Length1M

Supports Tools

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its vision-language abilities while retaining full-stack, agent-level intelligence for coding, tool use, and productivity workflows. Its distinguishing trait is multi-modal interactive hybrid agent capability: it can perceive real-world scenes, read screens and interact with GUIs, generate code from visual references, and perform end-to-end navigation within mobile apps.

Activity

Last 14 days

Prompt

412M

Completion

Total

419M

Startup

Qwen

Latency (p50)1.74s

Throughput (p50)51.7 tok/s

Pricing

Input$0.20/M tokens

Output$0.80/M tokens

Cached input$0.04/M tokens

Features

Input Modalitiestext, image

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model

AuthorMinimax

Context Length1M

Supports Tools

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding, and tool use. It is built on MiniMax Sparse Attention (MSA), which replaces full attention with KV-block selection to cut per-token compute at long context — roughly 1/20 the cost of the previous generation at 1M tokens, with substantially faster prefill and decode while retaining quality across most tasks. Trained as a native multimodal model on interleaved data and tuned for multi-turn, production-like collaboration via an interactive user-simulator framework, the model is oriented toward sustained, multi-step tasks rather than single-turn execution.

Activity

Last 14 days

Prompt

224M

Completion

Total

227M

Startup

Minimax

Latency (p50)1.54s

Throughput (p50)65.9 tok/s

Pricing

Input$0.15/M tokens

Output$0.60/M tokens

Cached input$0.03/M tokens

Features

Input Modalitiestext, image

Output Modalitiestext

Supported EndpointsChat Completions

Vision

Supports Tools

Go to model