Mistral: Mistral Large 3 2512 vs NVIDIA: Nemotron 3 Super

Side-by-side comparison — pricing, context window, capabilities, and live leaderboard data.

Key differences

NVIDIA: Nemotron 3 Super has a 1000K-token context window — 3.8× larger than Mistral: Mistral Large 3 2512's 262K.
NVIDIA: Nemotron 3 Super is 82% cheaper per 1K input tokens than Mistral: Mistral Large 3 2512 ($0.0001 vs $0.0005).
Mistral: Mistral Large 3 2512 is built by Mistral; NVIDIA: Nemotron 3 Super is built by NVIDIA.
Mistral: Mistral Large 3 2512 supports vision (image inputs); the other does not.

Specifications

	Mistral: Mistral Large 3 2512	NVIDIA: Nemotron 3 Super
Provider	Mistral	NVIDIA
Context window	262K	1000K
Max output tokens	8192	8192
Input price / 1K tokens	$0.0005	$0.0001
Output price / 1K tokens	$0.0015	$0.0004
Vision support	Yes	No
Function calling	No	No
License	Proprietary	Proprietary

About Mistral: Mistral Large 3 2512

EU·Closed

Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license.

View Mistral: Mistral Large 3 2512 reliability and benchmark history →

About NVIDIA: Nemotron 3 Super

US·Closed

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

View NVIDIA: Nemotron 3 Super reliability and benchmark history →

Try them yourself

Run the same prompt against Mistral: Mistral Large 3 2512 and NVIDIA: Nemotron 3 Super side-by-side on PromptPit.

Start a comparison

Or browse the full AI model leaderboard.