Mistral: Mistral Large 3 2512 vs NVIDIA: Nemotron 3 Super

Side-by-side comparison — pricing, context window, capabilities, and live leaderboard data.

Key differences

  • NVIDIA: Nemotron 3 Super has a 1000K-token context window — 3.8× larger than Mistral: Mistral Large 3 2512's 262K.
  • NVIDIA: Nemotron 3 Super is 82% cheaper per 1K input tokens than Mistral: Mistral Large 3 2512 ($0.0001 vs $0.0005).
  • Mistral: Mistral Large 3 2512 is built by Mistral; NVIDIA: Nemotron 3 Super is built by NVIDIA.
  • Mistral: Mistral Large 3 2512 supports vision (image inputs); the other does not.

Specifications

Mistral: Mistral Large 3 2512NVIDIA: Nemotron 3 Super
ProviderMistralNVIDIA
Context window262K1000K
Max output tokens81928192
Input price / 1K tokens$0.0005$0.0001
Output price / 1K tokens$0.0015$0.0004
Vision supportYesNo
Function callingNoNo
LicenseProprietaryProprietary

About Mistral: Mistral Large 3 2512

EU·Closed

Mistral Large 3 2512 is Mistral’s most capable model to date, featuring a sparse mixture-of-experts architecture with 41B active parameters (675B total), and released under the Apache 2.0 license.

View Mistral: Mistral Large 3 2512 reliability and benchmark history →

About NVIDIA: Nemotron 3 Super

US·Closed

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

View NVIDIA: Nemotron 3 Super reliability and benchmark history →

Try them yourself

Run the same prompt against Mistral: Mistral Large 3 2512 and NVIDIA: Nemotron 3 Super side-by-side on PromptPit.

Start a comparison

Or browse the full AI model leaderboard.