xAI: Grok 4.20 vs Microsoft: Phi 4

Side-by-side comparison — pricing, context window, capabilities, and live leaderboard data.

Key differences

xAI: Grok 4.20 has a 2000K-token context window, about 122.1× the size of Microsoft: Phi 4's 16K.
Microsoft: Phi 4 is about 95% cheaper per 1K input tokens than xAI: Grok 4.20 (listed prices, which appear to be rounded: $0.0001 vs $0.0013).
xAI: Grok 4.20 is built by xAI; Microsoft: Phi 4 is built by Microsoft.
xAI: Grok 4.20 supports vision (image inputs); Microsoft: Phi 4 does not.
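
The context and price ratios above are simple arithmetic. The following is a minimal Python sketch of how such figures could be derived, not the site's actual calculation. It assumes that "2000K" means 2,000,000 tokens and "16K" means 16,384 tokens (an assumption chosen because it reproduces the 122.1× figure), and it uses the per-1K input prices exactly as displayed on this page. The function names are hypothetical.

```python
# Assumption: "2000K" = 2,000,000 tokens and "16K" = 16,384 tokens.
GROK_CONTEXT_TOKENS = 2_000_000
PHI_CONTEXT_TOKENS = 16_384

# Input prices per 1K tokens as displayed on this page (they appear rounded).
GROK_INPUT_PER_1K = 0.0013
PHI_INPUT_PER_1K = 0.0001


def size_ratio(larger: int, smaller: int) -> float:
    """Return how many times `smaller` fits into `larger`."""
    return larger / smaller


def percent_cheaper(expensive: float, cheap: float) -> float:
    """Return the relative saving of `cheap` versus `expensive`, in percent."""
    return (1 - cheap / expensive) * 100


context_ratio = size_ratio(GROK_CONTEXT_TOKENS, PHI_CONTEXT_TOKENS)
input_saving = percent_cheaper(GROK_INPUT_PER_1K, PHI_INPUT_PER_1K)

print(f"Context ratio: {context_ratio:.1f}x")
print(f"Input saving from displayed prices: {input_saving:.1f}%")
```

With the displayed prices the saving works out to roughly 92%, so the 95% figure presumably comes from unrounded prices that this page does not show.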

Specifications

	xAI: Grok 4.20	Microsoft: Phi 4
Provider	xAI	Microsoft
Context window	2000K	16K
Max output tokens	8192	8192
Input price / 1K tokens	$0.0013	$0.0001
Output price / 1K tokens	$0.0025	$0.0001
Vision support	Yes	No
Function calling	No	No
License	Proprietary	Proprietary
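
To turn the per-1K prices in the table into a per-request estimate, multiply each token count by its rate. The sketch below is illustrative only: `ModelPricing` and `request_cost` are hypothetical names, the prices are the displayed values from the table above, and the token counts are made-up example inputs. Actual billing may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPricing:
    name: str
    input_per_1k: float   # USD per 1K input tokens, as listed in the table
    output_per_1k: float  # USD per 1K output tokens, as listed in the table


def request_cost(pricing: ModelPricing, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-1K token prices."""
    input_cost = (input_tokens / 1000) * pricing.input_per_1k
    output_cost = (output_tokens / 1000) * pricing.output_per_1k
    return input_cost + output_cost


GROK = ModelPricing("xAI: Grok 4.20", input_per_1k=0.0013, output_per_1k=0.0025)
PHI = ModelPricing("Microsoft: Phi 4", input_per_1k=0.0001, output_per_1k=0.0001)

# Hypothetical request: 10,000 input tokens and 1,000 output tokens.
for model in (GROK, PHI):
    cost = request_cost(model, input_tokens=10_000, output_tokens=1_000)
    print(f"{model.name}: ${cost:.4f}")
```

Because the displayed prices appear to be rounded to four decimal places, treat the results as rough estimates rather than exact charges.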

About xAI: Grok 4.20

US·Closed

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering...

About Microsoft: Phi 4

US·Closed

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

Try them yourself

Run the same prompt against xAI: Grok 4.20 and Microsoft: Phi 4 side-by-side on PromptPit.

Or browse the full AI model leaderboard.