xAI: Grok 4.20 vs Microsoft: Phi 4

Side-by-side comparison — pricing, context window, capabilities, and live leaderboard data.

Key differences

  • xAI: Grok 4.20 has a 2000K-token context window, about 122.1× the size of Microsoft: Phi 4's 16K.
  • Microsoft: Phi 4 is 95% cheaper per 1K input tokens than xAI: Grok 4.20 ($0.0001 vs $0.0013).
  • xAI: Grok 4.20 is built by xAI; Microsoft: Phi 4 is built by Microsoft.
  • xAI: Grok 4.20 supports vision (image inputs); Microsoft: Phi 4 does not.
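The ratio and percentage in the list above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes "2000K" means 2,000,000 tokens and "16K" means 16 × 1024 = 16,384 tokens (the reading that reproduces the 122.1× figure), and it uses the rounded prices shown on this page. With those rounded prices the saving works out to about 92%, so the 95% figure was presumably computed from unrounded prices that are not shown here. The function names are hypothetical.

```python
def context_ratio(larger_tokens: int, smaller_tokens: int) -> float:
    """Return how many times larger one context window is than another."""
    return larger_tokens / smaller_tokens


def percent_cheaper(cheaper_price: float, pricier_price: float) -> float:
    """Return the relative saving of the cheaper price, as a percentage."""
    return (1 - cheaper_price / pricier_price) * 100


# Assumption: "2000K" = 2,000,000 tokens and "16K" = 16 * 1024 tokens.
GROK_CONTEXT = 2_000_000
PHI_CONTEXT = 16 * 1024

# Rounded input prices per 1K tokens, as displayed on the page.
GROK_INPUT_PRICE = 0.0013
PHI_INPUT_PRICE = 0.0001

ratio = context_ratio(GROK_CONTEXT, PHI_CONTEXT)
saving = percent_cheaper(PHI_INPUT_PRICE, GROK_INPUT_PRICE)

print(f"Context ratio: {ratio:.1f}x")   # Context ratio: 122.1x
print(f"Input saving: {saving:.1f}%")   # Input saving: 92.3%
```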

Specifications

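| Specification | xAI: Grok 4.20 | Microsoft: Phi 4 |
| --- | --- | --- |
| Provider | xAI | Microsoft |
| Context window (tokens) | 2000K | 16K |
| Max output tokens | 8192 | 8192 |
| Input price / 1K tokens | $0.0013 | $0.0001 |
| Output price / 1K tokens | $0.0025 | $0.0001 |
| Vision support | Yes | No |
| Function calling | No | No |
| License | Proprietary | Proprietary |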
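To make the per-1K prices concrete, the sketch below estimates the cost of a single request on each model. It is a minimal illustration, not a billing calculator: the `Pricing` class, the `PRICES` table, and `request_cost` are hypothetical names, the prices are the rounded values listed above, and the 10,000-input / 1,000-output request is an arbitrary example workload.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Price in USD per 1K tokens, as listed in the specifications."""
    input_per_1k: float
    output_per_1k: float


# Rounded prices copied from the specifications; actual billing may differ.
PRICES = {
    "xAI: Grok 4.20": Pricing(input_per_1k=0.0013, output_per_1k=0.0025),
    "Microsoft: Phi 4": Pricing(input_per_1k=0.0001, output_per_1k=0.0001),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-1K token prices."""
    pricing = PRICES[model]
    return (
        input_tokens / 1000 * pricing.input_per_1k
        + output_tokens / 1000 * pricing.output_per_1k
    )


# Example workload (an assumption): 10,000 input tokens, 1,000 output tokens.
for model in PRICES:
    cost = request_cost(model, input_tokens=10_000, output_tokens=1_000)
    print(f"{model}: ${cost:.4f}")
```

For this example workload the estimate comes to roughly $0.0155 for xAI: Grok 4.20 and $0.0011 for Microsoft: Phi 4.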

About xAI: Grok 4.20

US·Closed

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering...

View xAI: Grok 4.20 reliability and benchmark history →

About Microsoft: Phi 4

US·Closed

Microsoft Research's Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

View Microsoft: Phi 4 reliability and benchmark history →

Try them yourself

Run the same prompt against xAI: Grok 4.20 and Microsoft: Phi 4 side-by-side on PromptPit.

Or browse the full AI model leaderboard.