xAI: Grok 4.20 vs Microsoft: Phi 4
Side-by-side comparison — pricing, context window, capabilities, and live leaderboard data.
Key differences
- xAI: Grok 4.20 has a 2000K-token context window — 122.1× larger than Microsoft: Phi 4's 16K.
- Microsoft: Phi 4 is 95% cheaper per 1K input tokens than xAI: Grok 4.20 ($0.0001 vs $0.0013).
- xAI: Grok 4.20 is built by xAI; Microsoft: Phi 4 is built by Microsoft.
- xAI: Grok 4.20 supports vision (image inputs); the other does not.
Specifications
| xAI: Grok 4.20 | Microsoft: Phi 4 | |
|---|---|---|
| Provider | xAI | Microsoft |
| Context window | 2000K | 16K |
| Max output tokens | 8192 | 8192 |
| Input price / 1K tokens | $0.0013 | $0.0001 |
| Output price / 1K tokens | $0.0025 | $0.0001 |
| Vision support | Yes | No |
| Function calling | No | No |
| License | Proprietary | Proprietary |
About xAI: Grok 4.20
US·ClosedGrok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering...
View xAI: Grok 4.20 reliability and benchmark history →About Microsoft: Phi 4
US·Closed[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...
View Microsoft: Phi 4 reliability and benchmark history →Try them yourself
Run the same prompt against xAI: Grok 4.20 and Microsoft: Phi 4 side-by-side on PromptPit.
Start a comparison