---
title: "AI Model Comparison"
type: index
id: "models"
description: "Compare 33+ AI models by provider, pricing, context window, coding score, reasoning score, open-source status, and best-use case."
last_updated: "2026-04-24"
---

# AI Model Comparison

Comprehensive comparison of current AI models with benchmarks, pricing, and recommendations. Every model has structured YAML metadata with typed fields — pricing, benchmarks, context windows — queryable via the [JSON API](/api/v1/models.json) and [recommendation engine](/api/v1/recommend.json).

## Proprietary Models

| Model | Provider | Context | Reasoning | Coding | Pricing (input) |
|-------|----------|---------|-----------|--------|-----------------|
| [GPT-5.4](gpt-5.4.md) | OpenAI | 1M tokens | 95 | 92 | $5.00 / 1M tokens |
| [GPT-5.4 Thinking](gpt-5.4-thinking.md) | OpenAI | 256K tokens | 98 | 93 | $10.00 / 1M tokens |
| [Claude Opus 4.6](claude-opus-4.6.md) | Anthropic | 1M tokens | 96 | 97 | $5.00 / 1M tokens |
| [Claude Sonnet 4.6](claude-sonnet-4.6.md) | Anthropic | 1M tokens | 91 | 93 | $3.00 / 1M tokens |
| [Claude Haiku 4.5](claude-haiku-4.5.md) | Anthropic | 200K tokens | 82 | 84 | $1.00 / 1M tokens |
| [Gemini 3.1 Pro](gemini-3.1-pro.md) | Google | 1M tokens | 93 | 91 | $2.00 / 1M tokens |
| [Gemini 3 Flash](gemini-3-flash.md) | Google | 1M tokens | 82 | 80 | $0.15 / 1M tokens |
| [Grok 4.1](grok-4.1.md) | xAI | 128K tokens | 91 | 90 | $3.00 / 1M tokens |
| [Grok 4.20](grok-4.20.md) | xAI | 2M tokens | 85 | 88 | $2.00 / 1M tokens |

## Open Source Models

| Model | Provider | Parameters | Context | Reasoning | Coding | License |
|-------|----------|------------|---------|-----------|--------|---------|
| [Llama 4 Maverick](llama-4-maverick.md) | Meta | 400B total (17B active) | 1M tokens | 87 | 82 | Llama Community License |
| [Llama 4 Scout](llama-4-scout.md) | Meta | 109B total (17B active) | 10M tokens | 80 | 79 | Llama Community License |
| [DeepSeek V3.2](deepseek-v3.2.md) | DeepSeek | 671B total (37B active) | 128K tokens | 88 | 88 | MIT |
| [DeepSeek R1](deepseek-r1.md) | DeepSeek | 671B total (37B active) | 128K tokens | 92 | 88 | MIT |
| [Mistral 3](mistral-3.md) | Mistral AI | 675B total (41B active) | 128K tokens | 86 | 87 | Apache 2.0 |
| [Qwen 3](qwen-3.md) | Alibaba | 1T+ total (MoE, various active sizes) | 128K tokens | 88 | 90 | Apache 2.0 |
| [Hermes 4 405B](hermes-4-405b.md) | Nous Research | 405B (also available in 14B, 70B) | 128K tokens | 88 | 84 | Llama Community License |
| [MiniMax M2.7](minimax-m2.7.md) | MiniMax | MoE (undisclosed active/total) | 128K tokens | 90 | 95 | Modified MIT |
| [GLM-5](glm-5.md) | Zhipu AI | 744B total (40B active) | 128K tokens | 90 | 93 | MIT |
| [Kimi K2.5](kimi-k2.5.md) | Moonshot AI | MoE (undisclosed) | 128K tokens | 93 | 85 | MIT |
| [Qwen 3.5 397B-A17B](qwen-3.5.md) | Alibaba | 397B total (17B active) | 256K tokens | 91 | 92 | Apache 2.0 |
| [GPT-OSS-120B](gpt-oss-120b.md) | OpenAI | 120B | 128K tokens | 85 | 86 | OpenAI Open Weight License |
| [Gemma 3](gemma-3.md) | Google | 1B to 27B variants | 128K tokens | 75 | 73 | Gemma Terms of Use |
| [Gemma 4](gemma-4.md) | Google | E2B, E4B, 26B MoE (3.8B active), 31B Dense | 256K tokens | 84 | 83 | Apache 2.0 |
| [Command R+](command-r-plus.md) | Cohere | 104B | 128K tokens | 82 | 78 | CC-BY-NC 4.0 |
| [Yi-1.5 34B](yi-1.5-34b.md) | 01.AI | 34B (also 6B, 9B variants) | 32K tokens | 80 | 79 | Apache 2.0 |
| [Phi-4](phi-4.md) | Microsoft | 14B | 16K tokens | 78 | 80 | MIT |
| [Falcon 3](falcon-3.md) | Technology Innovation Institute | 3B to 10B variants | 32K tokens | 70 | 68 | Apache 2.0 |
| [SmolLM3 3B](smollm3-3b.md) | Hugging Face | 3B | 32K tokens | 68 | 70 | Apache 2.0 |
| [Cohere Tiny Aya 3.35B](cohere-tiny-aya.md) | Cohere | 3.35B | 32K tokens | 65 | 62 | CC-BY-NC 4.0 |
| [Mistral Small 3 24B](mistral-small-3.md) | Mistral AI | 24B | 128K tokens | 79 | 80 | Apache 2.0 |
| [Mistral Small 4](mistral-small-4.md) | Mistral AI | 119B total (6.5B active) | 128K tokens | 76 | 78 | Apache 2.0 |
| [Nemotron 3 Super](nemotron-3-super.md) | NVIDIA | 120B total (12B active) | 128K tokens | 80 | 82 | NVIDIA Open Model License |
| [Nemotron-Cascade 2](nemotron-cascade-2.md) | NVIDIA | 30B total (3B active) | 1M tokens | 88 | 90 | NVIDIA Open Model License |

AI Model Comparison

Comprehensive comparison of current AI models with benchmarks, pricing, and recommendations. Every model has structured YAML metadata with typed fields — pricing, benchmarks, context windows — queryable via the JSON API and recommendation engine.

Proprietary Models

Model Provider Context Reasoning Coding Pricing (input)
GPT-5.4 OpenAI 1M tokens 95 92 $5.00 / 1M tokens
GPT-5.4 Thinking OpenAI 256K tokens 98 93 $10.00 / 1M tokens
Claude Opus 4.6 Anthropic 1M tokens 96 97 $5.00 / 1M tokens
Claude Sonnet 4.6 Anthropic 1M tokens 91 93 $3.00 / 1M tokens
Claude Haiku 4.5 Anthropic 200K tokens 82 84 $1.00 / 1M tokens
Gemini 3.1 Pro Google 1M tokens 93 91 $2.00 / 1M tokens
Gemini 3 Flash Google 1M tokens 82 80 $0.15 / 1M tokens
Grok 4.1 xAI 128K tokens 91 90 $3.00 / 1M tokens
Grok 4.20 xAI 2M tokens 85 88 $2.00 / 1M tokens

Open Source Models

Model Provider Parameters Context Reasoning Coding License
Llama 4 Maverick Meta 400B total (17B active) 1M tokens 87 82 Llama Community License
Llama 4 Scout Meta 109B total (17B active) 10M tokens 80 79 Llama Community License
DeepSeek V3.2 DeepSeek 671B total (37B active) 128K tokens 88 88 MIT
DeepSeek R1 DeepSeek 671B total (37B active) 128K tokens 92 88 MIT
Mistral 3 Mistral AI 675B total (41B active) 128K tokens 86 87 Apache 2.0
Qwen 3 Alibaba 1T+ total (MoE, various active sizes) 128K tokens 88 90 Apache 2.0
Hermes 4 405B Nous Research 405B (also available in 14B, 70B) 128K tokens 88 84 Llama Community License
MiniMax M2.7 MiniMax MoE (undisclosed active/total) 128K tokens 90 95 Modified MIT
GLM-5 Zhipu AI 744B total (40B active) 128K tokens 90 93 MIT
Kimi K2.5 Moonshot AI MoE (undisclosed) 128K tokens 93 85 MIT
Qwen 3.5 397B-A17B Alibaba 397B total (17B active) 256K tokens 91 92 Apache 2.0
GPT-OSS-120B OpenAI 120B 128K tokens 85 86 OpenAI Open Weight License
Gemma 3 Google 1B to 27B variants 128K tokens 75 73 Gemma Terms of Use
Gemma 4 Google E2B, E4B, 26B MoE (3.8B active), 31B Dense 256K tokens 84 83 Apache 2.0
Command R+ Cohere 104B 128K tokens 82 78 CC-BY-NC 4.0
Yi-1.5 34B 01.AI 34B (also 6B, 9B variants) 32K tokens 80 79 Apache 2.0
Phi-4 Microsoft 14B 16K tokens 78 80 MIT
Falcon 3 Technology Innovation Institute 3B to 10B variants 32K tokens 70 68 Apache 2.0
SmolLM3 3B Hugging Face 3B 32K tokens 68 70 Apache 2.0
Cohere Tiny Aya 3.35B Cohere 3.35B 32K tokens 65 62 CC-BY-NC 4.0
Mistral Small 3 24B Mistral AI 24B 128K tokens 79 80 Apache 2.0
Mistral Small 4 Mistral AI 119B total (6.5B active) 128K tokens 76 78 Apache 2.0
Nemotron 3 Super NVIDIA 120B total (12B active) 128K tokens 80 82 NVIDIA Open Model License
Nemotron-Cascade 2 NVIDIA 30B total (3B active) 1M tokens 88 90 NVIDIA Open Model License