AI news, benchmarks & engineering blog curation
Live ranking of large language models by ELO rating from community-voted head-to-head battles, with pricing and context length.
| # | Model | Org | ELO | Votes | Input $/M | Output $/M | Context | License |
|---|---|---|---|---|---|---|---|---|
| 1 | claude-opus-4-7-thinking | Anthropic | 1504 | 3,898 | $5/M | $25/M | NaN | Proprietary |
| 2 | claude-opus-4-6-thinking | Anthropic | 1502 | 18,888 | $5/M | $25/M | NaN | Proprietary |
| 3 | claude-opus-4-7 | Anthropic | 1497 | 4,646 | $5/M | $25/M | NaN | Proprietary |
| 4 | claude-opus-4-6 | Anthropic | 1496 | 20,158 | $5/M | $25/M | NaN | Proprietary |
| 5 | muse-spark | Meta | 1493 | 5,877 | $ | $ | - | Proprietary |
| 6 | gemini-3.1-pro-preview | 1493 | 23,766 | $2/M | $12/M | NaN | Proprietary | |
| 7 | gemini-3-pro | 1486 | 41,378 | $2/M | $12/M | NaN | Proprietary | |
| 8 | grok-4.20-beta1 | xAI | 1482 | 13,010 | $ | $ | - | Proprietary |
| 9 | gpt-5.4-high | OpenAI | 1482 | 12,322 | $2.5/M | $15/M | NaN | Proprietary |
| 10 | grok-4.20-beta-0309-reasoning | xAI | 1480 | 12,442 | $2/M | $6/M | NaN | Proprietary |
| 11 | gpt-5.2-chat-latest-20260210 | OpenAI | 1477 | 18,619 | $1.75/M | $14/M | NaN | Proprietary |
| 12 | gemini-3-flash | 1474 | 30,788 | $0.5/M | $3/M | NaN | Proprietary | |
| 13 | grok-4.20-multi-agent-beta-0309 | xAI | 1474 | 12,841 | $2/M | $6/M | NaN | Proprietary |
| 14 | claude-opus-4-5-20251101-thinking-32k | Anthropic | 1473 | 37,167 | $5/M | $25/M | NaN | Proprietary |
| 15 | grok-4.1-thinking | xAI | 1469 | 50,170 | $ | $ | - | Proprietary |
| 16 | glm-5.1 | Z.ai | 1469 | 7,927 | $1.05/M | $3.5/M | NaN | MIT |
| 17 | claude-opus-4-5-20251101 | Anthropic | 1469 | 50,046 | $5/M | $25/M | NaN | Proprietary |
| 18 | gpt-5.4 | OpenAI | 1467 | 12,870 | $2.5/M | $15/M | NaN | Proprietary |
| 19 | qwen3.5-max-preview | Alibaba | 1466 | 10,245 | $ | $ | - | Proprietary |
| 20 | claude-sonnet-4-6 | Anthropic | 1463 | 12,043 | $3/M | $15/M | NaN | Proprietary |
| 21 | gemini-3-flash (thinking-minimal) | 1462 | 36,244 | $0.5/M | $3/M | NaN | Proprietary | |
| 22 | grok-4.1 | xAI | 1461 | 54,146 | $ | $ | - | Proprietary |
| 23 | dola-seed-2.0-pro | Bytedance | 1460 | 21,521 | $ | $ | - | Proprietary |
| 24 | gpt-5.4-mini-high | OpenAI | 1458 | 9,900 | $2.5/M | $15/M | NaN | Proprietary |
| 25 | glm-5 | Z.ai | 1457 | 16,528 | $1/M | $3.2/M | NaN | MIT |
| 26 | gpt-5.1-high | OpenAI | 1455 | 40,884 | $1.25/M | $10/M | NaN | Proprietary |
| 27 | claude-sonnet-4-5-20250929-thinking-32k | Anthropic | 1452 | 62,470 | $3/M | $15/M | NaN | Proprietary |
| 28 | claude-sonnet-4-5-20250929 | Anthropic | 1452 | 60,425 | $3/M | $15/M | NaN | Proprietary |
| 29 | gemma-4-31b | 1451 | 5,822 | $0.14/M | $0.4/M | NaN | Apache 2.0 | |
| 30 | ernie-5.0-0110 | Baidu | 1451 | 24,817 | $ | $ | - | Proprietary |
| 31 | gpt-5.3-chat-latest | OpenAI | 1451 | 17,274 | $1.75/M | $14/M | NaN | Proprietary |
| 32 | kimi-k2.5-thinking | Moonshot | 1451 | 23,091 | $0.6/M | $3/M | - | Modified MIT |
| 33 | ernie-5.0-preview-1203 | Baidu | 1449 | 9,766 | $ | $ | - | Proprietary |
| 34 | claude-opus-4-1-20250805-thinking-16k | Anthropic | 1449 | 49,870 | $15/M | $75/M | NaN | Proprietary |
| 35 | gemini-2.5-pro | 1448 | 109,917 | $1.25/M | $10/M | NaN | Proprietary | |
| 36 | qwen3.6-plus | Alibaba | 1448 | 4,305 | $0.325/M | $1.95/M | NaN | Proprietary |
| 37 | qwen3.5-397b-a17b | Alibaba | 1447 | 17,989 | $0.39/M | $2.34/M | NaN | Apache 2.0 |
| 38 | claude-opus-4-1-20250805 | Anthropic | 1447 | 77,459 | $15/M | $75/M | NaN | Proprietary |
| 39 | mimo-v2-pro | Xiaomi | 1446 | 10,827 | $1/M | $3/M | NaN | Proprietary |
| 40 | gpt-4.5-preview-2025-02-27 | OpenAI | 1444 | 14,547 | $75/M | $150/M | NaN | Proprietary |
| 41 | chatgpt-4o-latest-20250326 | OpenAI | 1443 | 82,573 | $5/M | $15/M | NaN | Proprietary |
| 42 | glm-4.7 | Z.ai | 1443 | 12,138 | $0.38/M | $1.74/M | NaN | MIT |
| 43 | gpt-5.2-high | OpenAI | 1441 | 33,089 | $1.75/M | $14/M | NaN | Proprietary |
| 44 | gpt-5.2 | OpenAI | 1439 | 30,208 | $1.75/M | $14/M | NaN | Proprietary |
| 45 | gemma-4-26b-a4b | 1439 | 5,780 | $ | $ | - | Apache 2.0 | |
| 46 | gpt-5.1 | OpenAI | 1439 | 43,525 | $1.25/M | $10/M | NaN | Proprietary |
| 47 | gemini-3.1-flash-lite-preview | 1438 | 18,747 | $0.25/M | $1.5/M | NaN | Proprietary | |
| 48 | qwen3-max-preview | Alibaba | 1435 | 27,770 | $0.78/M | $3.9/M | NaN | Proprietary |
| 49 | gpt-5-high | OpenAI | 1433 | 31,997 | $1.25/M | $10/M | NaN | Proprietary |
| 50 | longcat-flash-chat-2602-exp | Meituan | 1433 | 8,528 | $ | $ | - | Proprietary |