Last updated: March 26, 2025 (data verified by human)
Models (22 total)
Rank | Model | Organization | Score | Input price ($/1M tokens) | Output price ($/1M tokens) | 95% CI | Votes | License
1 | Claude 3.7 Sonnet (20250219) | Anthropic | 1354.01 | $3.00 | $15.00 | +12.49 / -10.75 | 4,825 | Proprietary
2 | Gemini-2.5-Pro-Exp-03-25 | Google | 1267.7 | N/A | N/A | +15.58 / -15.39 | 1,654 | Proprietary
3 | Claude 3.5 Sonnet (20241022) | Anthropic | 1245.4 | $3.00 | $15.00 | +4.51 / -4.90 | 21,059 | Proprietary
4 | DeepSeek-R1 | DeepSeek | 1203.8 | $0.55 | $2.19 | +8.60 / -10.35 | 3,760 | MIT
5 | early-grok-3 | xAI | 1144.94 | N/A | N/A | +9.38 / -8.34 | 6,051 | Proprietary
6 | o3-mini-high (20250131) | OpenAI | 1144.14 | $1.10 | $4.40 | +12.82 / -8.24 | 2,874 | Proprietary
7 | Claude 3.5 Haiku (20241022) | Anthropic | 1136.02 | $0.80 | $4.00 | +4.57 / -4.99 | 15,226 | Proprietary
8 | Gemini-2.0-Pro-Exp-02-05 | Google | 1099.19 | N/A | N/A | +5.30 / -5.97 | 9,407 | Proprietary
9 | o3-mini (20250131) | OpenAI | 1097.7 | $1.10 | $4.40 | +8.25 / -7.01 | 6,294 | Proprietary
10 | o1 (20241217) | OpenAI | 1049.23 | $15.00 | $60.00 | +7.45 / -6.34 | 9,198 | Proprietary
11 | o1-mini (20240912) | OpenAI | 1046.16 | $1.10 | $4.40 | +5.17 / -5.14 | 13,745 | Proprietary
12 | Gemini-2.0-Flash-Thinking-01-21 | Google | 1033.91 | N/A | N/A | +15.72 / -19.53 | 1,064 | Proprietary
13 | Gemini-2.0-Flash-001 | Google | 1030.44 | $0.10 | $0.40 | +8.58 / -9.64 | 4,431 | Proprietary
14 | Gemini-2.0-Flash-Thinking-1219 | Google | 1023.15 | N/A | N/A | +6.87 / -7.24 | 8,010 | Proprietary
15 | Gemini-Exp-1206 | Google | 1022.31 | N/A | N/A | +7.30 / -5.28 | 12,099 | Proprietary
16 | Gemini-2.0-Flash-Exp | Google | 983.52 | N/A | N/A | +6.16 / -4.80 | 14,485 | Proprietary
17 | Qwen2.5-Max | Alibaba | 977.59 | $1.60 | $6.40 | +7.63 / -5.87 | 7,914 | Proprietary
18 | GPT-4o-2024-11-20 | OpenAI | 964 | $2.50 | $10.00 | +4.21 / -5.17 | 16,567 | Proprietary
19 | DeepSeek-V3 | DeepSeek | 963.43 | $0.27 | $1.10 | +6.29 / -7.28 | 7,717 | DeepSeek
20 | Qwen2.5-Coder-32B-Instruct | Alibaba | 903.53 | $0.80 | $0.80 | +4.05 / -5.24 | 15,151 | Apache 2.0
21 | Gemini-1.5-Pro-002 | Google | 894.57 | $1.25 | $5.00 | +3.86 / -5.56 | 14,507 | Proprietary
22 | Llama-3.1-405B-Instruct | Meta | 811.92 | $3.50 | $3.50 | +18.01 / -22.20 | 1,117 | Llama 3.1
Data sourced from: web.lmarena.ai/leaderboard