LLM WebDevArena Leaderboard
Last updated: August 12, 2025 (data verified by human)
Models
42 total
1

OpenAIGPT-5 (high)

OpenAI

1481.98
Score
Input Cost
$1.25
Output Cost
$10.00
95% CI
+11.92 / -10.06
Votes
3,651
License
Proprietary
Ties: Models may share the same rank; higher Arena Score appears first (1, 2, 2, 3…).
2

AnthropicClaude Opus 4.1 (20250805)

Anthropic

1426.27
Score
Input Cost
$15.00
Output Cost
$75.00
95% CI
+18.88 / -18.07
Votes
1,402
License
Proprietary
Ties: Models may share the same rank; higher Arena Score appears first (1, 2, 2, 3…).
2

GoogleGemini-2.5-Pro

Google

1404.65
Score
Input Cost
$1.25
Output Cost
$10.00
95% CI
+6.99 / -7.02
Votes
7,085
License
Proprietary
Ties: Models may share the same rank; higher Arena Score appears first (1, 2, 2, 3…).
3

DeepSeekDeepSeek-R1-0528

DeepSeek

1391.46
Score
4

AnthropicClaude Opus 4 (20250514)

Anthropic

1381.55
Score
5

AlibabaQwen3-Coder

Alibaba

1363.47
Score
6

ZAIGLM-4.5

ZAI

1363.3
Score
6

AnthropicClaude Sonnet 4 (20250514)

Anthropic

1359.24
Score
6

AnthropicClaude 3.7 Sonnet (20250219)

Anthropic

1358.4
Score
6

ZAIGLM-4.5-Air

ZAI

1353.76
Score
11

MoonshotKimi-K2-Instruct

Moonshot

1314.79
Score
12

GoogleGemini-2.5-Flash

Google

1290.18
Score
13

OpenAIGPT-4.1-2025-04-14

OpenAI

1253.32
Score
14

AnthropicClaude 3.5 Sonnet (20241022)

Anthropic

1238.13
Score
15

DeepSeekDeepSeek-V3-0324

DeepSeek

1207.92
Score
15

DeepSeekDeepSeek-R1

DeepSeek

1199.4
Score
15

OpenAIGPT-4.1-mini-2025-04-14

OpenAI

1192.72
Score
15

AlibabaQwen3-235B-A22B

Alibaba

1189.46
Score
15

OpenAIo3-2025-04-16

OpenAI

1186.21
Score
17

MistralMistral Medium 3

Mistral

1180.03
Score
18

xAIGrok-4-0709

xAI

1176.46
Score
22

xAIGrok-3-preview-02-24

xAI

1143.3
Score
22

OpenAIo3-mini-high (20250131)

OpenAI

1136.73
Score
22

AnthropicClaude 3.5 Haiku (20241022)

Anthropic

1133.39
Score
22

MiniMaxMiniMax-M1

MiniMax

1129.68
Score
24

OpenAIo4-mini-2025-04-16

OpenAI

1117.87
Score
27

OpenAIo3-mini (20250131)

OpenAI

1092.16
Score
27

GoogleGemini-2.0-Pro-Exp-02-05

Google

1089.73
Score
27

OpenAIgpt-oss-120b

OpenAI

1081.54
Score
30

OpenAIo1 (20241217)

OpenAI

1045.15
Score
30

OpenAIo1-mini (20240912)

OpenAI

1042.57
Score
30

GoogleGemini-2.0-Flash-001

Google

1040.25
Score
30

GoogleGemini-2.0-Flash-Thinking-01-21

Google

1029.78
Score
32

MetaLlama-4-Maverick-17B-128E-Instruct

Meta

1026.98
Score
35

GoogleGemini-2.0-Flash-Exp

Google

980.05
Score
35

AlibabaQwen2.5-Max

Alibaba

975.52
Score
37

OpenAIGPT-4o-2024-11-20

OpenAI

964
Score
37

DeepSeekDeepSeek-V3

DeepSeek

959.77
Score
39

AlibabaQwen2.5-Coder-32B-Instruct

Alibaba

901.96
Score
39

MetaLlama-4-Scout-17B-16E-Instruct

Meta

901.13
Score
39

GoogleGemini-1.5-Pro-002

Google

892.55
Score
42

MetaLlama-3.1-405B-Instruct

Meta

809.61
Score
Showing 42 of 42 models
Data sourced from: web.lmarena.ai/leaderboard