BridgeBenchBridgeBench

Leaderboard Overview

See how leading AI coding models stack up across algorithms, debugging, refactoring, generation, UI, security, and speed. Each card provides a snapshot of the top performers in that category. Learn more.

Speed

View
Mar 27 · 0h ago
RankModeltok/sTTFT
1Grok 4.20 (Non-Reasoning)243.31999ms
2Grok 4.20 Reasoning237.71497ms
3Gemini 3.1 Pro122.27608ms
4Claude Sonnet 4.695.31207ms
5Claude Opus 4.692.21922ms
6GPT-5.488397ms
7GLM-573.3962ms
8MiniMax M2.768.26150ms
9Grok 461.43684ms
10MiMo-V2-Pro57.57791ms
Coming Soon

Overall

RankModelScore
1GPT-5.495.5
2GPT-5.4 Mini94.8
3GPT-5.4 Nano92.9
4GPT-4.191.8
5Qwen 3.5 35B-A3B91.7
6Claude Sonnet 4.590.7
7Qwen 3.5 122B-A10B90.0
8o3-mini89.6
9Qwen 3.5 27B89.5
10Gemini 2.5 Pro88.9
Coming Soon

Algorithms

RankModelScore
1GPT-5.4 Mini99.0
2GPT-5.498.9
3GPT-5.4 Nano97.8
4Qwen 3.5 122B-A10B94.9
5Qwen 3.5 35B-A3B94.7
6Qwen 3.5 27B94.5
7GPT-4.192.7
8o3-mini90.3
9Gemini 2.5 Pro89.8
10Claude Sonnet 4.589.6
Coming Soon

Debugging

RankModelScore
1GPT-5.496.4
2GPT-5.4 Mini96.4
3GPT-5.4 Nano96.0
4Qwen 3.5 35B-A3B96.0
5Qwen 3.5 122B-A10B94.1
6GPT-4.193.8
7Qwen 3.5 27B93.2
8Claude Sonnet 4.592.5
9o3-mini91.4
10Gemini 2.5 Pro90.6
Coming Soon

Refactoring

RankModelScore
1GPT-5.4 Nano98.3
2GPT-5.497.9
3GPT-5.4 Mini97.6
4Claude Sonnet 4.593.1
5GPT-4.191.9
6o3-mini89.8
7Gemini 2.5 Pro88.4
8Qwen 3.5 122B-A10B87.4
9Qwen 3.5 35B-A3B87.3
10Qwen 3.5 Flash (02-23)86.5
Coming Soon

Generation

RankModelScore
1GPT-5.497.0
2GPT-5.4 Mini94.4
3Qwen 3.5 35B-A3B93.5
4Qwen 3.5 122B-A10B92.5
5GPT-4.192.4
6Qwen 3.5 27B92.2
7Qwen 3.5 Flash (02-23)90.8
8Claude Sonnet 4.590.4
9GPT-5.4 Nano90.1
10Gemini 2.5 Pro89.3
Coming Soon

UI

RankModelScore
1Claude Sonnet 4.590.9
2GPT-5.489.7
3Gemini 2.5 Pro89.0
4GPT-4.188.9
5GPT-5.4 Mini88.4
6Grok 488.1
7Qwen 3.5 27B86.9
8Qwen 3.5 122B-A10B86.7
9o3-mini86.5
10Qwen 3.5 35B-A3B86.0