BridgeBenchBridgeBench
Speed
Model Analysis

MiniMax M2.7

minimax/MiniMax-M2.7

68.2

median tok/s

6150ms TTFT
100.0% success

Throughput Runs

9

TTFT Runs

6

Avg TTFT

5870ms

Avg Throughput

84.7 tok/s

Total Cost

$0.0692

Commentary

by openai/gpt-5.4-mini

MiniMax M2.7 is reliable and cost-efficient on BridgeBench, with a 100.0% success rate and low total cost of $0.069162. Sustained decode speed is solid overall at 68.2 tok/s median throughput, but startup latency is the main weakness: median TTFT is 6150 ms and average TTFT is 5870 ms, which is slow for interactive use. Throughput is highly prompt-dependent, ranging from 83.2 tok/s on API Design down to 40.2 tok/s on Technical Essay.

Api Designthroughput

This is the strongest sustained decode case, with 83.2 tok/s median throughput over 4096 average output tokens. The long output did not materially degrade throughput, indicating good steady-state generation performance.

Data Structuresthroughput

Performance here is mid-pack at 68.2 tok/s median throughput with similarly long outputs around 3953 tokens. The result is stable, but notably slower than the API Design workload, suggesting sensitivity to prompt/content structure.

Essaythroughput

This is the weakest throughput case at 40.2 tok/s median, despite a shorter average output of 2907 tokens. The drop implies the model slows materially on more open-ended, prose-heavy generation.

Definitionttft

TTFT is slow at 6325 ms median, and the low average output token count of 277 means startup latency is not being amortized by long generations. This is a poor interactive latency profile for short-answer tasks.

Factualttft

TTFT improves somewhat to 5384 ms median, but remains high for a factual prompt with only 163 average output tokens. Startup latency is still the dominant bottleneck rather than decode speed.

Notable Prompts

Api Designthroughput

Highest sustained throughput at 83.2 tok/s, with no sign of degradation on very long outputs.

Essaythroughput

Lowest throughput at 40.2 tok/s, indicating the model slows sharply on essay-style generation.

Definitionttft

Worst startup latency at 6325 ms median, making short responses feel especially slow.

Factualttft

Best TTFT in the set at 5384 ms median, though still high in absolute terms.

All Runs

PromptTypeTok/sTTFT
1. Api Design
throughput-api-design
throughput78.732871ms
2. Api Design
throughput-api-design
throughput84.636595ms
3. Api Design
throughput-api-design
throughput83.232686ms
1. Data Structures
throughput-data-structures
throughput264.474832ms
2. Data Structures
throughput-data-structures
throughput59.418050ms
3. Data Structures
throughput-data-structures
throughput68.231400ms
1. Essay
throughput-essay
throughput45.027998ms
2. Essay
throughput-essay
throughput39.011358ms
3. Essay
throughput-essay
throughput40.211648ms
1. Definition
ttft-definition
ttftn/a7297ms
2. Definition
ttft-definition
ttftn/a6325ms
3. Definition
ttft-definition
ttftn/a5974ms
1. Factual
ttft-factual
ttftn/a5384ms
2. Factual
ttft-factual
ttftn/a6597ms
3. Factual
ttft-factual
ttftn/a3641ms

15 runs · Throughput rows require valid long-output runs · TTFT shown for all successful runs