Model Analysis

MiniMax M2.7

minimax/MiniMax-M2.7

68.2 median tok/s

6150ms TTFT

100.0% success

Throughput Runs

TTFT Runs

Avg TTFT: 5870ms

Avg Throughput: 84.7 tok/s

Total Cost: $0.0692

Commentary

by openai/gpt-5.4-mini

MiniMax M2.7 is reliable and cost-efficient on BridgeBench, with a 100.0% success rate across 15 runs and a low total cost of $0.069162. Sustained decode speed is solid overall at 68.2 tok/s median throughput, but startup latency is the main weakness: median TTFT is 6150 ms and average TTFT is 5870 ms, which is slow for interactive use. Throughput is highly prompt-dependent, with per-prompt medians ranging from 83.2 tok/s on API Design down to 40.2 tok/s on Essay.
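The headline figures can be re-derived from the per-run values in the All Runs table. The following is a minimal sketch, assuming (this is not stated on the page) that throughput statistics are taken over the nine throughput-type runs and TTFT statistics over the six TTFT-type runs. The variable names are illustrative only.

```python
from statistics import mean, median

# Per-run values copied from the All Runs table.
# Assumption: tok/s stats use throughput-type runs only,
# and TTFT stats use TTFT-type runs only.
throughput_tok_s = [78.7, 84.6, 83.2, 264.4, 59.4, 68.2, 45.0, 39.0, 40.2]
ttft_ms = [7297, 6325, 5974, 5384, 6597, 3641]

summary = {
    "median_tok_s": median(throughput_tok_s),
    "avg_tok_s": round(mean(throughput_tok_s), 1),
    "median_ttft_ms": median(ttft_ms),
    "avg_ttft_ms": round(mean(ttft_ms)),
}
print(summary)
```

Under that assumption the results line up with the reported 68.2 tok/s median, 84.7 tok/s average, and 5870 ms average TTFT. The median TTFT comes out as 6149.5 ms, which the page appears to round to 6150 ms.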

API Design (throughput)

This is the strongest sustained decode case, with 83.2 tok/s median throughput and every run producing 4096 output tokens. Throughput stayed between 78.7 and 84.6 tok/s across the three runs, indicating good steady-state generation performance on long outputs.

Data Structures (throughput)

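Performance here is mid-pack at 68.2 tok/s median throughput with similarly long outputs around 3953 tokens. The median masks wide run-to-run variation, however: the three runs measured 264.4, 59.4, and 68.2 tok/s, so the first run looks like an outlier and the result should be read with caution. That single run also pulls the overall average throughput (84.7 tok/s) well above the 68.2 tok/s median. The median is still notably slower than the API Design workload, suggesting sensitivity to prompt/content structure.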

Essay (throughput)

This is the weakest throughput case at 40.2 tok/s median, despite a shorter average output of 2907 tokens. The drop implies the model slows materially on more open-ended, prose-heavy generation.
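With only three runs per prompt, a median can hide a lot of spread. The sketch below recomputes each throughput prompt's median from the All Runs table and adds a max/min ratio as a rough stability indicator. The `spread` helper and the ratio metric are illustrative choices, not something BridgeBench reports.

```python
from statistics import median

# Throughput (tok/s) per run, copied from the All Runs table.
runs = {
    "API Design": [78.7, 84.6, 83.2],
    "Data Structures": [264.4, 59.4, 68.2],
    "Essay": [45.0, 39.0, 40.2],
}

def spread(values):
    """Return (median, max/min ratio) for one prompt's throughput runs."""
    return median(values), max(values) / min(values)

report = {name: spread(vals) for name, vals in runs.items()}

for name, (med, ratio) in report.items():
    print(f"{name}: median {med} tok/s, max/min ratio {ratio:.2f}")
```

A ratio close to 1 means the runs agree with each other. A ratio well above 1, as for Data Structures here, means the median rests on runs that disagree substantially.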

Definition (TTFT)

TTFT is slow at 6325 ms median, and the low average output token count of 277 means startup latency is not being amortized by long generations. This is a poor interactive latency profile for short-answer tasks.

Factual (TTFT)

TTFT improves somewhat to 5384 ms median, but remains high for a factual prompt with only 163 average output tokens. Startup latency is still the dominant bottleneck rather than decode speed.
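To make the amortization point concrete, the share of end-to-end time spent waiting for the first token can be estimated as TTFT / (TTFT + tokens / decode rate). This is a rough sketch: decode speed was not measured for the TTFT prompts (the table shows n/a), so it assumes the overall median of 68.2 tok/s applies to short answers too. The function name is hypothetical.

```python
def ttft_share(ttft_ms: float, output_tokens: float, decode_tok_s: float) -> float:
    """Estimated fraction of end-to-end time spent waiting for the first token."""
    decode_ms = output_tokens / decode_tok_s * 1000.0
    return ttft_ms / (ttft_ms + decode_ms)

# Assumption: short-answer decode speed equals the overall median (68.2 tok/s).
ASSUMED_DECODE_TOK_S = 68.2

# Median TTFT and average output tokens as quoted in the commentary.
definition = ttft_share(6325, 277, ASSUMED_DECODE_TOK_S)
factual = ttft_share(5384, 163, ASSUMED_DECODE_TOK_S)

print(f"Definition: {definition:.0%} of estimated latency is TTFT")
print(f"Factual: {factual:.0%} of estimated latency is TTFT")
```

Under that assumption, roughly 61% of the Definition latency and 69% of the Factual latency is spent before the first token arrives, which is consistent with startup latency being the dominant cost for short responses.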

Notable Prompts

API Design (throughput)

Highest sustained throughput at 83.2 tok/s, with no sign of degradation on very long outputs.

Essay (throughput)

Lowest throughput at 40.2 tok/s, indicating the model slows sharply on essay-style generation.

Definition (TTFT)

Worst startup latency at 6325 ms median, making short responses feel especially slow.

Factual (TTFT)

Best TTFT in the set at 5384 ms median, though still high in absolute terms.

All Runs

Run	Prompt	Prompt ID	Type	Tok/s	TTFT	Tokens	Cost
1	API Design	throughput-api-design	throughput	78.7	32871ms	4096	$0.0083
2	API Design	throughput-api-design	throughput	84.6	36595ms	4096	$0.0083
3	API Design	throughput-api-design	throughput	83.2	32686ms	4096	$0.0083
1	Data Structures	throughput-data-structures	throughput	264.4	74832ms	4096	$0.0083
2	Data Structures	throughput-data-structures	throughput	59.4	18050ms	3666	$0.0074
3	Data Structures	throughput-data-structures	throughput	68.2	31400ms	4096	$0.0083
1	Essay	throughput-essay	throughput	45.0	27998ms	4096	$0.0083
2	Essay	throughput-essay	throughput	39.0	11358ms	2284	$0.0046
3	Essay	throughput-essay	throughput	40.2	11648ms	2342	$0.0047
1	Definition	ttft-definition	ttft	n/a	7297ms	280	$0.0006
2	Definition	ttft-definition	ttft	n/a	6325ms	250	$0.0005
3	Definition	ttft-definition	ttft	n/a	5974ms	302	$0.0006
1	Factual	ttft-factual	ttft	n/a	5384ms	181	$0.0004
2	Factual	ttft-factual	ttft	n/a	6597ms	167	$0.0004
3	Factual	ttft-factual	ttft	n/a	3641ms	142	$0.0003

15 runs · Throughput rows require valid long-output runs · TTFT shown for all successful runs