MiniMax M2.7
minimax/MiniMax-M2.7
Median Throughput: 68.2 tok/s
Throughput Runs: 9
TTFT Runs: 6
Avg TTFT: 5870ms
Avg Throughput: 84.7 tok/s
Total Cost: $0.0692
Commentary
by openai/gpt-5.4-mini
MiniMax M2.7 is reliable and cost-efficient on BridgeBench, with a 100.0% success rate and a total cost of $0.069162 across 15 runs. Sustained decode speed is solid at 68.2 tok/s median throughput, but startup latency is the main weakness: median TTFT is 6150 ms and average TTFT is 5870 ms, which is slow for interactive use. Throughput is highly prompt-dependent, with per-prompt medians ranging from 83.2 tok/s on API Design down to 40.2 tok/s on Technical Essay.
API Design: This is the strongest sustained decode case, at 83.2 tok/s median throughput with an average of 4096 output tokens. The long output did not materially degrade throughput, indicating good steady-state generation performance.
Data Structures: Median throughput is mid-pack at 68.2 tok/s, with similarly long outputs averaging 3953 tokens. Individual runs ranged from 59.4 to 264.4 tok/s, so the median is notably slower than the API Design workload and less stable, suggesting sensitivity to prompt/content structure.
Technical Essay: This is the weakest throughput case at 40.2 tok/s median, despite a shorter average output of 2907 tokens. The drop suggests the model slows materially on more open-ended, prose-heavy generation.
Definition: TTFT is slow at 6325 ms median, and the low average output of 277 tokens means startup latency is not amortized by a long generation. This is a poor interactive latency profile for short-answer tasks.
Factual: TTFT improves somewhat to 5384 ms median, but remains high for a factual prompt averaging only 163 output tokens. Startup latency, not decode speed, is still the dominant bottleneck.
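The amortization argument for the two short-answer prompts can be made concrete with a rough latency model. The sketch below assumes end-to-end latency is TTFT plus output tokens divided by decode rate, and that short prompts decode at the overall 68.2 tok/s median. Neither assumption is stated by the benchmark, and the `ttft_share` helper is hypothetical.

```python
def ttft_share(ttft_ms: float, output_tokens: float, decode_tok_s: float) -> float:
    """Fraction of end-to-end latency spent waiting for the first token.

    Assumes latency = TTFT + output_tokens / decode rate (a simplification).
    """
    decode_ms = output_tokens / decode_tok_s * 1000.0
    return ttft_ms / (ttft_ms + decode_ms)


# Assumption: short prompts decode at the overall median throughput.
DECODE_TOK_S = 68.2

# Median TTFT and average output tokens taken from the commentary above.
definition_share = ttft_share(6325, 277, DECODE_TOK_S)
factual_share = ttft_share(5384, 163, DECODE_TOK_S)
# Hypothetical comparison: the same TTFT spread over a 4096-token output.
long_output_share = ttft_share(6325, 4096, DECODE_TOK_S)

print(f"Definition: {definition_share:.0%} of latency is TTFT")
print(f"Factual: {factual_share:.0%} of latency is TTFT")
print(f"4096-token output: {long_output_share:.0%} of latency is TTFT")
```

Under these assumptions, TTFT accounts for roughly 60 to 70 percent of the latency on the short-answer prompts, versus under 10 percent for a 4096-token output.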
Notable Prompts
API Design: Highest sustained throughput at 83.2 tok/s, with no sign of degradation on very long outputs.
Technical Essay: Lowest throughput at 40.2 tok/s, indicating the model slows sharply on essay-style generation.
Definition: Worst startup latency at 6325 ms median, making short responses feel especially slow.
Factual: Best TTFT in the set at 5384 ms median, though still high in absolute terms.
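The per-prompt throughput highs and lows can be re-derived by grouping the throughput rows of the All Runs table by prompt id. This is a minimal sketch, not the benchmark's own aggregation code. The `spread` measure (max minus min) is added here purely for illustration.

```python
from collections import defaultdict
from statistics import median

# (prompt id, tok/s) for each throughput run, copied from the All Runs table.
runs = [
    ("throughput-api-design", 78.7),
    ("throughput-api-design", 84.6),
    ("throughput-api-design", 83.2),
    ("throughput-data-structures", 264.4),
    ("throughput-data-structures", 59.4),
    ("throughput-data-structures", 68.2),
    ("throughput-essay", 45.0),
    ("throughput-essay", 39.0),
    ("throughput-essay", 40.2),
]

# Group tok/s values by prompt id.
by_prompt = defaultdict(list)
for prompt_id, tok_s in runs:
    by_prompt[prompt_id].append(tok_s)

per_prompt_median = {p: median(v) for p, v in by_prompt.items()}
# Illustrative run-to-run spread: fastest run minus slowest run.
per_prompt_spread = {p: max(v) - min(v) for p, v in by_prompt.items()}

fastest = max(per_prompt_median, key=per_prompt_median.get)
slowest = min(per_prompt_median, key=per_prompt_median.get)

for p in by_prompt:
    print(f"{p}: median {per_prompt_median[p]:.1f} tok/s, "
          f"spread {per_prompt_spread[p]:.1f} tok/s")
print(f"fastest: {fastest}, slowest: {slowest}")
```

The grouping also shows the wide run-to-run spread on Data Structures, which the median alone hides.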
All Runs
| Prompt | Type | Tok/s | TTFT | Tokens | Cost |
|---|---|---|---|---|---|
| 1. API Design (throughput-api-design) | throughput | 78.7 | 32871ms | 4096 | $0.0083 |
| 2. API Design (throughput-api-design) | throughput | 84.6 | 36595ms | 4096 | $0.0083 |
| 3. API Design (throughput-api-design) | throughput | 83.2 | 32686ms | 4096 | $0.0083 |
| 1. Data Structures (throughput-data-structures) | throughput | 264.4 | 74832ms | 4096 | $0.0083 |
| 2. Data Structures (throughput-data-structures) | throughput | 59.4 | 18050ms | 3666 | $0.0074 |
| 3. Data Structures (throughput-data-structures) | throughput | 68.2 | 31400ms | 4096 | $0.0083 |
| 1. Essay (throughput-essay) | throughput | 45.0 | 27998ms | 4096 | $0.0083 |
| 2. Essay (throughput-essay) | throughput | 39.0 | 11358ms | 2284 | $0.0046 |
| 3. Essay (throughput-essay) | throughput | 40.2 | 11648ms | 2342 | $0.0047 |
| 1. Definition (ttft-definition) | ttft | n/a | 7297ms | 280 | $0.0006 |
| 2. Definition (ttft-definition) | ttft | n/a | 6325ms | 250 | $0.0005 |
| 3. Definition (ttft-definition) | ttft | n/a | 5974ms | 302 | $0.0006 |
| 1. Factual (ttft-factual) | ttft | n/a | 5384ms | 181 | $0.0004 |
| 2. Factual (ttft-factual) | ttft | n/a | 6597ms | 167 | $0.0004 |
| 3. Factual (ttft-factual) | ttft | n/a | 3641ms | 142 | $0.0003 |
15 runs · Throughput rows require valid long-output runs · TTFT shown for all successful runs
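The headline throughput and TTFT figures can be checked against the rows above. The following sketch recomputes them with Python's standard `statistics` module. It assumes the reported TTFT aggregates cover only the six ttft-type runs, which is consistent with the "TTFT Runs: 6" count but is not stated explicitly.

```python
from statistics import mean, median

# Tok/s for the nine throughput runs, copied from the All Runs table.
throughput_tok_s = [78.7, 84.6, 83.2, 264.4, 59.4, 68.2, 45.0, 39.0, 40.2]
# TTFT in ms for the six ttft-type runs (Definition, then Factual).
# Assumption: the headline TTFT figures exclude the throughput rows.
ttft_ms = [7297, 6325, 5974, 5384, 6597, 3641]


def summarize(values):
    """Return (median, mean) for a list of numbers."""
    return median(values), mean(values)


tp_median, tp_mean = summarize(throughput_tok_s)
ttft_median, ttft_mean = summarize(ttft_ms)

# Mean throughput with the single 264.4 tok/s run left out, for comparison.
tp_mean_trimmed = mean(v for v in throughput_tok_s if v != 264.4)

print(f"throughput: median {tp_median:.1f}, mean {tp_mean:.1f} tok/s")
print(f"throughput mean without the 264.4 run: {tp_mean_trimmed:.1f} tok/s")
print(f"TTFT: median {ttft_median} ms, mean {ttft_mean:.0f} ms")
```

The recomputed values match the reported 68.2 tok/s median, 84.7 tok/s average, and 5870 ms average TTFT. The gap between average and median throughput comes largely from the single 264.4 tok/s Data Structures run, which is why the median is the more representative figure here.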