GPT-5.4
openai/gpt-5.4
88.0
median tok/s
Throughput Runs
9
TTFT Runs
6
Avg TTFT
743ms
Avg Throughput
77.1 tok/s
Total Cost
$0.3004
Commentary
by openai/gpt-5.4-miniGPT-5.4 is reliable on BridgeBench speed, with a 100.0% success rate and no prompt-specific failures. Startup latency is solid at 397 ms median TTFT, but average TTFT rises to 743 ms, suggesting some long-tail cold-start or queueing variance. Sustained decode is strong overall at 88.0 tok/s median throughput, though the average drops to 77.1 tok/s and the technical essay workload pulls performance down materially; total cost is moderate at $0.300380.
This is the strongest throughput case at 92.7 tok/s median with no issues, indicating good sustained generation on structured, mid-length outputs. The 3,410-token average output did not materially degrade speed.
Throughput is stable at 88.0 tok/s median, essentially matching the overall median and indicating consistent decode behavior on technical content. Output length is slightly shorter than API Design, but there is no sign of instability.
This is the main throughput weakness at 57.0 tok/s median, a large drop versus the other throughput prompts. The 3,571-token average output suggests longer-form prose is more expensive for this model and likely drives the lower overall average throughput.
TTFT is slightly slower here at 406 ms median, but still in a good range for interactive use. The very small 58-token outputs keep the latency profile focused on startup rather than decode.
This is the fastest startup case at 387 ms median TTFT, indicating low first-token latency on short factual responses. The 12-token average output is tiny, so this prompt is a clean read on initiation speed.
Notable Prompts
It is the clear outlier, with median throughput far below the other throughput prompts and dragging down the average.
It has the best sustained decode rate and remains stable on long structured outputs.
It has the lowest median TTFT, indicating strong startup responsiveness on short requests.
It is the slowest TTFT prompt, though only marginally, which suggests modest variance rather than a major latency issue.
All Runs
| Prompt | Type | Tok/s | TTFT | Tokens | Cost | |
|---|---|---|---|---|---|---|
1. Api Design throughput-api-design | throughput | 92.7 | 392ms | 3453 | $0.0348 | |
2. Api Design throughput-api-design | throughput | 89.3 | 395ms | 3381 | $0.0340 | |
3. Api Design throughput-api-design | throughput | 93.2 | 379ms | 3397 | $0.0342 | |
1. Data Structures throughput-data-structures | throughput | 68.4 | 407ms | 3072 | $0.0310 | |
2. Data Structures throughput-data-structures | throughput | 90.5 | 377ms | 2842 | $0.0287 | |
3. Data Structures throughput-data-structures | throughput | 88.0 | 380ms | 2710 | $0.0274 | |
1. Essay throughput-essay | throughput | 54.7 | 479ms | 4096 | $0.0412 | |
2. Essay throughput-essay | throughput | 57.0 | 345ms | 3357 | $0.0338 | |
3. Essay throughput-essay | throughput | 59.9 | 451ms | 3260 | $0.0328 | |
1. Definition ttft-definition | ttft | n/a | 358ms | 56 | $0.0006 | |
2. Definition ttft-definition | ttft | n/a | 406ms | 60 | $0.0006 | |
3. Definition ttft-definition | ttft | n/a | 507ms | 57 | $0.0006 | |
1. Factual ttft-factual | ttft | n/a | 2421ms | 12 | $0.0002 | |
2. Factual ttft-factual | ttft | n/a | 387ms | 12 | $0.0002 | |
3. Factual ttft-factual | ttft | n/a | 380ms | 12 | $0.0002 |
15 runs · Throughput rows require valid long-output runs · TTFT shown for all successful runs