Model Analysis

GPT-5.4

openai/gpt-5.4

88.0

median tok/s

397ms TTFT

100.0% success

Throughput Runs

TTFT Runs

Avg TTFT

743ms

Avg Throughput

77.1 tok/s

Total Cost

$0.3004

Commentary

by openai/gpt-5.4-mini

GPT-5.4 is reliable on BridgeBench speed, with a 100.0% success rate and no prompt-specific failures. Startup latency is solid at 397 ms median TTFT, but average TTFT rises to 743 ms, suggesting some long-tail cold-start or queueing variance. Sustained decode is strong overall at 88.0 tok/s median throughput, though the average drops to 77.1 tok/s and the technical essay workload pulls performance down materially; total cost is moderate at $0.300380.

Api Designthroughput

This is the strongest throughput case at 92.7 tok/s median with no issues, indicating good sustained generation on structured, mid-length outputs. The 3,410-token average output did not materially degrade speed.

Data Structuresthroughput

Throughput is stable at 88.0 tok/s median, essentially matching the overall median and indicating consistent decode behavior on technical content. Output length is slightly shorter than API Design, but there is no sign of instability.

Essaythroughput

This is the main throughput weakness at 57.0 tok/s median, a large drop versus the other throughput prompts. The 3,571-token average output suggests longer-form prose is more expensive for this model and likely drives the lower overall average throughput.

Definitionttft

TTFT is slightly slower here at 406 ms median, but still in a good range for interactive use. The very small 58-token outputs keep the latency profile focused on startup rather than decode.

Factualttft

This is the fastest startup case at 387 ms median TTFT, indicating low first-token latency on short factual responses. The 12-token average output is tiny, so this prompt is a clean read on initiation speed.

Notable Prompts

Essaythroughput

It is the clear outlier, with median throughput far below the other throughput prompts and dragging down the average.

Api Designthroughput

It has the best sustained decode rate and remains stable on long structured outputs.

Factualttft

It has the lowest median TTFT, indicating strong startup responsiveness on short requests.

Definitionttft

It is the slowest TTFT prompt, though only marginally, which suggests modest variance rather than a major latency issue.

All Runs

Prompt	Type	Tok/s	TTFT	Tokens	Cost
1. Api Design throughput-api-design	throughput	92.7	392ms	3453	$0.0348
2. Api Design throughput-api-design	throughput	89.3	395ms	3381	$0.0340
3. Api Design throughput-api-design	throughput	93.2	379ms	3397	$0.0342
1. Data Structures throughput-data-structures	throughput	68.4	407ms	3072	$0.0310
2. Data Structures throughput-data-structures	throughput	90.5	377ms	2842	$0.0287
3. Data Structures throughput-data-structures	throughput	88.0	380ms	2710	$0.0274
1. Essay throughput-essay	throughput	54.7	479ms	4096	$0.0412
2. Essay throughput-essay	throughput	57.0	345ms	3357	$0.0338
3. Essay throughput-essay	throughput	59.9	451ms	3260	$0.0328
1. Definition ttft-definition	ttft	n/a	358ms	56	$0.0006
2. Definition ttft-definition	ttft	n/a	406ms	60	$0.0006
3. Definition ttft-definition	ttft	n/a	507ms	57	$0.0006
1. Factual ttft-factual	ttft	n/a	2421ms	12	$0.0002
2. Factual ttft-factual	ttft	n/a	387ms	12	$0.0002
3. Factual ttft-factual	ttft	n/a	380ms	12	$0.0002

15 runs · Throughput rows require valid long-output runs · TTFT shown for all successful runs