BridgeBench

Blog

Insights on AI Coding

Thoughts on vibe coding, agentic coding, AI benchmarks, and building the future of AI-assisted development.

8 min read

BridgeBench v2: A New Standard for Vibe Coding and Agentic AI Benchmarks

BridgeBench v2 is here — a ground-up redesign of how we evaluate AI coding models. With expanded categories, real-world vibe coding tasks, and agentic coding workflows, v2 sets the new bar for AI coding benchmarks.

announcement · vibe coding · agentic coding · AI coding · BridgeBench v2
6 min read · Coming soon

Vibe Coding Needs Better Benchmarks

Traditional coding benchmarks test toy problems. Vibe coding and agentic coding demand evaluation on production-grade tasks — here's what that looks like.

opinion · vibe coding · agentic coding · AI coding
7 min read · Coming soon

SpeedBench Methodology: Measuring AI Code Generation Speed

How we measure throughput and time to first token (TTFT) separately for accurate, reproducible speed benchmarks that reflect real agentic coding performance.

methodology · speedbench · agentic coding
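The SpeedBench blurb above separates two numbers that a single "tokens per second" figure conflates: how long you wait for the first token, and how fast tokens arrive once decoding starts. As a rough sketch of that separation (the `measure_stream` helper and its token-iterator interface are hypothetical illustrations, not BridgeBench's actual harness), both can be timed over one streaming response:

```python
import time

def measure_stream(token_stream):
    """Time TTFT and steady-state decode throughput separately.

    token_stream: any iterator yielding generated tokens
    (a hypothetical interface; a real harness would wrap an API's
    streaming response).
    Returns (ttft_seconds, tokens_per_second).
    """
    start = time.perf_counter()
    ttft = None
    count = 0
    for _ in token_stream:
        now = time.perf_counter()
        if ttft is None:
            ttft = now - start  # time to first token
        count += 1
    total = time.perf_counter() - start
    # Throughput is computed over decode time only, so prompt
    # processing and queueing delays don't distort tokens/sec.
    decode_time = total - (ttft or 0.0)
    if count > 1 and decode_time > 0:
        tps = (count - 1) / decode_time
    else:
        tps = 0.0
    return ttft, tps
```

Reporting the two metrics independently matters because a model can have excellent throughput but poor TTFT (or vice versa), and agentic workflows with many short round trips are often dominated by TTFT.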
5 min read · Coming soon

Introducing BridgeBench: The Vibe Coding Benchmark

Why we built BridgeBench and how it measures what actually matters for AI coding — from vibe coding workflows to agentic pipelines.

announcement · vibe coding · AI coding

More posts on AI coding, vibe coding workflows, and agentic development coming soon.

Join the Discord for early access to new content.