Morph
Docs
Blog
Pricing
Contact Us
MCP
Book a Call
Sign Up / Log In
Back to Blog
We Hit 10,500 Tokens/Sec on B200
Technical deep-dive: custom CUDA kernels + speculative execution for 2.3x speedup
Tejas Bhakta
September 15, 2025
4 min read
Table of Contents
Why This Speed Matters (It's Not Just Marketing)
The Technical Problem
How We Hit 10,500 Tok/Sec
Real-World Performance Data
Limitations & Future Work
Try It Yourself