Sneak peek into batched speculative decoding in LLMs
13 min read · 2026
13 min read · January 22, 2026
2026 · AI LLMs inference · projects