Eagle 3.1: Collaboration Between the EAGLE Team, vLLM Team, and TorchSpec Team
(news.ycombinator.com)
1.
2.
Accelerating Gemma 4: faster inference with multi-token prediction drafters
(news.ycombinator.com)