AdapTive-LeArning Speculator System (ATLAS): Faster LLM inference
(news.ycombinator.com)