Speculative KV coding: losslessly compressing KV cache by up to ~4×
(news.ycombinator.com)
1.
2.
Bypassing the Branch Predictor
(news.ycombinator.com)
3.
The ITTAGE indirect branch predictor
(news.ycombinator.com)