Multi-Stream LLMs: new paper on parallelizing/separating prompts, thinking, I/O
(news.ycombinator.com)
1.
2.