Real-time LLM Inference on Standard GPUs: 3k tokens/s per request
(news.ycombinator.com)
Why isn't AMD's MI300X competitive?
(news.ycombinator.com)
Is a $30,000 GPU Good at Password Cracking?
(bleepingcomputer.com)