2. Real-time LLM Inference on Standard GPUs: 3k tokens/s per request (news.ycombinator.com)
10. Why isn't AMD's MI300X competitive? (news.ycombinator.com)
13. Is a $30,000 GPU Good at Password Cracking? (bleepingcomputer.com)
16. Scaling Karpathy's Autoresearch: What Happens When the Agent Gets a GPU Cluster (news.ycombinator.com)
20. NVIDIA is (really) profiting from the AI boom (engadget.com)
22. NVIDIA reportedly stops production of H20 AI chips (engadget.com)
25. China tells Alibaba, ByteDance to justify purchases of Nvidia AI chips (arstechnica.com)