31.
32.
Waypoint-1: Real-Time Interactive Video Diffusion from Overworld
(news.ycombinator.com)
33.
Inference startup Inferact lands $150M to commercialize vLLM
(techcrunch.com)
34.
35.
Three types of LLM workloads and how to serve them
(news.ycombinator.com)
36.
Weight Transfer for RL Post-Training in under 2 seconds
(news.ycombinator.com)
37.
38.
Launch HN: Tamarind Bio (YC W24) – AI Inference Provider for Drug Discovery
(news.ycombinator.com)
39.
Nvidia just admitted the general-purpose GPU era is ending
(venturebeat.com)
40.
Five Things to Know About Nvidia’s $20 Billion Licensing Deal
(feeds.content.dowjones.io)
41.
42.
43.
Post-transformer inference: 224× compression of Llama-70B with improved accuracy
(news.ycombinator.com)
44.
Vsora Jotunn-8 5nm European inference chip
(news.ycombinator.com)
45.
The Easiest Way to Build a Type Checker
(news.ycombinator.com)
46.
Principles of Vasocomputation
(news.ycombinator.com)
47.
Cloud-Native Computing Is Poised To Explode
(slashdot.org)
48.
Realizing value with AI inference at scale and in production
(technologyreview.com)
50.
51.
Ovi: Twin backbone cross-modal fusion for audio-video generation
(news.ycombinator.com)
52.
Ovi
(news.ycombinator.com)
53.
Elixir 1.19
(news.ycombinator.com)
54.
Cerebras systems raises $1.1B Series G
(news.ycombinator.com)
55.
Cerebras Systems Raises $1.1B Series G at $8.1B Valuation
(news.ycombinator.com)
56.
GPT-OSS Reinforcement Learning
(news.ycombinator.com)
57.
Show HN: Run Qwen3-Next-80B on 8GB GPU at 1tok/2s throughput
(news.ycombinator.com)
58.
Defeating Nondeterminism in LLM Inference
(news.ycombinator.com)
59.
Some users report their Firefox browser is scoffing CPU power
(news.ycombinator.com)
60.
Token growth indicates future AI spend per dev
(news.ycombinator.com)