Skip to content
Tech News
clear
Topics: Today This Week This Month This Year
91.
Two different tricks for fast LLM inference (news.ycombinator.com)
92.
AI inference costs dropped up to 10x on Nvidia's Blackwell — but hardware is only half the equation (venturebeat.com)
93.
AI inference startup Modal Labs in talks to raise at $2.5B valuation, sources say (techcrunch.com)
94.
OpenAI executives were on a tear this week trying to quell critics (cnbc.com)
95.
As Rocks May Think (news.ycombinator.com)
96.
Intel's roadmap adds mysterious 'hybrid' AI processor featuring x86 CPUs, dedicated AI accelerator, and programmable IP — chip may capitalize on a market forgotten by Nvidia and AMD (tomshardware.com)
97.
Waypoint-1: Real-Time Interactive Video Diffusion from Overworld (news.ycombinator.com)
98.
Inference startup Inferact lands $150M to commercialize vLLM (techcrunch.com)
99.
Sources: Project SGLang spins out as RadixArk with $400M valuation as inference market explodes (techcrunch.com)
100.
Three types of LLM workloads and how to serve them (news.ycombinator.com)
101.
Weight Transfer for RL Post-Training in under 2 seconds (news.ycombinator.com)
102.
Jensen Huang discusses the economics of inference, power delivery, and more at CES 2026 press Q&A session — 'You sell a chip one time, but when you build software, you maintain it forever' (tomshardware.com)
103.
Launch HN: Tamarind Bio (YC W24) – AI Inference Provider for Drug Discovery (news.ycombinator.com)
104.
Nvidia just admitted the general-purpose GPU era is ending (venturebeat.com)
105.
Five Things to Know About Nvidia’s $20 Billion Licensing Deal (feeds.content.dowjones.io)
106.
Nvidia's $20 billion Groq IP deal bolsters AI market domination — hardware stack and key engineer behind Google TPUs included in bombshell agreement (tomshardware.com)
107.
Nvidia buys AI chip startup Groq's assets for $20 billion in the company's biggest deal ever — Transaction includes acquihires of key Groq employees, including CEO (tomshardware.com)
108.
Post-transformer inference: 224× compression of Llama-70B with improved accuracy (news.ycombinator.com)
109.
Vsora Jotunn-8 5nm European inference chip (news.ycombinator.com)
110.
The Easiest Way to Build a Type Checker (news.ycombinator.com)
111.
Principles of Vasocomputation (news.ycombinator.com)
112.
Cloud-Native Computing Is Poised To Explode (slashdot.org)
113.
Realizing value with AI inference at scale and in production (technologyreview.com)
114.
Cloud-native computing is poised to explode, thanks to AI inference work (zdnet.com)
115.
Baseten takes on hyperscalers with new AI training platform that lets you own your model weights (venturebeat.com)
116.
Ovi: Twin backbone cross-modal fusion for audio-video generation (news.ycombinator.com)
117.
Ovi (news.ycombinator.com)
118.
Elixir 1.19 (news.ycombinator.com)
119.
Cerebras systems raises $1.1B Series G (news.ycombinator.com)
120.
Cerebras Systems Raises $1.1B Series G at $8.1B Valuation (news.ycombinator.com)
Today's top topics: anthropic mythos 5 claude cybersecurity fable 5 amazon prime day apple kindle prime day macbook
View all today's topics →