Skip to content
Tech News
clear
Topics: Today This Week This Month This Year
1.
Canonical's Upcoming AI Tool: Talk to Ubuntu Instead of Typing (slashdot.org)
2.
AI inference startup Baseten reportedly raising $1.5B months after its last mega-round (techcrunch.com)
3.
AI inference startup Baseten reportedly raising $1.5B months after its last mega round (techcrunch.com)
4.
The Null Is Always False (Except When It Is True) (2014) (news.ycombinator.com)
5.
Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes (venturebeat.com)
6.
Upstart chipmakers keep challenging Nvidia. This time it's Microsoft-backed D-Matrix (cnbc.com)
7.
MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 tokens per second (news.ycombinator.com)
8.
Claude AI: What's free in 2026 and what isn't? (engadget.com)
9.
Perplexity AI unveils hybrid local-cloud inference system at Computex 2026 (venturebeat.com)
10.
Perplexity Computer adding ability to split tasks between local and cloud models (9to5mac.com)
11.
Bringing Up DeepSeek-V4-Flash on AMD MI300X (news.ycombinator.com)
12.
How is Groq raising more money? (news.ycombinator.com)
13.
Intel details long-awaited Crescent Island AI GPU at Computex, boasts up to 480 GB of LPDDR5X to combat memory shortages — company shares more details of its Xe3P inference accelerator at Computex (tomshardware.com)
14.
1-Bit Bonsai Image 4B Image Generation for Local Devices (news.ycombinator.com)
15.
Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA (news.ycombinator.com)
16.
After Nvidia’s $20B not-acqui-hire, AI chip startup Groq reportedly raising $650M (techcrunch.com)
17.
After Nvidia’s $20B not-aqui-hire, AI chip startup Groq reportedly raising $650M (techcrunch.com)
18.
Real-time LLM Inference on Standard GPUs: 3k tokens/s per request (news.ycombinator.com)
19.
Has the hunt for AI compute uncovered the next Cerebras? (techcrunch.com)
20.
Stress disrupts hippocampal integration of overlapping events, memory inference (news.ycombinator.com)
21.
Use boring languages with LLMs (news.ycombinator.com)
22.
Use Boring Languages with LLMs (news.ycombinator.com)
23.
The current AI pricing was always going to go away (news.ycombinator.com)
24.
Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint (news.ycombinator.com)
25.
KV Cache Is Becoming the Memory Hierarchy of Inference (news.ycombinator.com)
26.
UK sovereign LLM inference (news.ycombinator.com)
27.
Cerebras stock nearly doubles on day one as AI chipmaker hits $100 billion — what it means for AI infrastructure (venturebeat.com)
28.
$200 'socketed' Nvidia AI GPU for servers hacked into a PCIe card with custom PCB and 3D-printed cooling — modded Tesla V100 SMX data center GPU runs AI LLMs and is more efficient than many modern midrange offerings in AI inference (tomshardware.com)
29.
Abstract Machines for Logic Programs (news.ycombinator.com)
30.
AI Computing Is a Memory Hog. An Nvidia-Backed Startup Has an Answer. (feeds.content.dowjones.io)
Today's top topics: apple steam machine valve spacex prime day amazon zdnet google anthropic elon musk
View all today's topics →