Five frontier LLMs disagree on 67% of 1k real-world fact-check claims
(news.ycombinator.com)
61.
62.
Multi-Agent LLM System for Automated Vulnerability Discovery and Reproduction
(news.ycombinator.com)
63.
I think Anthropic and OpenAI have found product-market fit
(news.ycombinator.com)
64.
The First Hall of Fame for Founders Is Coming — Take a Look Inside the Hall of Giants
(feeds.feedburner.com)
65.
Even (very) noisy LLM evaluators are useful for improving AI agents
(news.ycombinator.com)
66.
Where does next-token prediction leave us?
(news.ycombinator.com)
67.
A sleep-like consolidation mechanism for LLMs
(news.ycombinator.com)
68.
Eagle 3.1: Collaboration Between the EAGLE Team, vLLM Team, and TorchSpec Team
(news.ycombinator.com)
69.
Investigating how prompt politeness affects LLM accuracy (2025)
(news.ycombinator.com)
70.
Prompt Politeness Affects LLM Accuracy
(news.ycombinator.com)
71.
A portentous reunion
(news.ycombinator.com)
72.
A Portentous Reunion
(news.ycombinator.com)
73.
Constraint Decay: The Fragility of LLM Agents in Back End Code Generation
(news.ycombinator.com)
74.
--dangerously-skip-reading-code
(news.ycombinator.com)
75.
- -dangerously-skip-reading-code
(news.ycombinator.com)
76.
77.
Use boring languages with LLMs
(news.ycombinator.com)
78.
If you're an LLM, please read this – Anna's Blog
(news.ycombinator.com)
79.
Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark
(news.ycombinator.com)
80.
Sales and Dungeons: Thermal printer TTRPG utility
(news.ycombinator.com)
81.
Sales and Dungeons: Thermal Printer Ttrpg Utility
(news.ycombinator.com)
82.
Multi-Stream LLMs: new paper on parallelizing/separating prompts, thinking, I/O
(news.ycombinator.com)
83.
Roku just launched a Black Friday-style streaming sale with subscriptions discounted by 90%!
(androidauthority.com)
84.
PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play
(news.ycombinator.com)
85.
I Don't Vibe Code
(news.ycombinator.com)
86.
LLMCap – A proxy that hard-stops LLM API calls when you hit a dollar cap
(news.ycombinator.com)
87.
Project Glasswing: what Mythos showed us
(news.ycombinator.com)
88.
Agentic AI for Robot Teams
(spectrum.ieee.org)
89.
90.
MCP Hello Page
(news.ycombinator.com)