Following the Text Gradient at Scale
(news.ycombinator.com)
1.
2.
4.
OpenAI talks about not talking about goblins
(theverge.com)
5.
How to build custom reasoning agents with a fraction of the compute
(venturebeat.com)
6.
7.
8.
9.
10.
11.
Evaluating large language models for accuracy incentivizes hallucinations
(feeds.nature.com)
12.
MiniMax M2.7 Is Now Open Source
(news.ycombinator.com)
13.
Simulating a 2D Quadcopter from Scratch
(news.ycombinator.com)
14.
Meta's Superintelligence Lab unveils its first public model, Muse Spark
(arstechnica.com)
15.
The ladder is missing rungs – Engineering Progression When AI Ate the Middle
(news.ycombinator.com)
16.
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models
(news.ycombinator.com)
17.
Improving Composer through real-time RL
(news.ycombinator.com)
18.
Mercor competitor Deccan AI raises $25M, sources experts from India
(techcrunch.com)
19.
Training Driving AI at 50,000× Real Time
(spectrum.ieee.org)
20.
21.
An FAQ on Reinforcement Learning Environments
(news.ycombinator.com)
22.
23.
Ndea (YC W26) is hiring a symbolic RL search guidance lead
(news.ycombinator.com)
24.
25.
26.
CoreWeave acquires agent-training startup OpenPipe
(techcrunch.com)
27.
Reinforcement learning, explained with a minimum of math and jargon
(news.ycombinator.com)