Show HN: Terminal-Bench-RL: Training Long-Horizon Terminal Agents with RL
(news.ycombinator.com)
212.
GLM-4.5: Reasoning, Coding, and Agentic Abilities
(news.ycombinator.com)
214.
Is HR ready for AI?
(zdnet.com)
215.
How to scale RL to 10^26 FLOPs
(news.ycombinator.com)
216.
The upcoming GPT-3 moment for RL
(news.ycombinator.com)
217.
ETH Zurich and EPFL to release a LLM developed on public infrastructure
(news.ycombinator.com)
219.
LLM-Ready Training Dataset for Apple's Foundation Models (iOS 26)
(news.ycombinator.com)
220.
Smollm3: Smol, multilingual, long-context reasoner LLM
(news.ycombinator.com)
224.
Can the music industry make AI the next Napster?
(theverge.com)
225.
Did AI companies win a fight with authors? Technically
(theverge.com)
226.
Reinforcement learning, explained with a minimum of math and jargon
(news.ycombinator.com)
227.
Fault Tolerant Llama training – PyTorch blog
(news.ycombinator.com)
232.
Mastodon updates its terms to prohibit AI model training
(techcrunch.com)
236.
I built a large language model "from scratch"
(news.ycombinator.com)