ProgramBench: Can language models rebuild programs from scratch?
(news.ycombinator.com)
3.
Salesforce says it will hire 1,000 ‘AI-native’ new grads
(feeds.feedburner.com)
5.
Hackers Hate AI Slop Even More Than You Do
(wired.com)
7.
CO2 Levels In the Atmosphere Hit 'Depressing' New Record
(slashdot.org)
11.
DeepClaude – Claude Code agent loop with DeepSeek V4 Pro
(news.ycombinator.com)
12.
New statue in London, attributed to Banksy, of a suited man, blinded by a flag
(news.ycombinator.com)
13.
Mercedes-Benz commits to bringing back physical buttons
(news.ycombinator.com)
15.
Ulta Promo Codes: Up to 50% Off in May
(wired.com)
17.
How fast is a macOS VM, and how small could it be?
(news.ycombinator.com)
19.
Discovering hard disk physical geometry through microbenchmarking (2019)
(news.ycombinator.com)
21.
Show HN: A new benchmark for testing LLMs for deterministic outputs
(news.ycombinator.com)
22.
A Decade of AMD Ryzen: 10 Years of CPUs Tested
(techspot.com)
24.
A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all
(news.ycombinator.com)
26.
Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview
(news.ycombinator.com)
28.
The predictable failure of the QDay Prize
(news.ycombinator.com)
29.
SWE-bench Verified no longer measures frontier coding capabilities
(news.ycombinator.com)
30.
Why SWE-bench Verified no longer measures frontier coding capabilities
(news.ycombinator.com)