Evaluating GPT5's reasoning ability using the Only Connect game show
(news.ycombinator.com)
2131.
2132.
2133.
OpenAI adds new GPT-5 models, restores o3, o4-mini and it's a mess all over again
(bleepingcomputer.com)
2134.
Evaluating LLMs playing text adventures
(news.ycombinator.com)
2135.
ChatGPT’s model picker is back, and it’s complicated
(techcrunch.com)
2136.
2137.
2138.
2139.
Launch HN: Design Arena (YC S25) – Head-to-head AI benchmark for aesthetics
(news.ycombinator.com)
2141.
2142.
Nexus: An Open-Source AI Router for Governance, Control and Observability
(news.ycombinator.com)
2143.
Evaluating LLMs Playing Text Adventures
(news.ycombinator.com)
2144.
This Gemini UI change should’ve been the default from the start (APK teardown)
(androidauthority.com)
2145.
The GPT-5 rollout has been a big mess
(arstechnica.com)
2146.
2147.
2148.
OpenAI is testing 3,000-per-week limit for GPT-5 Thinking
(bleepingcomputer.com)
2149.
2150.
2151.
Token growth indicates future AI spend per dev
(news.ycombinator.com)
2152.
2153.
xAI is testing Grok 4.20 to take on GPT-5, may launch this month
(bleepingcomputer.com)
2154.
Apple brings OpenAI's GPT-5 to iOS and macOS
(news.ycombinator.com)
2155.
An AI Model for the Brain Is Coming to the ICU
(wired.com)
2156.
2157.
2158.
The Great American EV Tax Credit Rush Has Begun
(gizmodo.com)
2159.
It shocked the market but has China's DeepSeek changed AI?
(feeds.bbci.co.uk)
2160.
GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2
(news.ycombinator.com)
Today's top topics:
apple
nasa
openai
artemis ii
microsoft
google
samsung
android authority
moon
anthropic