Gemini 2.5 Deep Think
(news.ycombinator.com)
121.
122.
Deep Think in the Gemini app
(news.ycombinator.com)
123.
VC Victor Lazarte is leaving Benchmark to launch his own firm
(techcrunch.com)
124.
AI agent benchmarks are broken
(news.ycombinator.com)
125.
AI Agent Benchmarks Are Broken
(news.ycombinator.com)
126.
Former Intel CEO launches a benchmark to measure AI alignment
(techcrunch.com)
127.
Koala: A benchmark suite for performance-oriented shell-optimization research
(news.ycombinator.com)
128.
Benchmarking Postgres
(news.ycombinator.com)
129.
Slightly better named character reference tokenization than Chrome, Safari, FF
(news.ycombinator.com)
130.
Can we fix AI’s evaluation crisis?
(technologyreview.com)
131.
132.
133.
Benchmark: snapDOM vs html2canvas
(news.ycombinator.com)
134.
Benchmark: SnapDOM may be a serious alternative to html2canvas
(news.ycombinator.com)