Gemini 2.5 Deep Think
(news.ycombinator.com)
91.
92.
Deep Think in the Gemini app
(news.ycombinator.com)
93.
VC Victor Lazarte is leaving Benchmark to launch his own firm
(techcrunch.com)
94.
AI agent benchmarks are broken
(news.ycombinator.com)
95.
AI Agent Benchmarks Are Broken
(news.ycombinator.com)
96.
Former Intel CEO launches a benchmark to measure AI alignment
(techcrunch.com)
97.
Koala: A benchmark suite for performance-oriented shell-optimization research
(news.ycombinator.com)
98.
Benchmarking Postgres
(news.ycombinator.com)
99.
Slightly better named character reference tokenization than Chrome, Safari, FF
(news.ycombinator.com)
100.
Can we fix AI’s evaluation crisis?
(technologyreview.com)
101.
102.
103.
Benchmark: snapDOM vs html2canvas
(news.ycombinator.com)
104.
Benchmark: SnapDOM may be a serious alternative to html2canvas
(news.ycombinator.com)
Today's top topics:
apple
openai
anthropic
nvidia
chatgpt
google
macbook neo
automation
silicon valley
openclaw