PBM Drug Pricing Distortion Report
(news.ycombinator.com)
121.
122.
123.
124.
Study identifies weaknesses in how AI systems are evaluated
(news.ycombinator.com)
125.
AI benchmarks are a bad joke – and LLM makers are the ones laughing
(news.ycombinator.com)
126.
127.
128.
AI Model Growth Outpaces Hardware Improvements
(spectrum.ieee.org)
129.
Battlefield 6 Benchmark: 43 GPUs Tested
(techspot.com)
130.
131.
Unpacking Cloudflare Workers CPU Performance Benchmarks
(news.ycombinator.com)
132.
Microsoft lets bosses spot teams that are dodging Copilot
(news.ycombinator.com)
133.
Leaked Apple iPad Pro M5 benchmark shows massive improvements
(bleepingcomputer.com)
134.
Tau² benchmark: How a prompt rewrite boosted GPT-5-mini by 22%
(news.ycombinator.com)
136.
137.
Why do browsers throttle JavaScript timers?
(news.ycombinator.com)
138.
Worried about the Pixel 10 Pro XL benchmark controversy? Here’s why you shouldn’t be
(androidauthority.com)
139.
Benchmarking GPT-5 on 400 real-world code reviews
(news.ycombinator.com)
140.
Herbie detects inaccurate expressions and finds more accurate replacements
(news.ycombinator.com)
141.
Do LLMs identify fonts?
(news.ycombinator.com)
142.
Efficiently Generating a Number in a Range (2018)
(news.ycombinator.com)
143.
Gemini 2.5 Deep Think
(news.ycombinator.com)
144.
Deep Think in the Gemini app
(news.ycombinator.com)
145.
VC Victor Lazarte is leaving Benchmark to launch his own firm
(techcrunch.com)
146.
AI agent benchmarks are broken
(news.ycombinator.com)
148.
Former Intel CEO launches a benchmark to measure AI alignment
(techcrunch.com)
149.
Koala: A benchmark suite for performance-oriented shell-optimization research
(news.ycombinator.com)
150.
Benchmarking Postgres
(news.ycombinator.com)