Tech News
clear
Topic Analysis: Today This Week This Month This Year
91.
SWE-Bench Pro (news.ycombinator.com)
92.
CompileBench: Can AI Compile 22-year-old Code? (news.ycombinator.com)
93.
Tau² benchmark: How a prompt rewrite boosted GPT-5-mini by 22% (news.ycombinator.com)
94.
Tau² Benchmark: How a Prompt Rewrite Boosted GPT-5-Mini by 22% (news.ycombinator.com)
95.
Crowdstrike and Meta just made evaluating AI security tools easier (zdnet.com)
96.
Why do browsers throttle JavaScript timers? (news.ycombinator.com)
97.
Powerful GPUs or Fast Interconnects: Analyzing Relational Workloads (news.ycombinator.com)
98.
Worried about the Pixel 10 Pro XL benchmark controversy? Here’s why you shouldn’t be (androidauthority.com)
99.
How well do coding agents use your library? (news.ycombinator.com)
100.
How Well Do Coding Agents Use Your Library? (news.ycombinator.com)
101.
Benchmarking GPT-5 on 400 real-world code reviews (news.ycombinator.com)
102.
Herbie detects inaccurate expressions and finds more accurate replacements (news.ycombinator.com)
103.
Do LLMs identify fonts? (news.ycombinator.com)
104.
Efficiently Generating a Number in a Range (2018) (news.ycombinator.com)
105.
Gemini 2.5 Deep Think (news.ycombinator.com)
106.
Deep Think in the Gemini app (news.ycombinator.com)
107.
Modernising the Amiga at Forty (news.ycombinator.com)
108.
VC Victor Lazarte is leaving Benchmark to launch his own firm (techcrunch.com)
109.
A new AI coding challenge just published its first results — and they aren’t pretty (techcrunch.com)
110.
A new AI coding challenge just published its first results – and they aren’t pretty (techcrunch.com)
111.
AI coding tools are shifting to a surprising place: The terminal (techcrunch.com)
112.
AI agent benchmarks are broken (news.ycombinator.com)
113.
AI Agent Benchmarks Are Broken (news.ycombinator.com)
114.
Former Intel CEO launches a benchmark to measure AI alignment (techcrunch.com)
115.
Koala: A benchmark suite for performance-oriented shell-optimization research (news.ycombinator.com)
116.
I recommend this Windows laptop to creatives and pro users - and it's on sale for Prime Day (zdnet.com)
117.
Benchmarking Postgres (news.ycombinator.com)
118.
Slightly better named character reference tokenization than Chrome, Safari, FF (news.ycombinator.com)
119.
Can we fix AI’s evaluation crisis? (technologyreview.com)
120.
A Chinese firm has just launched a constantly changing set of AI benchmarks (technologyreview.com)
Today's top topics: amazon billion linux samsung model edge bose headphones sonos operating
View all today's topics →