Skip to content
Tech News
clear
Topics: Today This Week This Month This Year
1.
OpenAI Announces Benchmarks for AI Life Sciences Research. Its Best Model Failed 63.9% of the Test (slashdot.org)
2.
How We Broke Top AI Agent Benchmarks: And What Comes Next (news.ycombinator.com)
3.
The Download: gig workers training humanoids, and better AI benchmarks (technologyreview.com)
4.
AI benchmarks are broken. Here’s what we need instead. (technologyreview.com)
5.
Exclusive: This new benchmark could expose AI’s biggest weakness (feeds.feedburner.com)
Today's top topics: prime day reviews zdnet
View all today's topics →