1.
From 0% to 36% on Day 1 of ARC-AGI-3
(news.ycombinator.com)
2.
ARC-AGI-3
(news.ycombinator.com)
3.
Exclusive: This new benchmark could expose AI’s biggest weakness
(feeds.feedburner.com)
4.
This new benchmark could expose AI’s biggest weakness
(feeds.feedburner.com)