Large language models often know when they are being evaluated
(news.ycombinator.com)
27511.
27512.
Scientists Reveal Easy Three-Step Plan to Terraform Mars
(futurism.com)
27513.
What to read this weekend: Vampires and more vampires
(engadget.com)
27514.
What is systems programming, really? (2018)
(news.ycombinator.com)
27515.
“Language and Image Minus Cognition”: An Interview with Leif Weatherby
(news.ycombinator.com)
27516.
We investigated Amsterdam's attempt to build a 'fair' fraud detection model
(news.ycombinator.com)
27517.
Clinical knowledge in LLMs does not translate to human interactions
(news.ycombinator.com)
27518.
Sony is Still Putting Its Faith in ‘Marathon’
(gizmodo.com)
27520.
13 Best Laptops of 2025, Tested and Reviewed
(wired.com)
27521.
27522.
27523.
5 reasons why buying the latest flagship is not always a good idea
(androidauthority.com)
27524.
27525.
27526.
Mollusk shell assemblages as a tool for identifying unaltered seagrass beds
(news.ycombinator.com)
27527.
Solar Orbiter gets world-first views of the Sun's poles
(news.ycombinator.com)
27528.
Unsupervised Elicitation of Language Models
(news.ycombinator.com)
27529.
27530.
Laika’s ‘ParaNorman’ Is Coming Back to Theaters
(gizmodo.com)
27531.
27533.
27534.
Biofuels policy has been a failure for the climate, new report claims
(arstechnica.com)
27535.
27536.
27537.
27538.
27539.
27540.
Rethinking Losses for Diffusion Bridge Samplers
(news.ycombinator.com)