Skip to content
Tech News
clear
Topics: Today This Week This Month This Year
1.
Show HN: Agent-skills-eval – Test whether Agent Skills improve outputs (news.ycombinator.com)
2.
The RAG era is ending for agentic AI — a new compilation-stage knowledge layer is what comes next (venturebeat.com)
3.
Archaeologists Unearth ‘Advanced’ Gold Dental Bridge in Medieval Scottish Grave (gizmodo.com)
4.
The AI scaffolding layer is collapsing. LlamaIndex's CEO explains what survives. (venturebeat.com)
5.
The retrieval rebuild: Why hybrid retrieval intent tripled as enterprise RAG programs hit the scale wall (venturebeat.com)
6.
Is fun at work overrated? (feeds.feedburner.com)
7.
I Won a Championship That Doesn't Exist (news.ycombinator.com)
8.
RAG precision tuning can quietly cut retrieval accuracy by 40%, putting agentic pipelines at risk (venturebeat.com)
9.
Monitoring LLM behavior: Drift, retries, and refusal patterns (venturebeat.com)
10.
Closure of China’s influential journal ranking leaves academics reeling — what will take its place? (feeds.nature.com)
11.
Evaluating large language models for accuracy incentivizes hallucinations (feeds.nature.com)
12.
Watch the Lego ‘Project Hail Mary’ Set (Almost) Go to Space (gizmodo.com)
13.
Duolingo was evaluating its workers’ AI use. Workers pushed back. (feeds.feedburner.com)
14.
Show HN: Continual Learning with .md (news.ycombinator.com)
15.
Scientists Used Medieval Poems and Trees to Uncover a 13th-Century Solar Surge (gizmodo.com)
16.
Wit, unker, Git: The lost medieval pronouns of English intimacy (news.ycombinator.com)
17.
A Digital Compute-in-Memory Architecture for NFA Evaluation (news.ycombinator.com)
18.
Smart people recognize each other – science proves it (news.ycombinator.com)
19.
How to deal with a passive-aggressive colleague (feeds.feedburner.com)
20.
General scales unlock AI evaluation with explanatory and predictive power (feeds.nature.com)
21.
The story of Britain's oldest sweet, the Pontefract Cake (2019) (news.ycombinator.com)
22.
Red Rooms makes online poker as thrilling as its serial killer (theverge.com)
23.
Duolingo’s CEO Uses a Secret Test to Evaluate Job Candidates — Before They Even Step into the Interview (feeds.feedburner.com)
24.
Chroma Context-1: Training a Self-Editing Search Agent (news.ycombinator.com)
25.
Show HN: Claude skill that evaluates B2B vendors by talking to their AI agents (news.ycombinator.com)
26.
Gerard of Cremona (news.ycombinator.com)
27.
I built an AI receptionist for a mechanic shop (news.ycombinator.com)
28.
Our whole way of thinking about leadership is a century out of date (feeds.feedburner.com)
29.
Tech Employees Are Reportedly Being Evaluated by How Fast They Burn Through LLM Tokens (gizmodo.com)
30.
VisiCalc Reconstructed (news.ycombinator.com)
Today's top topics: peacock premier league
View all today's topics →