Measuring Political Bias in Claude
(news.ycombinator.com)
31.
32.
Laude Institute announces first batch of ‘Slingshots’ AI grants
(techcrunch.com)
33.
How to Evaluate LLMs and GenAI Workflows Holistically
(computer.org)
34.
35.
Open-source MCPEval makes protocol-level agent testing plug-and-play
(venturebeat.com)
36.
LSM-2: Learning from incomplete wearable sensor data
(news.ycombinator.com)
37.
Launch HN: Confident AI (YC W25) – Open-source evaluation framework for LLM apps
(news.ycombinator.com)