How to Evaluate LLMs and GenAI Workflows Holistically
(computer.org)
1.
2.
Evaluating Agents
(news.ycombinator.com)