Hackernews posts about Evals

  1. Macro Evals for Agentic Systems (developers.openai.com)