Hacker News posts about Evans
- Oliver Evans (en.wikipedia.org)
- How Will OpenAI Compete? (www.ben-evans.com)
- AI, networks and Mechanical Turks (2025) (www.ben-evans.com)
- Opus 4.6 and Codex 5.3 (simonwillison.net)
- AGENTS.md outperforms skills in our agent evals (vercel.com)
- Promptfoo: Local LLM evals and red teaming (github.com)
- Infrastructure configuration can swing coding evals by several % points (www.anthropic.com)
- Randomness in Agentic Evals (arxiv.org)
- If agents use your tool, you need evals (tessl.io)
- Demystifying Evals for AI Agents (www.anthropic.com)
- Testing Agent Skills Systematically with Evals (developers.openai.com)
- Don't Write Evals for Fast-Moving Systems (simon.podhajsky.net)
- Show HN: Optimize_anything: A Universal API for Optimizing Any Text Parameter (gepa-ai.github.io)