Hackernews posts about Catbench
- DatBench fixes VLM evals: 70% blindly solvable, 42% mislabeled, 35% prod gap (www.datologyai.com)
- DatBench: Cut VLM eval compute by >10× while INCREASING signal (www.datologyai.com)
- Show HN: CatBench Vector Search Playground on Postgres (tanelpoder.com)
- CatBench Vector Search Playground on Postgres (tanelpoder.com)