Hackernews posts about Ben
- Meta got caught gaming AI benchmarks (www.theverge.com)
- Show HN: LocalScore – Local LLM Benchmark (www.localscore.ai)
- LLM Benchmark for 'Longform Creative Writing' (eqbench.com)
- Show HN: Benchi – A benchmarking tool written in Go (github.com)
- Which countries would benefit most from an American brain drain? (www.economist.com)
- OpenAI wants to bend copyright rules. Study suggests it isn't waiting (www.theregister.com)
- Benefits of Apache Iceberg for geospatial data analysis (wherobots.com)
- Therapy chatbot trial yields mental health benefits (home.dartmouth.edu)
- Medical Benchmarks and the Myth of the Universal Patient (www.newyorker.com)
- DeepSeek-V3-0324 Crushes GPT-4.5 in Math and Code Benchmarks at 1/277 the Cost (api-docs.deepseek.com)
- We need a better term for GenAI output – "slop" is too benign (www.rockpapershotgun.com)
- #1 open-source agent on SWE-Bench Verified by combining Claude 3.7 and O1 (www.augmentcode.com)
- LocalScore: A Local LLM Benchmark (www.localscore.ai)
- This Bench Does Not Exist (doesnotexist.openbenches.org)
- Why Software Consultants Benefit from Liberal Arts Education (spin.atomicobject.com)
- NPB-Rust: NAS Parallel Benchmarks in Rust (arxiv.org)
- The Curve Is Bending (grantslatton.com)
- No Man's Sky's newest update adds bones and fossils beneath the Earth (www.polygon.com)
- Vending-Bench: Testing long-term coherence in agents (andonlabs.com)
- Don't overlook the many benefits of plastics (www.economist.com)