Hackernews posts about Benchmarks
- Benchmarks in CI: Escaping the Cloud Chaos (codspeed.io)
- Qwen3 235B beats Claude on some code benchmarks (huggingface.co)
- The AGI Final Frontier: The CLJ-AGI Benchmark (raspasov.posthaven.com)
- AI Startup Caught Cheating on Benchmark Papers (twitter.com)
- The Brokk Power Ranking LLM Coding Benchmark (brokk.ai)
- TaxCalcBench: A benchmark for evaluating AI's ability to calculate tax returns (www.columntax.com)
- Cheating on Quantum Computing Benchmarks (www.schneier.com)
- AMD Threadripper 9980X and 9970X Linux Benchmarks (www.phoronix.com)
- Giving Benchmarks a Boat (buttondown.com)
- Is your AI benchmark lying to you? (www.nature.com)
- Prisma ORM Without Rust: Latest Performance Benchmarks (www.prisma.io)
- Benchmarks in CI: Escaping the Cloud Chaos (codspeed.io)