Hackernews posts about Benchmarks
- Nvidia DGX Spark: When benchmark numbers meet production reality (publish.obsidian.md)
- JanitorBench: A new LLM benchmark for multi-turn chats (about.janitorai.com)
- JavaScript Engines Benchmarks (ivankra.github.io)
- AMD vs. Intel: A Unicode Benchmark (lemire.me)
- Show HN: The Legal Embedding Benchmark (MLEB) (huggingface.co)
- Why Alpha Arena was a bad benchmark (borisagain.substack.com)
- To solve the benchmark crisis, evals must think (blog.fig.inc)
- Hetzner Servers Benchmark (softuts.com)
- Exasol Outperforms ClickHouse by 10x on TPC-H Analytical Benchmark (www.exasol.com)
- Nix CI Benchmarks (garnix-io.github.io)
- Apple M5 chip smashes Snapdragon X2 Elite in early single-thread benchmarks (www.tomshardware.com)
- WebGPU Benchmark: 15M Moving Nodes in Browser (ajlaston.github.io)
- Benchmark for Agent Context Engineering (2025) (www.tarasyarema.com)
- Server rendering benchmarks: Railway vs. Cloudflare vs. Vercel (blog.railway.com)
- Brazil Hedge Funds Outperform Benchmark as Bullish Bets Pay Off (www.bloomberg.com)
- AMD vs. Intel: A Unicode Benchmark (lemire.me)
- Show HN: CellARC: ARC-AGI style benchmark built on cellular automata (cellarc.mireklzicar.com)