Hackernews posts about HumanEval
- BigCodeBench: The Next Generation of HumanEval (github.com)
- HumanEval is saturated: new coding LLM benchmark released (bigcode-bench.github.io)
- Running HumanEval Safely with Riza (riza.io)
- Show HN: I built the LLM Comparison Tool I wish existed (llm-stats.com)
- Show HN: European Swallow AI – Sonnet-quality coding at $2.60/M tokens (www.europeanswallowai.com)
- Show HN: Fine-Tuning Index of Open-Source LLMs vs. OpenAI (predibase.com)
- Show HN: Atlas: Independent Evals and Benchmarking for Generative AI Models (app.layerlens.ai)
- Beat GPT-4o at Python with 100 dumb LLaMAs (modal.com)
- Humanely dealing with humungus crawlers (flak.tedunangst.com)
- Show HN: HumanAlarm – Real people knock on your door to wake you up (humanalarm.com)
- HumaneAI pin maker selling itself for $1B (gizmodo.com)
- Valuing Humans in the Age of Superintelligence: HumaneRank (roadtoartificia.com)
- Humanimals (twitter.com)
- Humanely Dealing with Humungus Crawlers (flak.tedunangst.com)
- Refactoring Humanely and "Accidental Pomodoro" (melatonin.dev)
- How to Kill Bugs Humanely (reducing-suffering.org)
- Humanely dealing with humungus crawlers (flak.tedunangst.com)
- Humanely Dealing with Humungus Crawlers (flak.tedunangst.com)
- Show HN: Find humanely raised animal-based products (findhumane.com)
- Show HN: The $10 coffee that tanked my credit score (cretit.com)