Hackernews posts about RLHF
- Show HN: Think Fu – Metacognition as a service (thinkfu.org)
- Was the Iran War Caused by AI Psychosis? (houseofsaud.com)
- Reinforcement Learning (I.e. Policy Gradient Algorithms) (rlhfbook.com)
- My Dev Box Setup Script (rlafuente.com)
- AdGPT (adgpt.rlafuente.com)
- LLM-Native Advertising (What Ads in GenAI Will Look Like) (adgpt.rlafuente.com)
- Shaping the exploration of the motivation-space matters for AI safety (www.lesswrong.com)
- RLHF Book (rlhfbook.com)
- RLHF is just barely RL (twitter.com)
- Dispelling misconceptions about RLHF (aerial-toothpaste-34a.notion.site)
- RLHF from Scratch (github.com)
- Direct Preference Optimization vs. RLHF (www.together.ai)
- RLHF Is Cr*P, It's a Paint Job on a Rusty Car: Geoffrey Hinton (officechai.com)
- Andrej Karpathy on X: RLHF is just barely RL (twitter.com)
- Using Hallucinations to Bypass RLHF Filters (arxiv.org)
- RLHF reduces LLM creativity and output variety (twitter.com)
- Finetuning or RLHF on Anthropic (www.anthropic.com)
- Language Models Learn to Mislead Humans via RLHF (arxiv.org)
- RLHF Sycophancy: Gemini 3.0 discards calculated data to mimic user edits (tomaszmachnik.pl)
- Extreme sycophancy RLHF is needed (twitter.com)
- RLHF with Dagster and Modal (kyrylai.com)
- Ring-1T: Trillion-Parameter Model Trained with RLVR and RLHF (ant-ling.medium.com)
- Opal: An Operator Algebra View of RLHF (arxiv.org)
- RLHF: Reinforcement Learning from Human Feedback (huyenchip.com)
- A Short Introduction to RLHF (ttumiel.com)
- Reducing RLHF hallucinations and sycophancy in Gemini 3 (Interactive Demo) (tomaszmachnik.pl)
- Notes on RLHF Book by Nathan Lambert (shubhamg.bearblog.dev)