Hacker News posts about RLHF
- Dispelling misconceptions about RLHF (aerial-toothpaste-34a.notion.site)
- Alan Turing Institute accused of 'toxic' culture (www.thetimes.com)
- Show HN: 16-Pad Sampler from Your Videos (sampler.rlafuente.com)
- RLHF Book (rlhfbook.com)
- RLHF is just barely RL (twitter.com)
- RLHF a LLM in <50 lines of Python (datadreamer.dev)
- Training and aligning LLMs with RLHF and RLHF alternatives (magazine.sebastianraschka.com)
- Direct Preference Optimization vs. RLHF (www.together.ai)
- Tune PaLM 2 with your own RLHF training data (github.com)
- RLHF Is Cr*P, It's a Paint Job on a Rusty Car: Geoffrey Hinton (officechai.com)
- Andrej Karpathy on X: RLHF is just barely RL (twitter.com)
- Using Hallucinations to Bypass RLHF Filters (arxiv.org)
- RLHF reduces LLM creativity and output variety (twitter.com)
- Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF (www.interconnects.ai)
- Finetuning or RLHF on Anthropic (www.anthropic.com)
- Language Models Learn to Mislead Humans via RLHF (arxiv.org)
- Extreme sycophancy RLHF is needed (twitter.com)
- RLHF with Dagster and Modal (kyrylai.com)
- RLHF Learning Resources in 2024 (www.interconnects.ai)
- RLHF 201 (www.latent.space)
- Multimodal LM roundup: Unified IO 2, I/O, Gemini, LLaVA-RLHF, and RLHF questions (www.interconnects.ai)
- Nvidia Toolkit for RLHF (github.com)
- Is Google running RLHF for free on Bard users? (old.reddit.com)
- Reinforcement Learning: ChatGPT and RLHF [video] (www.youtube.com)
- RLHF and LLM Evaluations [video] (www.youtube.com)
- Rethinking the Role of PPO in RLHF (thwu1.github.io)
- Made a Swipe Based Twitter for RLHF (tagalong.ai)