Hackernews posts about RLHF

Related: Stability AI   ChatGPT   LLM   Alpaca  
  1. How RLHF Works (www.interconnects.ai)
  2. Constitutional AI: RLHF on Steroids (astralcodexten.substack.com)
  3. Constitutional AI: RLHF on Steroids (astralcodexten.substack.com)
  4. Unpacking the HF in RLHF (maestroai.substack.com)