Hacker News posts about RLHF

Related: Stability AI, ChatGPT, LLM, Alpaca
  1. RLHF Book (rlhfbook.com)
  2. Dispelling misconceptions about RLHF (aerial-toothpaste-34a.notion.site)
  3. RLHF from Scratch (github.com)