Reinforcement Learning from Human Feedback (RLHF) in Notebooks (github.com)
69 points by ash_at_hny 12 hours ago