Hacker Next
Reinforcement Learning from Human Feedback (RLHF) in Notebooks (github.com)
58 points by ash_at_hny 6 hours ago | 1 comment
kcdom1000f 3 hours ago [-]
Hl
careful_ai 2 minutes ago [-]
[dead]