NHacker Next

Reinforcement Learning from Human Feedback (RLHF) in Notebooks (github.com)

58 points by ash_at_hny 6 hours ago | 1 comment

kcdom1000f 3 hours ago [-]

Hl

careful_ai 2 minutes ago [-]

[dead]
