Hacker News

RLHF is one thing, but once training is done, it has no bearing on whether you can show the chain of thought to the user.


