The original GPT models did this a lot iirc.


Maybe the role reversal breaks most of the RLHF training. That training was almost certainly not done with the conversation roles reversed, so such prompts could be out of distribution. If so, this is a glimpse of the intelligence of the LLM core without the RL/RAG/etc. tape-and-glue layers on top.
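To make "role reversal" concrete, here's a minimal sketch assuming an OpenAI-style chat messages list (the role names and the helper are just illustrative, not any particular API):

    # Normal orientation: RLHF saw mountains of transcripts shaped like this.
    normal = [
        {"role": "user", "content": "What's the capital of France?"},
        {"role": "assistant", "content": "The capital of France is Paris."},
    ]

    def reverse_roles(messages):
        """Swap user/assistant so the model is prompted to play the human side."""
        flip = {"user": "assistant", "assistant": "user"}
        return [{**m, "role": flip.get(m["role"], m["role"])} for m in messages]

    # Reversed: the model now has to generate the *user's* side of the chat,
    # a framing the preference tuning presumably never optimized for.
    reversed_chat = reverse_roles(normal)

Sampling a completion on top of reversed_chat asks the model to continue as the questioner, which is plausibly far from anything the RLHF stage rewarded.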



