But don't input embeddings need to undergo backprop during training? Won't the external model's embeddings just be noise, since they don't share an embedding space with the model being trained?
If the external model is also trained jointly with the main model, then I think that might work.
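A common alternative (my assumption, not something stated in the thread) is to keep the external model's embeddings frozen and learn only a small projection into the main model's embedding space; backprop then updates the projection rather than the embeddings themselves, so they don't stay "noise". A minimal numpy sketch, with made-up shapes and toy targets standing in for the main model's native embeddings:

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen embeddings from a hypothetical external model (vocab=8, dim=4).
external_emb = rng.normal(size=(8, 4))

# Trainable linear projection into the main model's 3-dim embedding space.
W = rng.normal(size=(4, 3)) * 0.1

# Toy targets standing in for the main model's "native" embedding space.
target = rng.normal(size=(8, 3))

def loss(W):
    pred = external_emb @ W  # project the frozen embeddings
    return np.mean((pred - target) ** 2)

losses = []
lr = 0.1
for _ in range(200):
    pred = external_emb @ W
    # Gradient of the mean-squared error w.r.t. W only;
    # the frozen external embeddings receive no update.
    grad = external_emb.T @ (2 * (pred - target)) / (pred.size)
    W -= lr * grad
    losses.append(loss(W))

print(f"loss: {losses[0]:.4f} -> {losses[-1]:.4f}")
```

If the external model is fine-tuned jointly (as the comment suggests), gradients would also flow through `external_emb` itself; the projection-only variant is just the cheaper option when the external model must stay fixed.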