
Well, it depends. For some models (especially two-tower-style models that score with a dot product), you're definitely right and it makes a huge difference. In my very limited experience with LLM embeddings, it doesn't seem to make a difference.



Interesting, I hadn’t heard of two-tower models before!

Yes, I guess it’s curious that the information lost doesn’t seem very significant (this matches my experience too!)


Two-tower models (and the various variants thereof) are popular in the early stages of recommendation system pipelines and search engine pipelines.
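To make the idea concrete, here's a minimal sketch of the two-tower pattern the thread is describing. Everything here (the dimensions, the single linear layer per tower, the candidate count) is an illustrative assumption, not anything from a specific system: each tower independently maps its side's features into a shared embedding space, and retrieval is a dot product between the two outputs.

```python
import numpy as np

rng = np.random.default_rng(0)

EMB_DIM = 8  # shared embedding dimension (assumed for illustration)

# Stand-ins for learned tower weights; in practice each tower is a
# trained neural network, not a single random linear map.
user_tower = rng.normal(size=(16, EMB_DIM))  # 16 user features -> 8-d
item_tower = rng.normal(size=(12, EMB_DIM))  # 12 item features -> 8-d

def embed(features, tower):
    """Project raw features into the shared space (one linear layer here)."""
    return features @ tower

# Item embeddings can be precomputed offline; only the user side
# runs at request time, which is why this works as an early
# retrieval stage.
user = embed(rng.normal(size=16), user_tower)            # shape (8,)
items = embed(rng.normal(size=(1000, 12)), item_tower)   # shape (1000, 8)

# Retrieval: dot-product scores against all candidates, take top-k.
scores = items @ user
top_k = np.argsort(scores)[::-1][:10]
print(top_k)  # indices of the 10 highest-scoring candidates
```

Because scoring reduces to a dot product, the candidate embeddings can be indexed with approximate nearest-neighbor search, and this is also why normalization choices matter so much for this model family, as noted upthread.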





