LLMs are, at their core, search tools: training is indexing, and prompting is querying that index. The big difference is that the granularity is at the n-gram level rather than the document level.
Properly using them requires understanding that. And just as we understand that not every query will find what we want, neither will every prompt. Iterative refinement is virtually required for nontrivial cases. Automating that process, as e.g. Cursor's agent does, is very promising.
Fundamentally, no, they're not. That's why you get cases like the Air Canada chatbot that told a user about a refund policy that didn't exist, or the lawyer in Mata v. Avianca who cited a case that didn't exist. If you ask an LLM to search for something that doesn't exist, there's a decent chance it will hallucinate something into existence for you.
What LLMs are good at is effectively turning fuzzy search terms into non-fuzzy ones; they're also pretty good at taking some text and recasting it into an extremely formulaic paradigm. In other words, turning unstructured text into something structured. The problem is that they don't have enough understanding of the world to do something useful with that structured representation when it needs to be accurate.
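To make the unstructured-to-structured point concrete, here's a rough sketch using the OpenAI Python client. The model name, the field names, and the example sentence are all just illustrative, not a recommendation:

    import json
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative; any chat model works
        response_format={"type": "json_object"},  # force well-formed JSON
        messages=[
            {"role": "system",
             "content": "Extract name, date, and amount from the user's text. Reply with JSON only."},
            {"role": "user",
             "content": "Paid Alice forty bucks on March 3rd for the ski jacket."},
        ],
    )

    record = json.loads(resp.choices[0].message.content)
    print(record)  # something like {"name": "Alice", "date": "March 3rd", "amount": 40}

The model will happily emit a tidy record every time; whether the values in it are correct is exactly the accuracy problem described above.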
This is the wrong take. Search tools are deterministic unless you purposely inject random weights into the ranking. With search tools, the same query will always yield the same results, provided they are designed to and the underlying data has not changed.
With LLMs, I can ask the exact same question and get a different response, even if the data has not changed.
The randomness comes from sampling. With local LLMs, you can fix the random seed, or even disable sampling altogether; both will get you determinism.
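A minimal sketch with Hugging Face transformers showing both options ("gpt2" is just a stand-in for whatever model you run locally):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    inputs = tok("The capital of France is", return_tensors="pt")

    # 1) Disable sampling: greedy decoding always takes the argmax token,
    #    so the output is identical on every run, no seed required.
    greedy = model.generate(**inputs, do_sample=False, max_new_tokens=20)

    # 2) Keep sampling but pin the seed: the "random" draws now repeat.
    torch.manual_seed(42)
    sampled = model.generate(**inputs, do_sample=True, max_new_tokens=20)

    print(tok.decode(greedy[0]))
    print(tok.decode(sampled[0]))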
I agree that LLMs are not search tools, but for very different reasons.
Thanks for the info on local LLMs. Based on my chats with multiple LLMs, the biggest issue appears to be hardware.
Non-deterministic hardware: all of them mentioned that modern computing hardware, such as GPUs or TPUs, can introduce non-determinism through parallel processing, caching, or numerical instability. That can make determinism hard to achieve even with fixed random seeds and deterministic algorithms.
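For what it's worth, frameworks expose knobs to fight this. A sketch of the usual PyTorch settings (these flags are real, but bit-exact runs still depend on your kernels and hardware; some ops simply have no deterministic variant and will raise an error):

    import os
    import torch

    # Must be set before the first cuBLAS call for deterministic matmuls.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"

    torch.manual_seed(0)
    torch.use_deterministic_algorithms(True)  # error out on nondeterministic kernels
    torch.backends.cudnn.benchmark = False    # autotuning can pick different kernels per run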
Semantics. It may be possible to make them deterministic, but they're unstable with respect to unrelated changes in the training data, no? If I add a page about sausages to a search index, the results for "ski jacket" will be unaffected. In a practical sense, LLMs are non-deterministic. I mean, ChatGPT even has a "regenerate" button to expose this "turbulence" as a feature.
LLMs are, at their core, fucking Dissociated Press. That's what makes them fun and interesting, and that's the problem with using them for real production work.
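For anyone who hasn't met it: Dissociated Press is the old Emacs toy that babbles by stitching together n-gram continuations drawn from a corpus. A toy word-level version in Python, just to show the family resemblance (the corpus is obviously made up):

    import random
    from collections import defaultdict

    def build_chain(text):
        # Map each word to the list of words observed to follow it.
        words = text.split()
        chain = defaultdict(list)
        for a, b in zip(words, words[1:]):
            chain[a].append(b)
        return chain

    def babble(chain, start, n=20):
        out = [start]
        for _ in range(n):
            followers = chain.get(out[-1])
            if not followers:
                break
            out.append(random.choice(followers))
        return " ".join(out)

    corpus = "the cat sat on the mat and the dog sat on the log"
    print(babble(build_chain(corpus), "the"))

Every continuation is locally plausible because it was seen somewhere in the training text; nothing guarantees the whole is true.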