I'm most excited about what this is going to look like not when we abandon RAG, but when we pair it with these massive context windows.
If you can parse an entire book to identify relevant chunks using RAG and can fit an entire book into a context window, that means you can fit relevant chunks from an entire reference library into the context window too.
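Concretely, the pairing could be as simple as ranking chunks across the whole library and then packing the top ones into a much larger token budget. A minimal sketch, using a placeholder overlap scorer and a crude character-count token estimate instead of a real embedding model or tokenizer:

```python
# Sketch: retrieve the most relevant chunks from a whole reference library,
# then pack as many as fit under a large context budget. The scorer and the
# token counter here are crude placeholders, not a real retriever/tokenizer.

from dataclasses import dataclass

@dataclass
class Chunk:
    source: str   # which book/document the chunk came from
    text: str

def relevance(query: str, chunk: Chunk) -> float:
    # Placeholder scorer: term overlap. A real system would use embeddings.
    q = set(query.lower().split())
    c = set(chunk.text.lower().split())
    return len(q & c) / (len(q) or 1)

def rough_tokens(text: str) -> int:
    # Very rough heuristic: ~4 characters per token.
    return len(text) // 4

def build_context(query: str, library: list[Chunk], budget_tokens: int = 900_000) -> str:
    # Rank every chunk in the library, then greedily pack the best ones
    # until the (large) context budget is exhausted.
    ranked = sorted(library, key=lambda ch: relevance(query, ch), reverse=True)
    parts, used = [], 0
    for ch in ranked:
        cost = rough_tokens(ch.text)
        if used + cost > budget_tokens:
            break
        parts.append(f"[{ch.source}]\n{ch.text}")
        used += cost
    return "\n\n".join(parts)
```

The only real change from classic RAG is the budget: instead of squeezing a handful of passages into a few thousand tokens, the same retrieval step can feed hundreds of chunks from many different works.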
What I'd like to know is whether that just leads you back to hallucinations. I.e.: is the avoidance of hallucinations intrinsically due to forcing the LLM to consider a limited context, rather than due to directing it to specific, on-topic context? I'm not sure how well this has been established for large context windows.
Having details in context seems to reduce hallucinations, which makes sense if we switch to using the more accurate term: confabulations.
LLM confabulations generally occur when the model doesn't have the information to answer, so it makes something up, similar to what you see in split-brain studies, where one hemisphere is shown something that triggers a reaction and the other hemisphere explains it with BS.
So yes, RAG is always going to potentially produce confabulations if it cuts off the relevant data. But large contexts themselves shouldn't cause them.
> you can fit relevant chunks from an entire reference library into the context window too
I'm curious: if a large language model uses an extensive context that includes multiple works, copyrighted or not, to produce text that differs significantly from the source material, would that constitute infringement? Given that the model is performing a novel process, relating numerous pieces of text, comparing and contrasting their information, and then generating the output of that analysis, could the output be considered usable as training data?
I would set such a model to make a list of concepts and then generate a Wikipedia-like article on each of them based on source materials obtained with a search engine. The model can tell whether the topic is controversial or settled, what the distribution of human responses looks like, and whether they are consistent or contradictory; it can report on the controversy in general, and also report on the common elements that everyone agrees upon. See the sketch below.
It would be like writing a report or an analysis. It could help reduce hallucinations and bias, while sidestepping copyright infringement because it adds a new purpose and a layer of analysis on top of the source materials, and carefully avoids replicating original expression.
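A rough sketch of that pipeline, with the search engine and model call left as stand-in parameters since they depend on whichever services you actually use; everything else is plain prompt construction:

```python
# Sketch of the report-style pipeline described above. The search and model
# calls are passed in as functions because they depend on your search engine
# and LLM API; the rest is just string work.

from typing import Callable

ARTICLE_PROMPT = """Using only the numbered sources below, write an
encyclopedia-style article on "{concept}".
- Summarize the points on which the sources agree.
- If the sources contradict each other, say the topic is contested, summarize
  each position, and note roughly how many sources take each side.
- Paraphrase rather than copying sentences; cite sources by number.

Sources:
{sources}
"""

def write_article(
    concept: str,
    search: Callable[[str], list[str]],   # concept -> list of source texts
    complete: Callable[[str], str],       # prompt -> model output
) -> str:
    sources = search(concept)
    numbered = "\n\n".join(f"[{i + 1}] {s}" for i, s in enumerate(sources))
    return complete(ARTICLE_PROMPT.format(concept=concept, sources=numbered))

def build_encyclopedia(
    concepts: list[str],
    search: Callable[[str], list[str]],
    complete: Callable[[str], str],
) -> dict[str, str]:
    # One article per concept; the prompt pushes the model toward analysis
    # and comparison rather than reproduction of the source text.
    return {c: write_article(c, search, complete) for c in concepts}
```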
I'm not sure; it depends on the cost. If they charge per token, a large context will mostly be irrelevant in practice. For some reason, the article did not mention pricing.
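As a rough illustration of why per-token pricing matters here (the numbers below are invented purely for the example):

```python
# Back-of-the-envelope cost per query. The price is made up for illustration;
# real per-token pricing varies by provider and changes often.
price_per_million_input_tokens = 1.00     # hypothetical, in dollars
full_library_context = 1_000_000          # tokens
rag_trimmed_context = 8_000               # tokens

for label, tokens in [("full-context", full_library_context),
                      ("RAG-trimmed", rag_trimmed_context)]:
    cost = tokens / 1_000_000 * price_per_million_input_tokens
    print(f"{label}: ${cost:.4f} per query for input tokens alone")
```

Whatever the actual rates, the gap between sending a whole library and sending a trimmed set of chunks is a couple of orders of magnitude per query.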
> If you can parse an entire book to identify relevant chunks using RAG and can fit an entire book into a context window, that means you can fit relevant chunks from an entire reference library into the context window too.
And that is very promising.