Just because Apple includes it in one of their prompts doesn't mean it improves ...

jsheard · 2024-09-15T23:27:27 1726442847

It seems plausible that stressing the importance of the system prompt instructions might do something, but I don't see how telling the model not to hallucinate would work. How could the model know that its most likely prediction has gone off the rails, without any external point of reference?

og_kalu · 2024-09-15T23:43:01 1726443781

Internally, LLMs know a whole lot more about the truth and uncertainty of their prediction than the say. Pushing that to words is difficult but not impossible.

https://news.ycombinator.com/item?id=41504226

jshmrsn · 2024-09-15T23:36:38 1726443398

Some of the text that the LLM is trained on is fictional, some of the text that its trained on is factual. Telling it to not make things up can tell it to generate text that’s more like the factual text. Not saying it does work, but this is a reason how it might work.

viraptor · 2024-09-15T23:37:12 1726443432

The model can be trained to interpret "don't hallucinate" as "refer only to the provided context and known facts, do not guess or extrapolate new information", which wouldn't get rid of the issue completely, but likely would improve the quality if that's what you're after and if there's enough training data for "I don't know" responses.

(But it all depends on the fine-tuning they did, so who knows, maybe it's just an Easter egg)

potatoman22 · 2024-09-16T02:33:28 1726454008

I think it's more likely that it's included for liability reasons.

tkz1312 · 2024-09-16T03:21:07 1726456867

I’ve had pretty good experience with it personally. It quite often just tells me it doesn’t know or isn’t sure instead of just making something up.

mrfinn · 2024-09-16T04:13:49 1726460029

I did something similar and to my surprise effectively made the LLM in my tests admit when they don't know something. Not always but worked sometimes. I don't prompt "don't hallucinate" but "admit when you don't know something". It's a logical thing in the other hand, many prompts just transmit the idea of being "helpful" or "powerful" to the LLMs without any counterweight idea. So the LLM tries to say something "helpful" in any case.

magicalhippo · 2024-09-16T07:19:50 1726471190

Playing around with local models, Gemma for example will usually comply when I tell it "Say you don't know if you don't know the answer". Others, like Phi-3, completely ignores that instruction and confabulates away.

fkyoureadthedoc · 2024-09-16T15:35:05 1726500905

Stop trying to make f̶e̶t̶c̶h̶ confabulate happen, it's not going to happen.

astrange · 2024-09-16T05:51:59 1726465919

It does help if you train the model to make it help.

wkat4242 · 2024-09-15T23:38:17 1726443497

Yeah and some of the other prompts were misspelled and of doubtful use:

> In order to make the draft response nicer and complete, a set of question [sic] and its answer are provided," reads one prompt. "Please write a concise and natural reply by modify [sic] the draft response," it continues.

This really sounds like a placeholder made up by one engineer until a more qualified team sits down and defines it.

astrange · 2024-09-16T08:51:03 1726476663

That's not a big problem since it will understand it, and if they already fine tuned the model to work with that prompt it'd get harder to change.

wkat4242 · 2024-09-16T11:07:39 1726484859

I just don't think Apple would release something like this. They're the company that laser engraves their screws because of their attention to detail.

NavinF · 2024-09-16T17:09:11 1726506551

Which apple screws are laser engraved?

wkat4242 · 2024-09-17T00:31:52 1726533112

The ones on the MacBook Pro used to be. At least were when I still used Apple until 2015 or so.

The butterfly keyboards were unusable to me and also the OS got too locked down so I left the platform.