The more interesting question IMO is not how good the code can get. It is what must change for the AI to attain the introspective ability needed to say "sorry, I can't think of any more ideas."
You should get decent results by asking for that in the prompt. Just add "if you are uncertain, answer I don't know" or "give the answer or say I don't know", or something along those lines.
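For example, with the OpenAI Python client (a minimal sketch; the model name, prompt wording, and question are just placeholders, not a recommended recipe):

    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    # System prompt with an explicit escape hatch so the model can abstain.
    messages = [
        {"role": "system",
         "content": "Answer the question. If you are uncertain, reply exactly: I don't know."},
        {"role": "user",
         "content": "What year was the first photograph of a unicorn taken?"},
    ]

    response = client.chat.completions.create(model="gpt-4o-mini", messages=messages)
    print(response.choices[0].message.content)  # ideally: "I don't know"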
LLMs are far from perfect at knowing their limits, but they are better at it than most people give them credit for. They just rarely do it unless prompted to.
Fine-tuning can improve that ability further. For example, the thinking-tokens paper [1] is, at some level, training the model to output a special token when it hasn't reached a good answer yet (and then try again, hence "thinking").
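The basic mechanics of that kind of setup look roughly like this (a hypothetical sketch with Hugging Face transformers; the token name, model, and training example are placeholders, not the paper's actual recipe):

    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    # Register a dedicated "not there yet" token and grow the embedding table for it.
    tokenizer.add_special_tokens({"additional_special_tokens": ["<|rethink|>"]})
    model.resize_token_embeddings(len(tokenizer))

    # Fine-tuning targets would insert <|rethink|> wherever a draft answer was judged
    # inadequate, so the model learns to emit it and then continue reasoning.
    example = "Q: 17 * 24 = ? A: 398 <|rethink|> 17 * 24 = 408"
    ids = tokenizer(example, return_tensors="pt").input_ids
    loss = model(ids, labels=ids).loss  # standard next-token loss over the augmented text

The point is just that "I'm not confident in this answer" becomes an ordinary token the model can learn to predict, rather than something it has to be coaxed into saying.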