Language models are not designed to know things, they are designed to say things - that's why they are called language models and not knowledge models.
Given the words that have already been generated, it always adds the next word based on how likely that continuation is.
The reason you get different answers each time is the effect of the pseudo-random number generator on picking the next word. The model produces a probability distribution over likely next words, and when the configuration parameter called "temperature" is 0 (and it is actually not possible to set it to 0 in the GUI), there is no random influence: strictly the most likely next word (top-1, i.e. greedy decoding) is always chosen. This leads to output that we would classify as "very boring".
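A minimal sketch of that sampling step, with made-up tokens and probabilities (this is not any real model's API, just the idea):

    import math
    import random

    def pick_next(token_probs, temperature):
        # token_probs: dict of candidate next tokens -> model probability
        if temperature == 0:
            # greedy decoding: always the single most likely token (top-1)
            return max(token_probs, key=token_probs.get)
        # rescale log-probabilities by 1/temperature, then sample
        scaled = {t: math.exp(math.log(p) / temperature)
                  for t, p in token_probs.items()}
        total = sum(scaled.values())
        r = random.random() * total
        for token, weight in scaled.items():
            r -= weight
            if r <= 0:
                return token
        return token  # floating-point edge case fallback

    probs = {"486": 0.5, "386": 0.3, "286": 0.2}  # invented distribution
    print(pick_next(probs, 0))    # always "486"
    print(pick_next(probs, 1.0))  # samples; runs can differ

At temperature 1 the model's distribution is used as-is; below 1 it sharpens toward the top choice, above 1 it flattens, and at exactly 0 the randomness disappears entirely.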
So the model knows nothing about IBM, PS/2, 80286 versus 80486, CPUs, 280, or any models per se. One of the answers seems to suggest that there is no Model 280; I wonder whether that one was generated through another process (there is a way to incorporate user feedback via "reinforcement learning"), or whether it was a consequence of the same randomized next-word picking, just a luckier attempt.
> This leads to output that we would classify as "very boring".
Not really. I set temperature to 0 for my local models, and it works fine.
The reason the cloud UIs don't allow a temperature of 0 is that models then sometimes fall into infinite loops of tokens, and that would break the suspension of disbelief if the public saw it.
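A toy illustration of that looping failure mode, using a hand-written bigram table instead of a real model (the table, tokens, and probabilities are all invented):

    import random

    # made-up bigram "model": the most likely successors form a cycle
    bigram = {
        "the":  {"same": 0.6, "end": 0.4},
        "same": {"the": 0.7, "point": 0.3},
    }

    def generate(start, temperature, steps=8):
        out = [start]
        for _ in range(steps):
            succ = bigram.get(out[-1])
            if succ is None:
                break  # terminal token, nothing follows it
            if temperature == 0:
                nxt = max(succ, key=succ.get)  # greedy: deterministic
            else:
                nxt = random.choices(list(succ),
                                     weights=list(succ.values()))[0]
            out.append(nxt)
        return " ".join(out)

    print(generate("the", 0))    # "the same the same the same ..." forever
    print(generate("the", 1.0))  # sampling can escape via "end" or "point"

With greedy decoding the top-1 choices chase each other in a cycle, so generation never terminates; a little sampling noise is enough to break out, which is one reason hosted UIs keep the temperature strictly above 0.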
You must be using more recent (or just different) models than those I tried. Mine easily returned garbage at temperature 0. (But unfortunately, I can no longer try it and report back.)
This (LLM behaviour and benchmarking at low or zero temperature) would be a topic worth investigating.
> Language models are not designed to know things, they are designed to say things - that's why they are called language models and not knowledge models.
This is true. But you go to Google not to 'have a chat' but ostensibly to learn something grounded in knowledge.
You'd think Google are making an error in swapping the provision of 'knowledge' for 'words', but then again perhaps it makes no difference when it comes to advertising dollars, which is their actual business.