LLMs aren’t deterministic even with a seed.

jsheard · 2025-09-14T21:15:25 1757884525

Doesn't that depend on the implementation? There's a trade-off between performance and determinism for sure, but if determinism is what you want then it should be possible.

jb1991 · 2025-09-14T21:18:46 1757884726

If you fix random seeds, disable dropout, and configure deterministic kernels, you can get reproducible outputs locally. But you still have to control for GPU non-determinism, parallelism, and even library version differences. Some frameworks (like PyTorch) have flags (torch.use_deterministic_algorithms(True)) to enforce this.

geor9e · 2025-09-14T21:17:23 1757884643

what if you set top_p=1, temperature=0, and always run it on the same local hardware

mkarrmann · 2025-09-14T22:22:09 1757888529

Horace He at Thinking Machines just dropped an awesome article describing exactly this: https://thinkingmachines.ai/blog/defeating-nondeterminism-in...

TL;DR: assuming you've squashed all regular non-determinism (itself a tall ask), you either need to ensure you always batch requests deterministically, or ensure all kernels are "batch invariant" (which is absolutely not common practice to do).

daemonologist · 2025-09-14T21:21:51 1757884911

Maybe if you run it on CPU. (Maybe on GPU if all batching is disabled, but I wouldn't bet on it.)

mrheosuper · 2025-09-15T03:56:57 1757908617

cosmic wave will get you

worble · 2025-09-14T22:51:12 1757890272

Yes, that's the joke

jb1991 · 2025-09-14T21:15:56 1757884556

This. I’m still amazed how many people don’t understand how this technology actually works. Even those you would think would have a vested interest in understanding it.