For whatever it’s worth, in response to the same question posed by me (“what is the ld50 of caffeine”), Google’s AI properly reported it as 150-200 mg/kg.
I asked this about 1 minute after you posted your comment. Perhaps it learned of and corrected its mistake in that short span of time, perhaps it reports differently on every occasion, or perhaps it thought you were a rat :)
The median lethal dose (LD50) of caffeine in humans is estimated to be 150–200 milligrams per kilogram of body mass. However, the lethal dose can vary depending on a person's sensitivity to caffeine, and can be as low as 57 milligrams per kilogram.
Route of administration
Oral 367.7 mg/kg bw
Dermal >2000 mg/kg bw
Inhalation LC50 combined: ca. 4.94 mg/L
That’s half the people in a caffeine-chugging contest falling over dead. The first 911 call would come much, much earlier. I doubt you’d get to 57 mg/kg (roughly 4 grams for a 70 kg adult, on the order of 40 cups of coffee) before someone thought they were having a heart attack (angina).
I just tried it and got a similar answer; we’re posting within minutes of each other.
--
The median lethal dose (LD50) of caffeine in humans is estimated to be 150–200 milligrams per kilogram of body mass. However, the lethal dose can vary depending on a person's sensitivity to caffeine, and can be as low as 57 milligrams per kilogram.
Route of administration | LD50
Oral | 367.7 mg/kg bw
Dermal | 2000 mg/kg bw
Inhalation | LC50 combined: ca. 4.94 mg/L
The FDA estimates that toxic effects, such as seizures, can occur after consuming around 1,200 milligrams of caffeine.
Is this really true? The linear algebra is deterministic, although maybe there is some chaotic behavior from floating-point handling. The non-deterministic part mostly comes from intentionally added randomness, which can be turned off, right?
Maybe the argument is that if you turn off the randomness you don’t have an LLM-like result any more?
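What I mean, roughly (made-up numbers, just to pin down the two modes I’m asking about):

```python
import numpy as np

# Hypothetical per-token scores at one decoding step; not any real model's output.
logits = np.array([2.0, 1.5, 0.3])
probs = np.exp(logits) / np.exp(logits).sum()

greedy_token = int(np.argmax(probs))                  # "randomness off": always the same token
rng = np.random.default_rng()                         # unseeded, so this can vary run to run
sampled_token = int(rng.choice(len(probs), p=probs))  # usual LLM-style sampling

print(probs.round(3), greedy_token, sampled_token)
```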
Floats are deterministic too (this winds up being helpful if you want to do something like test an algorithm on every single float); you just might get different deterministic outcomes on different compilation targets or with threaded intermediate values.
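As an aside, "every single float" is genuinely tractable. A rough sketch using float16 so it runs instantly (float32 is the same idea, just ~4.3 billion bit patterns); the function under test is a throwaway stand-in:

```python
import numpy as np

def f(x):
    # Stand-in for whatever algorithm you're testing; any pure float function works.
    return np.sqrt(np.abs(x)) * np.float16(0.5)

# Every possible float16 value -- including NaNs, infinities, and subnormals --
# obtained by reinterpreting all 2**16 bit patterns.
every_f16 = np.arange(2**16, dtype=np.uint16).view(np.float16)

a = f(every_f16)
b = f(every_f16)

# Bit-for-bit identical on both runs (plain == would trip over NaN != NaN).
assert np.array_equal(a.view(np.uint16), b.view(np.uint16))
```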
The argument is, as you suggest, that without randomness you don't have an LLM-like result any more. You _can_ use the most likely token every time, or beam search, or any number of other strategies to try to tease out an answer. Doing so gives you a completely different result distribution, and it's not even guaranteed to give a "likely" output. Imagine, e.g., a string of tokens that are each 10% likely for any greedy choice, versus a different string where the first token is 9% likely and the remaining nine are 90% each: with a 10-token answer the second option is roughly 350 million times more likely under random sampling, but it will never happen with a simple deterministic strategy, and you can tweak the example slightly to keep beam search and similar from finding good results.
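Plugging in those assumed per-token probabilities:

```python
# The all-greedy string: ten tokens, each the locally best choice at 10%.
p_greedy = 0.10 ** 10             # 1e-10

# The alternative: a 9% first token (so greedy never picks it), then nine 90% tokens.
p_alternative = 0.09 * 0.90 ** 9  # ~0.0349

print(p_alternative / p_greedy)   # ~3.5e8 -- hundreds of millions of times more likely
```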
That brings up an interesting UI/UX question.
Suppose (as a simplified example) that you have a simple yes/no question and only know the answer probabilistically, something like "will it rain tomorrow" with an appropriate answer being "yes" 60% of the time and "no" 40%. Do you try to lengthen the answer to include that uncertainty? Do you respond "yes" always? 60% of the time? To 60% of the users and then deterministically for a period of time for each user to prevent flip-flopping answers?
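Sketching a few of those policies (all hypothetical, just to make the options concrete):

```python
import random

P_YES = 0.60  # assumed model confidence that it will rain

def answer_mode():
    # Always report the more likely answer.
    return "yes"

def answer_sampled():
    # Report "yes" 60% of the time, matching the underlying probability.
    return "yes" if random.random() < P_YES else "no"

def answer_sticky(user_id: int, day: str):
    # Sample once per (user, day) so a given user's answer doesn't flip-flop.
    rng = random.Random(hash((user_id, day)))
    return "yes" if rng.random() < P_YES else "no"
```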
The LD50 question is just a more complicated version of that conundrum. The model isn't quite sure. The question forces its hand a bit in terms of the classes of answers. What should its result distribution be?