
Just for fun, I accessed GPT-4 through nat.dev and tried one of the questions:

"Mable’s heart rate at 9 AM was 75 bpm and her blood pressure at 7 PM was 120/80. She died at 11 PM. Was she alive at noon?"

I provided no system context; temperature was 0.5.

The response I got was:

"As an AI, I don't have real-time monitoring capabilities. Based on the information provided, Mable was alive at 9 AM and 7 PM. There is no information provided about her condition at noon."

I then changed the system prompt from nothing to "A helpful assistant who is good at reasoning" and ran it again. The result was:

"Based on the information provided, it can be reasonably assumed that Mable was alive at noon. Her heart rate was recorded at 9 AM and her blood pressure was recorded at 7 PM, both of which indicate that she was alive at those times. Therefore, it is logical to infer that she was also alive at noon, which falls between these two times. However, without direct evidence, it is impossible to say with absolute certainty."



> "There is no information provided about her condition at noon."

> "... However, without direct evidence, it is impossible to say with absolute certainty."

GPT is correct in both cases: she might have died just before noon and been resuscitated after noon. After all, the fact that we know her blood pressure suggests she might be in hospital, so it's not even an unlikely scenario.


Shades of the "Fair Witness" from Stranger in a Strange Land.

https://en.wikipedia.org/wiki/Stranger_in_a_Strange_Land#Fai...


The next prompt would be "explain how it would be possible for her to be alive and why it is also possible for her to be dead" or "what sort of additional information do you need to establish whether she is alive or dead", etc. See if GPT produces a plausible scenario as evidence of its reasoning.

Just stating "we need more data" is not stating why, and the why is critical for reasoning. You need to produce an example of how she could be dead and how she could be alive. And preferably state how this apparent paradox can be resolved.
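If you wanted to script that follow-up rather than paste it into a playground, it's one extra turn in the same conversation. A minimal sketch, assuming the OpenAI Python client; the plumbing is mine, and the follow-up prompt paraphrases the ones suggested above:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

messages = [
    {"role": "system", "content": "A helpful assistant who is good at reasoning"},
    {
        "role": "user",
        "content": "Mable's heart rate at 9 AM was 75 bpm and her blood pressure "
        "at 7 PM was 120/80. She died at 11 PM. Was she alive at noon?",
    },
]
first = client.chat.completions.create(
    model="gpt-4", messages=messages, temperature=0.5
)

# Feed the model its own answer back, then press it for the "why".
messages.append({"role": "assistant", "content": first.choices[0].message.content})
messages.append({
    "role": "user",
    "content": "Explain how it would be possible for her to be alive at noon and "
    "how it would be possible for her to be dead, and what additional information "
    "would establish which is the case.",
})
second = client.chat.completions.create(
    model="gpt-4", messages=messages, temperature=0.5
)
print(second.choices[0].message.content)
```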

My guess is that GPT will either chase its own tail or backtrack, neither of which would support your hypothesis that it figured out both the example and the counterexample.


if she was dead at noon, the most she could be by 7 was undead


Yes, it is a very interesting aspect of these models that they have "low self-esteem" (of course, not in the literal human sense) in their own abilities until you give them a pep talk: "don't worry LLM, you got this."


Good analogy. I have also noticed it in other scenarios, for example when I want it to tell me if I'm wrong in an assumption, or when I want its opinion on some approach I'm thinking of taking. It is too nice, almost like it doesn't want to impose its thoughts or hurt my feelings lol.


A more generalized approach than encouraging them is simply telling them who to model themselves after.

A graphic designer with 10 years' experience talking to another career designer will be opinionated about selecting a specific font, for example. But a persona-less AI trained to be helpful to all people should not be so opinionated, since a beginner might be misled into thinking there's only one right font for their situation, rather than the assortment of guidelines and intuition most professionals have.
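As a sketch of what that looks like in practice, again via the OpenAI client; the persona wording and the user prompt are hypothetical, not from any particular product:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

# Hypothetical persona: one working designer advising another, which
# licenses the model to give a specific, opinionated recommendation
# instead of a hedged survey of options.
PERSONA = (
    "You are a graphic designer with 10 years of professional experience, "
    "advising a fellow career designer. Give specific, opinionated "
    "recommendations, not a survey of options."
)

response = client.chat.completions.create(
    model="gpt-4",
    temperature=0.5,
    messages=[
        {"role": "system", "content": PERSONA},
        {"role": "user", "content": "Which typeface should I use for a law firm's letterhead?"},
    ],
)
print(response.choices[0].message.content)
```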


It's a reticence to hallucinate.


Eh...I died once, and it got better, so I'd say that in this case GPT-4 is technically correct. And as we all know, technically correct is the best kind of correct.



