Oh the interesting part is “our AI could not interpret images of common objects at unusual angles”.
Now that’s fascinating - why not? Is computer vision just boring pattern recognition and really does not have “concepts” underlying it - if so 90% of the AI hype is false?
There are cases where AI can recognise gender on an X-ray when humans can't, and find tumors that experienced doctors can't. This must mean that human doctors looking at X-rays use just boring pattern recognition and AI has actual concepts of what it's seeing.
But does it really? Or is it more observant than a human doctor and more thorough, but only at the limited task of deciding if this X-ray looks like the million other X-rays of a male abdomen versus the million X-rays of a female abdomen.
I assume counting the number of ribs is not what is meant …
“We found that even state-of-the-art models which are optimally performant in data similar to their training sets are not optimal — that is, they do not make the best trade-off between overall and subgroup performance — in novel settings,” Ghassemi says. “Unfortunately, this is actually how a model is likely to be deployed. Most models are trained and validated with data from one hospital, or one source, and then deployed widely.”
It's simple math. If the correlation between gender and sex is 0.99, then a method that can determine your sex with, say, 90% accuracy can determine your gender with roughly 89% accuracy. The difference is negligible.
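The "roughly 89%" can be checked in a couple of lines. A minimal sketch, using the comment's own assumed numbers (the 0.99 agreement rate and 90% classifier accuracy are the commenter's illustration, not measured values):

```python
# Assumptions from the comment above: gender matches sex for 99% of
# patients, and a classifier detects sex from an X-ray with 90% accuracy.
p_match = 0.99   # P(gender == sex)
acc_sex = 0.90   # classifier accuracy on sex

# Binary case: a correct sex prediction also hits the gender when the
# two match; a *wrong* sex prediction hits the gender when they differ.
acc_gender = p_match * acc_sex + (1 - p_match) * (1 - acc_sex)
print(acc_gender)  # ~0.892, i.e. roughly 89%
```

So the 90% sex accuracy degrades only to about 89.2% gender accuracy, which is the "negligible" difference the comment points at.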
Mind that there's a big difference between machine learning (which these robots use) and generative AI, which is what most of the recent hype has been about.
ML is by now mostly a proven technique with known limitations, e.g. being unable to deal correctly with situations not present in the training data. Generative AI is an offshoot of this, where people largely seem to like pretending those known limitations don't apply, for vague reasons.
What? Stable Diffusion doesn't have an underlying understanding, gathered from a vast sea of training data, that humans typically have two arms, two hands and five fingers per hand? That's a bold statement.
IIRC the debate is between two explanations. One: 99% of the time it just predicts that the next pixel will be fleshy and the pixel next to it background, thus making something that looks fingery (and so when presented with an odd angle that 99% drops crazily). Two: somehow an executive function has evolved that has a concept of a finger, with movement, musculature etc.
It’s the “somehow evolved” part that concerns me.
Predictive ability based on billions of images sounds good. Executive function - how does that work? But at some point we are playing “what is consciousness” games.
Would love to hear more rigorous thought than mine - any links gratefully received :-)
I actually agree with you. I was being a bit sarcastic. If I understand correctly, there isn't a fundamental difference between text output and pixel output in this context. If so, then it suddenly sounds much more of a stretch (intuitively) to claim that Stable Diffusion somehow understands the real world (like people claim is the case with language models).
Now that’s fascinating - why not? Is computer vision just boring pattern recognition and really does not have “concepts” underlying it - if so 90% of the AI hype is false?
There must be several PhDs in that at least :-)