> With a little more effort we can establish the same for any enumerable set of axioms.
This is a case of presentism; thanks to Gödel's work, we accept the costs that his findings imply.
RE is semi-decidable, meaning we can confirm yes-instances in finite time, but on no-instances the procedure may never halt.
RE ∩ co-RE = R, but the author is only referencing RE.
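For intuition, here is a minimal Python sketch of what semi-decidability buys you: a one-sided procedure that confirms yes-instances by witness search but can only fail silently on no-instances. The toy predicate is my own illustration (and is actually decidable); genuinely RE-complete sets like the halting problem admit only this one-sided behaviour.

```python
from itertools import count

def semi_decide(has_witness, candidates):
    """Semi-decision by witness search: halts with True on yes-instances,
    loops forever on no-instances (no reliable negation)."""
    for c in candidates:
        if has_witness(c):
            return True  # yes-instance confirmed in finite time
    # unreachable for an infinite candidate stream: we can never return False

# Toy instance: does x*x == n have a solution over the naturals?
def is_square(n):
    return semi_decide(lambda x: x * x == n, count(0))

print(is_square(49))   # halts: True
# is_square(50) would loop forever -- failure is the only "negation"
```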
Failure as negation does have its place, but it is also what will probably block strong AI.
Also note that Presburger arithmetic (FOL with + and =; the multiplicative analogue with × and = is Skolem arithmetic) is fully decidable. But deciding it requires doubly exponential time in the worst case (Fischer-Rabin), and doubly exponential time isn't practical.
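To see why "decidable but doubly exponential" is cold comfort, here is a trivial sketch of how 2^(2^(cn)) blows up; the constant c is set to 1 purely for illustration.

```python
# The Fischer-Rabin lower bound for Presburger arithmetic grows like
# 2^(2^(c*n)) in the formula length n. With c = 1 (illustration only):
def double_exp(n, c=1):
    return 2 ** (2 ** (c * n))

for n in range(1, 6):
    print(n, double_exp(n))
# Already at n = 5 this is 2^32; at n = 9 it exceeds ~10^150, far more
# than the number of atoms in the observable universe (~10^80).
```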
Same with attention in feed-forward networks, which requires exponential time and comes with a reduction in expressability.
This is a lower bound for negation by exhaustion in zeroth-order (propositional) logic, via the Strong Exponential Time Hypothesis.
The loss of reliable negation (¬) in all but the weakest logics has, in the general case, huge practical implications for all but the most trivial problems.
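"Negation by exhaustion" can be made concrete with a brute-force UNSAT check: to certify that no satisfying assignment exists, you check all 2^n of them, and SETH conjectures that nothing substantially faster exists in general. A minimal sketch (clause encoding is my own):

```python
from itertools import product

def brute_force_unsat(clauses, n_vars):
    """Certify the negation (UNSAT) of a CNF formula by exhaustion:
    each clause is a list of (variable_index, is_positive) literals."""
    for assignment in product([False, True], repeat=n_vars):
        if all(any(assignment[v] if pos else not assignment[v]
                   for v, pos in clause)
               for clause in clauses):
            return False  # found a model, so the negation fails
    return True  # exhausted all 2^n assignments: certified UNSAT

# (x0 or x1) and (not x0 or x1) and (not x1) -- unsatisfiable
clauses = [[(0, True), (1, True)], [(0, False), (1, True)], [(1, False)]]
print(brute_force_unsat(clauses, 2))  # True, but at 2^n cost
```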
Unless you are lucky enough to have a fully decidable (i.e., recursive) theory, the costs are high.
PA was shown to be consistent via transfinite induction (Gentzen), BTW. But that proof works because the ordinals up to ε₀ are well-ordered.
But transformers with soft attention being limited to solving problems in TC^0, with failure as negation, is a very real cost of Gödel's incompleteness theorems.
How relevant are these considerations for practical use cases?
Specifically, for all practical purposes, it is sufficient to have probabilistic guarantees. Suppose an AI were able to generate mathematical proofs as well as humans; it wouldn't really matter if in theory "they are limited to solving problems in TC^0 with failure as negation".
While TC^0 can divide or multiply (through, say, FFT-like circuits), I am pretty sure it can't do counters or addition, which would require TC^1, i.e. log-depth threshold circuits.
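Just to pin down the terms being argued over (not to settle the question): the building block of both TC^0 and TC^1 is the threshold gate, of which MAJORITY is the canonical example. A single gate, sketched in Python:

```python
def majority(bits):
    """One threshold gate: outputs 1 iff at least half the inputs are 1.
    Constant-depth, poly-size circuits of such gates define TC^0;
    O(log n)-depth circuits of them define TC^1."""
    return int(2 * sum(bits) >= len(bits))

print(majority([1, 0, 1, 1]))  # 1
print(majority([1, 0, 0, 0]))  # 0
```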
> We prove that the time complexity of self-attention is necessarily quadratic in the input length, unless the Strong Exponential Time Hypothesis (SETH) is false. This argument holds even if the attention computation is performed only approximately, and for a variety of attention mechanisms.
Thanks! But quadratic time complexity (which I had heard attention requires, and which doesn't surprise me) is not exponential time.
I acknowledge that they relate this to the SETH assumption (which surprises me a little). But this doesn't mean that transformers take exponential time. I don't think I understood what you meant by the "with a reduction in expressability" part of the statement, so maybe that is why I'm not following.
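For concreteness, here is a minimal pure-Python sketch of soft (softmax) attention; the n x n score matrix is exactly where the quadratic term in the quoted lower bound comes from. Dimensions and values below are arbitrary illustrations, not anyone's actual model.

```python
import math

def soft_attention(Q, K, V):
    """Soft attention over n queries/keys/values of dimension d.
    Computing the n x n score matrix is the Theta(n^2) step that the
    quoted SETH-based result says cannot be avoided in general."""
    d = len(Q[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in K]                       # n dot products per query
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        weights = [w / z for w in weights]          # softmax row
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out  # each output row is a convex combination of value vectors

# Tiny example: 3 tokens, dimension 2
Q = K = V = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out = soft_attention(Q, K, V)
```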