I wonder how efficiently it can do that compared to other systems.
For example, a short iterative function like this:
  bool inMandelbrotSet(complex c)
      complex z = 0
      int steps = 0
      // iterate z -> z^2 + c until it escapes or we hit the step limit
      while (abs(z) < escapeRadius && steps < maxSteps)
          z = z^2 + c;
          steps++;
      return steps == maxSteps;   // never escaped, so (almost certainly) in the set
It can determine with extremely high accuracy whether a point in the complex plane is in the Mandelbrot set or not. I would assume that a NN with the same accuracy would have to be of enormous size, probably with more neurons than there are atoms in the universe.
Smoothness isn't necessary to prove the result, just continuity.
You do need smoothness to prove a bound on the rate of convergence of the basis representation, and given that the boundary of the Mandelbrot set doesn't have a closed-form representation (as far as I'm aware), I think the convergence of the neural network representation would be extremely slow.
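For a rough sense of what smoothness buys you, results of this flavor (Mhaskar-type bounds; the exact constants and hypotheses vary by reference, so treat this as a sketch rather than a precise theorem) say something like

  \[
    f \in C^{s}([0,1]^{d})
    \;\Longrightarrow\;
    \inf_{\hat f_{n} \in \mathcal{N}_{n}} \| f - \hat f_{n} \|_{\infty}
    = O\!\bigl(n^{-s/d}\bigr),
  \]

where \(\mathcal{N}_{n}\) is the class of one-hidden-layer networks with n neurons. For an f that is merely continuous, the universal approximation theorem gives no rate at all, and for a discontinuous target like the exact indicator of the Mandelbrot set the sup-norm error can't even go to zero, so you'd have to settle for accuracy away from the boundary.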
The advantage of neural networks is that they can be trained. You can give a network a set of inputs and desired outputs and do gradient descent on it.
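To make that concrete, here's a minimal sketch (numpy only; the architecture, grid, learning rate, and step count are arbitrary choices of mine) of training a tiny 2-16-1 network by plain gradient descent to predict Mandelbrot membership, with labels generated by the escape-time iteration from upthread:

  import numpy as np

  def in_mandelbrot(c, max_steps=50, radius=2.0):
      z = 0j
      for _ in range(max_steps):
          if abs(z) >= radius:
              return 0.0          # escaped -> not in the set
          z = z * z + c
      return 1.0                  # never escaped within max_steps

  # Training data: points (re, im) on a coarse grid, labeled by the iteration above.
  xs = np.linspace(-2.0, 1.0, 40)
  ys = np.linspace(-1.5, 1.5, 40)
  X = np.array([[a, b] for a in xs for b in ys])                  # (1600, 2)
  Y = np.array([[in_mandelbrot(complex(a, b))] for a, b in X])    # (1600, 1)

  rng = np.random.default_rng(0)
  W1, b1 = rng.normal(0, 1, (2, 16)), np.zeros(16)
  W2, b2 = rng.normal(0, 1, (16, 1)), np.zeros(1)

  def sigmoid(t):
      return 1.0 / (1.0 + np.exp(-t))

  lr = 0.5
  for step in range(2000):
      # forward pass
      H = np.tanh(X @ W1 + b1)          # hidden layer
      P = sigmoid(H @ W2 + b2)          # predicted membership probability
      # backward pass for mean squared error (kept deliberately simple)
      dP = (P - Y) * P * (1 - P) / len(X)
      dW2, db2 = H.T @ dP, dP.sum(0)
      dH = (dP @ W2.T) * (1 - H ** 2)
      dW1, db1 = X.T @ dH, dH.sum(0)
      W1 -= lr * dW1; b1 -= lr * db1
      W2 -= lr * dW2; b2 -= lr * db2

  print("training MSE:", float(np.mean((P - Y) ** 2)))

The point is just the training recipe, not the quality of the fit; a network this small will only pick up the coarse shape of the set.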
Neural networks are essentially like trainable digital circuits. The proof of universality shows that neurons can approximate any kind of logic gate (or any input-output mapping, like a lookup table). A lookup table by itself isn't terribly useful, but you can put them in series and make arbitrary circuits. And that makes (deep) neural networks strictly more powerful than "shallow" methods.
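As a minimal illustration of the gate analogy (hard-threshold neurons with hand-picked weights; nothing trained here):

  # one weighted-sum neuron with a hard threshold reproduces AND, OR and NAND
  def neuron(inputs, weights, bias):
      return 1 if sum(w * x for w, x in zip(weights, inputs)) + bias > 0 else 0

  AND  = lambda a, b: neuron([a, b], [1, 1], -1.5)
  OR   = lambda a, b: neuron([a, b], [1, 1], -0.5)
  NAND = lambda a, b: neuron([a, b], [-1, -1], 1.5)

  # XOR built by wiring NAND neurons in series (a two-layer "circuit")
  def XOR(a, b):
      t = NAND(a, b)
      return NAND(NAND(a, t), NAND(b, t))

  for a in (0, 1):
      for b in (0, 1):
          print(a, b, AND(a, b), OR(a, b), XOR(a, b))

Since NAND by itself is functionally complete, stacking such neurons in layers gives you arbitrary Boolean circuits, which is the "series of lookup tables" picture.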
A very important caveat: being trainable plus universality does not mean a network can be trained to fit any function to arbitrary precision in finite time.
Well of course not. If you could fit any neural network to any function quickly, you would have super powers. But in practice, local optima do seem to stop being an issue in big neural networks.
Also, this article shows a method for constructing a neural network that implements a lookup table, in linear time. So in the worst case you can just memorize the input-to-output table, quickly. In the best case you can fit a simple, elegant model that fits the data perfectly with very few parameters, given unlimited time to search the parameter space. Real-world NNs are somewhere between these two.
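One generic way to see the worst-case memorization point (a sketch of mine, not necessarily the article's construction): build one hard-threshold hidden neuron per table row that fires only on its exact key, and sum the stored values at the output. Construction time is linear in the number of rows:

  def build_memorizer(table):
      """table: dict mapping tuples of 0/1 bits to numeric outputs."""
      hidden = []
      for key, value in table.items():
          weights = [2 * k - 1 for k in key]       # +1 for 1-bits, -1 for 0-bits
          bias = -sum(key) + 0.5                   # fires only when input == key
          hidden.append((weights, bias, value))

      def net(x):
          out = 0.0
          for weights, bias, value in hidden:
              fired = 1 if sum(w * xi for w, xi in zip(weights, x)) + bias > 0 else 0
              out += value * fired
          return out

      return net

  # Usage: memorize a tiny truth table exactly.
  table = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0}   # XOR, as a lookup table
  net = build_memorizer(table)
  print([net(k) for k in table])                          # [0.0, 1.0, 1.0, 0.0]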
That's one of the issues with results from theoretical math: the result is true, but it might not be useful.
As another commenter in the thread talks about, we can get a nearly identical result by using polynomials of extremely large degree, but that doesn't work well because of overfitting. We could come up with a polynomial that also determines with arbitrarily high accuracy whether a point is in the Mandelbrot set or not, but it might end up being of degree 10^100, which is useless.
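A toy 1-D version of the polynomial problem (numpy only; the interval and the 0/1 indicator below are made-up stand-ins for set membership, nothing Mandelbrot-specific): least-squares polynomial fits to a discontinuous indicator keep a large error near the jump no matter how high the degree goes.

  import numpy as np

  x = np.linspace(-2.0, 1.0, 2001)
  y = ((x >= -0.5) & (x <= 0.25)).astype(float)    # discontinuous 0/1 target

  for degree in (5, 25, 100):
      # Chebyshev basis keeps the high-degree least-squares fit well conditioned
      p = np.polynomial.Chebyshev.fit(x, y, degree)
      print(f"degree {degree:3d}: max |error| = {np.max(np.abs(p(x) - y)):.3f}")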