Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

We have LeelaZero numbers: https://ogs-forums.s3.amazonaws.com/original/2X/2/21de8caa52...

The plateau is real. I'm not sure if continuous self-play will lead to continuous progress for all of eternity. The system is clearly slowing down in self-learning.

AlphaZero's plateau is also well documented: https://i.imgur.com/NMNp6Kq.png

--------

I'm not trying to cast doubt upon reinforcement learning / MCTS / Neural Nets in the game of Go. It is clearly the best methodology we got today.

But anyone who has any experience with neural nets knows about the local-maxima problem. ALL neural nets reach a local maxima eventually. Once this point is reached, you have to rely upon other methodologies to improve playing strength.

Assuming Elo-growth for all time using a singular methodology is naive. We will go very, very far with Deep Learning, but are you satisfied with that limit? Other open questions remain: Go is very far away from being a solved game, even with a magical machine that plays 2000+ Elo stronger than humans.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: