Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It is. Value and policy networks are nonlinear approximators for value and policy functions.

You're making the mistake of assuming anything about how the human brain learns.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: