Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
argonaut
on March 9, 2016
|
parent
|
context
|
favorite
| on:
AlphaGo beats the world champion Lee Sedol in firs...
It is. Value and policy networks are nonlinear approximators for value and policy functions.
You're making the mistake of assuming anything about how the human brain learns.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
You're making the mistake of assuming anything about how the human brain learns.