argonaut on March 9, 2016 \| on: AlphaGo beats the world champion Lee Sedol in firs...

It is. Value and policy networks are nonlinear approximators for value and policy functions. You're making the mistake of assuming anything about how the human brain learns.
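
To make "nonlinear approximator" concrete, here is a toy sketch, not AlphaGo's actual architecture (which used deep convolutional networks). Everything in it is an assumption for illustration: the names `TinyNet`, `policy_net`, and `value_net`, the layer sizes, and the random untrained weights. The policy network maps a state to a probability distribution over moves; the value network maps a state to a single expected-outcome score in [-1, 1].

```python
import math
import random


def make_layer(n_in, n_out, rng):
    # Random weights stand in for parameters that training would learn.
    return [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]


def apply_layer(weights, x):
    # Plain matrix-vector product: one output per weight row.
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def softmax(z):
    # Subtract the max for numerical stability before exponentiating.
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


class TinyNet:
    """Hypothetical one-hidden-layer network with a tanh nonlinearity."""

    def __init__(self, n_in, n_hidden, n_out, seed):
        rng = random.Random(seed)
        self.w1 = make_layer(n_in, n_hidden, rng)
        self.w2 = make_layer(n_hidden, n_out, rng)

    def forward(self, x):
        hidden = [math.tanh(v) for v in apply_layer(self.w1, x)]
        return apply_layer(self.w2, hidden)


N_STATE, N_MOVES = 4, 3  # made-up sizes for a toy game

policy_net = TinyNet(N_STATE, 8, N_MOVES, seed=0)
value_net = TinyNet(N_STATE, 8, 1, seed=1)


def policy(state):
    # Approximates the policy function: state -> distribution over moves.
    return softmax(policy_net.forward(state))


def value(state):
    # Approximates the value function: state -> expected outcome in [-1, 1].
    return math.tanh(value_net.forward(state)[0])


state = [0.5, -0.2, 0.0, 1.0]
move_probs = policy(state)
score = value(state)
```

Nothing here says anything about brains; it is just two parameterized functions whose weights would be fit to data.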