
AlphaGo and AlphaStar both started out trained on human play and then played against versions of themselves, going on to create new strategies in their games. As far as I know, modern LLMs can't learn or experiment in exactly the same way, but that may not always be true.


Yeah, but they had a limited set of rules to work within (they were just hyper-efficient at calculating the possible outcomes relative to those rules). Humans, in theory, only have the rules they believe in, since technically there are no rules (it's all make-believe). For example, what was the "rule" that told people to make a wheel? There wasn't one. A human had to think about it and conceive it, which AI can't do (and I'd argue never will be able to) without rules.


Reinforcement learning is a completely different strategy from how most LLMs are trained: the model improves from a reward signal rather than from labeled next-token targets.
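To make the distinction concrete, here's a minimal, purely illustrative sketch (not AlphaGo's actual algorithm): a softmax policy over a two-armed bandit trained with a REINFORCE-style update. The payoff probabilities and learning rate are made up for the example. Note there is no "correct answer" in the training data anywhere; the policy shifts toward the better arm only because that arm gets rewarded more often, which is the essential difference from supervised next-token prediction.

```python
import math
import random

random.seed(0)

# Hypothetical bandit: arm 1 pays reward 1.0 with prob 0.8, arm 0 with prob 0.2.
PAYOFF = [0.2, 0.8]

def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def train(steps=5000, lr=0.1):
    logits = [0.0, 0.0]  # policy parameters, start indifferent
    for _ in range(steps):
        probs = softmax(logits)
        # Sample an action from the current policy (explore by chance).
        action = 0 if random.random() < probs[0] else 1
        reward = 1.0 if random.random() < PAYOFF[action] else 0.0
        # REINFORCE update: for a softmax policy,
        # d/d logits[i] of log pi(action) = 1[i == action] - probs[i].
        for i in range(2):
            grad = (1.0 if i == action else 0.0) - probs[i]
            logits[i] += lr * reward * grad
    return softmax(logits)

final_probs = train()
```

After training, `final_probs` heavily favors arm 1, even though nothing ever told the policy which arm was "right". Self-play systems like AlphaGo layer far more machinery on top (search, value networks, curricula), but the learn-from-reward loop is the same in spirit.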




