I am 100% certain that the training of such an AI will result in winning a game ...

lordnacho · on Feb 16, 2024

I knew it, I knew it! It would be a Spiffing Brit video.

That guy is a genius at finding exploits in computer games. I don't know how he does it, I think you need to play a fair bit of each game before you find these little corners of the ruleset.

boppo1 · on Feb 17, 2024

Idk maybe he uses some sort of fuzzer

jdietrich · on Feb 16, 2024

If you train the model purely based on win rate, sure. Fortunately, we can efficiently use RLHF to train a model to play in a human-like way and give entertaining matches.

Aerroon · on Feb 16, 2024

But wouldn't this be amazing for the developer to fix a lot of edge cases/bugs?

Baeocystin · on Feb 16, 2024

Maybe, maybe not. The stochastic, black-box nature of the current wave of ML systems gives me a gut feeling that using them like this is more of a Monkey's Paw wish granter than useful tool without a lot of refinement first. Time will tell!