Plus, it's trivially parallelizable across many drones, and it's not even using normal RL techniques to accelerate learning. Training a drone controller is more or less a solved problem; the interesting question here is whether all the crash datapoints are useful.