Google's technology was already working years ago without machine learning. If you look at the object recognition/detection benchmarks, they are already human level, so cameras should be enough (with a good enough GPU). 5 years ago deep learning wasn't working. Algorithms are getting better faster than hardware getting cheaper.