What's most interesting to me is that an army of contractors is required to get even the low accuracy levels of voice recognition we have today. The "AI" revolution is pure smoke and mirrors designed entirely to bilk investors out of their dollars. There's been no improvement in "AI" in the last 50 years, except that we have a lot more data to push through the same useless models.
100 bucks for anyone who uses a 50 year model/method and get within twice the word-error-rate of today's speech recognition systems.
Also, how do you explain the progress on ImageNet challenge over the last 10 years? Each method there uses the same dataset, yet error rates (top5) have gone from 30% in 2011 to around 3% in 2016.
Not going to deny that faster GPUs has helped a lot.
But it is not the full picture either. DenseNet-121 has higher accuracy than VGG16, yet requires only 1/15 the storage space, and 1/5 the number of operations during inference. I'm pretty sure training time is faster also.