I guess this would be an ideal use case for a generative adversarial network based approach.
Though coming soon: Neural networks to determine whether speech is NN-generated? :P