I can't speak for voice data, as I've not worked with voice, but I did my MSc on various approaches to reducing error rates in OCR. I used a mix of synthetically degraded data, ranging from applying different kinds of noise to physically degrading printed pages (crumpling, rubbing sand on them, water damage), and while it gave interesting comparative results between OCR engines, the errors it produced never closely matched the errors I found in genuinely degraded old books. I've seen that in other areas too.
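For a rough idea, a minimal sketch of the "apply noise" end of that spectrum might look like the code below (filenames and parameter values are made up for illustration, not taken from my actual experiments):

    # Toy sketch of synthetic degradation for a scanned page, using Pillow + NumPy.
    # Real physical damage (crumpling, water staining) is much harder to approximate.
    import numpy as np
    from PIL import Image, ImageFilter

    def degrade(img, blur_radius=1.5, noise_sigma=20, seed=0):
        """Apply Gaussian blur plus additive Gaussian noise to a grayscale page."""
        rng = np.random.default_rng(seed)
        blurred = img.convert("L").filter(ImageFilter.GaussianBlur(blur_radius))
        arr = np.asarray(blurred, dtype=np.float32)
        noisy = arr + rng.normal(0, noise_sigma, arr.shape)
        return Image.fromarray(np.clip(noisy, 0, 255).astype(np.uint8))

    page = Image.open("page.png")              # hypothetical input scan
    degrade(page).save("page_degraded.png")    # feed this to the OCR engine under test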

My takeaway from that was that while synthetic degradation of inputs can be useful, and while it is "easy", the hard part is making it match real degradation closely enough to be representative. It's often really hard to replicate natural noise closely enough for those kinds of methods to be sufficient.

Doesn't mean it's not worth trying, but I'd say that unless voice is very different, it's the kind of thing that's mostly worth doing if you can't get your hands on anything better.



You might say that if you can identify and simulate all cases of real-life degradation, your problem is basically solved: just reverse the simulation on your inputs.

I'm not saying OCR isn't hard. I'm saying normalizing all those characters basically is the problem.


This isn't quite true if e.g. there are degenerate cases, where distinct clean inputs map to the same degraded output, so the simulation can't be uniquely reversed.
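A toy sketch of what I mean (numbers invented purely for illustration):

    # A many-to-one degradation cannot be reversed, no matter how well you simulate it.
    # Here, hard binarization (a crude stand-in for a badly thresholded scan) maps two
    # different glyph patches to the same output.
    import numpy as np

    def degrade(patch):
        """Binarize a grayscale patch at a fixed threshold."""
        return (patch > 128).astype(np.uint8)

    light_glyph = np.array([[200, 40], [40, 200]])    # hypothetical faint stroke
    dark_glyph  = np.array([[255, 100], [100, 255]])  # hypothetical heavy stroke

    assert np.array_equal(degrade(light_glyph), degrade(dark_glyph))
    # Both inputs collapse to the same degraded patch, so no inverse exists.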



