You might say, if you can identify and simulate all cases of real life degradation, your problem is basically solved, just reverse the simulation on your inputs.
I’m not saying ocr isn’t hard. I’m saying normalizing all those characters basically is the problem.
I’m not saying ocr isn’t hard. I’m saying normalizing all those characters basically is the problem.