
Worse for the corpus of English text I tried on it; it doesn't seem to recognize punctuation at all, and it's marginally worse at telling I/1/l apart in sans-serif text (which, to be fair, trips up humans too).

Those were the only two relative deficiencies I noticed.

It does seem to beat Tesseract on samples with mixed dark-on-light and light-on-dark text, but that was the only big win I saw in my brief look at it.


