
How does this compare to Tesseract?



Worse on the corpus of English text I tried it on: it doesn't seem to recognize punctuation at all, and it's marginally worse at telling apart I/1/l in sans-serif text (which, to be fair, trips up humans too).

Those were the only two relative deficiencies I noticed.

It does seem to beat Tesseract on samples with mixed dark-on-light and light-on-dark text, but that was the only big win I saw in my brief look at it.



