Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Cool! Which OCR engine/model do you use?


EasyOCR, may switch to paddleOCR in the future.


You may want to try https://github.com/VikParuchuri/surya (I'm the author). I've only benchmarked against tesseract, but it outperforms it by a lot (benchmarks in repo). Happy to discuss.

You could also try https://github.com/VikParuchuri/marker for general PDF parsing (I'm also the author) - it seems like you're more focused on tables.


How does surya compare to AWS Textract? A previous employer went through a bunch of different OCRs and ended up using Textract because they found it to be the most accurate overall.


I unfortunately haven't had time to benchmark against more than tesseract.


That’s my experience as well. I am still looking for alternatives, but Textract is now the baseline.


Thanks for sharing.


PaddleOCR works pretty well, how are you planning to integrate it in your workflow? I found huge differences in throughput between python serving and frameworks (i.e. NVIDIA Triton Inference Server).


Grateful for your insight! Could you explain the reason for the switch? Is there any benchmark data available for sharing?


Performance depend on the language / type of docs. Main reason for contemplating switching is that easyOCR seems to not be maintained anymore (no commit in the repo in last 5 months)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: