LLM-based OCR is a disaster: great potential for hallucinations and no estimate of confidence. Results might seem promising, but you'll always be wondering.


Absolutely right - we tried estimating LLM confidence and the results are not great; a sketch of one token-probability approach is below. Any process that requires reliability will struggle with LLM OCR.

https://news.ycombinator.com/item?id=43350816
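For anyone curious what "estimating confidence" can look like in practice, here is a minimal sketch that reads per-token log-probabilities out of a Transformer OCR model via Hugging Face transformers and flags low-probability tokens. The model name and the 0.9 threshold are illustrative assumptions, not recommendations.

    # Minimal sketch: per-token confidence from a seq2seq OCR model
    # (TrOCR via Hugging Face transformers). Model choice and the 0.9
    # threshold are illustrative assumptions.
    import torch
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained("microsoft/trocr-base-printed")
    model = VisionEncoderDecoderModel.from_pretrained("microsoft/trocr-base-printed")

    image = Image.open("line.png").convert("RGB")  # a single text-line crop
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    out = model.generate(
        pixel_values,
        output_scores=True,            # keep the logits for each generated token
        return_dict_in_generate=True,
    )

    # Log-probabilities of the tokens the model actually chose
    scores = model.compute_transition_scores(
        out.sequences, out.scores, normalize_logits=True
    )

    tokens = processor.tokenizer.convert_ids_to_tokens(out.sequences[0].tolist())
    for tok, logp in zip(tokens[1:], scores[0]):   # skip the decoder start token
        p = torch.exp(logp).item()
        flag = "  <-- low confidence" if p < 0.9 else ""
        print(f"{tok!r}: p={p:.3f}{flag}")

The catch, consistent with the parent's experience: these token probabilities tend to be poorly calibrated, so a hallucinated character can still come back with high probability.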


CNN-based OCR also has "hallucinations", and Transformers aren't that much different in that respect. This is a problem solved with domain-specific post-processing; see the sketch below.
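A minimal sketch of what such post-processing can look like: repair common OCR character confusions in a numeric field, then accept a reading only if it passes a domain checksum. The confusion map, the Luhn check, and the sample string are all illustrative assumptions, not any particular product's rules.

    # Sketch: domain-specific post-processing for a numeric field.
    # Map characters OCR commonly mistakes for digits back to digits,
    # then accept the reading only if a domain checksum (Luhn, as an
    # example rule) validates it.
    CONFUSIONS = {"O": "0", "o": "0", "l": "1", "I": "1", "S": "5", "B": "8"}

    def normalize(raw: str) -> str:
        """Strip whitespace and repair common digit confusions."""
        return "".join(CONFUSIONS.get(c, c) for c in raw if not c.isspace())

    def luhn_ok(digits: str) -> bool:
        """Standard Luhn check-digit validation."""
        if not digits.isdigit():
            return False
        total = 0
        for i, ch in enumerate(reversed(digits)):
            d = int(ch)
            if i % 2 == 1:       # double every second digit from the right
                d *= 2
                if d > 9:
                    d -= 9
            total += d
        return total % 10 == 0

    raw = "4S39 l48O Ol69 9999"   # hypothetical OCR output
    candidate = normalize(raw)     # -> "4539148001699999"
    print(candidate, "valid" if luhn_ok(candidate) else "rejected")

The point is that a domain constraint turns a silent substitution error into either an automatic repair or an explicit rejection, rather than a plausible-looking wrong answer.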


Well, already in 2013, OCR systems used in Xerox scanners (turned on by default!) randomly altered numbers, so it's not an issue occurring only in LLMs.



