I would like to see how it performs with massively warped and skewed scanned tex...

thegabriele · 2025-03-07T11:21:05 1741346465

I use gemini to solve textual CAPTCHAS with those kind of distortions and more: 60% of the time it works every time.

amelius · 2025-03-06T23:05:25 1741302325

Are you trying to build a captcha solver?

SilentM68 · 2025-03-06T23:15:23 1741302923

No, not a captcha solver. When I worked in education, I was given a 90s paper document that a teacher needed OCRd but it was completely warped. It was my job to remediate those type of documents for Accessibility reasons. I had to scan and OCR it but the result was garbage. Mind you I had access to Windows, Linux and MacOS tools but still difficult to do. I had to guess what it said, which was not impossible but it was time-consuming, not doable in the time-frame I was given, so I had no option but to manually retype all the information into a new document and convert it that way. Document remediation and accessibility should be a good use case for A.I., in education.

arcfour · 2025-03-06T18:52:16 1741287136

Garbage in, garbage out?

edude03 · 2025-03-06T18:55:55 1741287355

"Yes" but if a human could do it "AI" should be able to do it too.