2big2fail_47's comments

I find it interesting that there are all these independent AI-OCR projects but still no commercial offering. Is it still too inaccurate, too complex, or simply too expensive?


I don't know, but maybe existing commercial OCR is still on top and is also using ML. I recently tried a free trial of an OCR service for reading Sütterlin, and it was a weird feeling to be so outclassed at reading.


Mistral offers their OCR commercially through their API and in their Chat services, at least.

https://mistral.ai/news/mistral-ocr




There are commercial OCR offerings from the big cloud providers (plus, like, Adobe). In my experience they generally outperform anything open-weights, although there's been a lot of improvement in VLMs in the past year or two.


One that I’ve seen recently is https://reducto.ai, which appears to be an OCR wrapper.


It is because the AI is not actually doing OCR. It is giving an interpretation of what the text in an image is by ingesting vision tokens and mapping them onto text tokens.

So you either have to be fine with a lot of uncertainty as to the accuracy of that interpretation or you have to wait for an LLM that can do it in a completely reproducible way every time.
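
One rough way to put a number on that uncertainty is to run the same image through the model several times and see how often the transcriptions agree. The sketch below assumes a hypothetical `transcribe` callable standing in for whatever model call is used; it is not any real library's API. Agreement only measures consistency between runs, not whether the text is correct.

```python
from collections import Counter
from typing import Callable, Tuple


def agreement_rate(
    transcribe: Callable[[bytes], str], image: bytes, runs: int = 5
) -> Tuple[str, float]:
    """Run a (hypothetical) OCR callable several times on the same image.

    Returns the most common transcription and the fraction of runs that
    produced it. 1.0 means every run agreed; it does not mean the text is right.
    """
    if runs < 1:
        raise ValueError("runs must be at least 1")
    # Normalize whitespace so trivial spacing differences don't count as disagreement.
    outputs = [" ".join(transcribe(image).split()) for _ in range(runs)]
    text, count = Counter(outputs).most_common(1)[0]
    return text, count / runs
```

A rate well below 1.0 on a given page would be a hint that the output for that page needs a human check.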


great analysis! thank you


for that case i'd just go with plexamp and self-host the music.


yeah it doesn’t work if you try it with https://youtube.com

i really like the idea though


Will investigate why it wasn't working and push out a fix. Thanks for trying it out.


I also got that from the subtext


why do you have a record player with bluetooth in the first place? isn't that against the very concept of an analogue medium?


i think the nytimes landing page does a good job at looking and feeling like an analogue newspaper


Love NTS. Human-curated radio is still the best way to find new music :)


plexamp (in combination with tailscale) works very well :)


i use rsync from my itunes/music.app library to the smb synology drive that acts as my plexamp library. works very well :)
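
a minimal sketch of what such a sync could look like, wrapped in Python so the command can be inspected before it runs. both paths are hypothetical placeholders (a local music folder and an already-mounted SMB share), and the flags are an assumption, not necessarily the ones used in the setup above.

```python
import shlex
from typing import List


def build_rsync_command(
    source: str, destination: str, dry_run: bool = True, delete: bool = False
) -> List[str]:
    """Build an rsync argument list that mirrors `source` into `destination`."""
    # A trailing slash on the source copies the directory's contents,
    # not the directory itself.
    src = source.rstrip("/") + "/"
    cmd = ["rsync", "--archive", "--verbose"]
    if dry_run:
        # Only report what would be transferred; nothing is written.
        cmd.append("--dry-run")
    if delete:
        # Assumption: remove files on the share that no longer exist locally.
        cmd.append("--delete")
    cmd += [src, destination]
    return cmd


if __name__ == "__main__":
    # Hypothetical paths: adjust to the real library folder and mount point.
    cmd = build_rsync_command("/Users/me/Music/Library", "/Volumes/music/library")
    print(shlex.join(cmd))
```

to actually run it, the list can be passed to `subprocess.run(cmd, check=True)` once the dry run looks right.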

