Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Knowing that you have to do that as a separate step adds a whole additional level of complexity too.

For example, if some content has the images and some don't, you need to add whole additional steps to your processing and potentially add hallucinations in.

What are you using for document extraction lately, Simon?



I'm really impressed with Gemini - Gemini 2.0 Pro Exp seems remarkably good at even really complex scrappy documents.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: