Knowing that you have to do that as a separate step adds a whole additional leve...

		brianjking 4 months ago \| parent \| context \| favorite \| on: Auntie PDF – an open source app built using Mistra... Knowing that you have to do that as a separate step adds a whole additional level of complexity too. For example, if some content has the images and some don't, you need to add whole additional steps to your processing and potentially add hallucinations in. What are you using for document extraction lately, Simon?

I'm really impressed with Gemini - Gemini 2.0 Pro Exp seems remarkably good at even really complex scrappy documents.