I built a CLI tool for experimenting with Mistral OCR here: https://simonwilliso...

brianjking · 2025-03-08T15:07:04 1741446424

Knowing that you have to do that as a separate step adds a whole additional level of complexity too.

For example, if some content has the images and some don't, you need to add whole additional steps to your processing and potentially add hallucinations in.

What are you using for document extraction lately, Simon?

simonw · 2025-03-08T21:12:02 1741468322

I'm really impressed with Gemini - Gemini 2.0 Pro Exp seems remarkably good at even really complex scrappy documents.

bilater · 2025-03-08T16:02:24 1741449744

Agreed - I am surprised they did are not using Pixtral to read images as well.