Just tested with a multilingual (bidi) English/Hebrew document. The Hebrew outpu...

nicodjimenez · 2025-03-06T18:28:52 1741285732

You can get bounding boxes from our pdf api at Mathpix.com

Disclaimer, I’m the founder

kergonath · 2025-03-06T19:44:35 1741290275

Mathpix is ace. That’s the best results I got so far for scientific papers and reports. It understands the layout of complex documents very well, it’s quite impressive. Equations are perfect, figures extraction works well.

There are a few annoying issues, but overall I am very happy with it.

nicodjimenez · 2025-03-06T21:22:36 1741296156

Thanks for the kind words. What are some of the annoying issues?

kergonath · 2025-03-08T13:44:15 1741441455

I had a billing issue at the beginning. It was resolved very nicely but I try to be careful and I monitor the bill a bit more than I would like.

Actually my main remaining technical issue is conversion to standard Markdown for use in a data processing pipeline that has issues with the Mathpix dialect. Ideally I’d do it on a computer that is airgaped for security reasons. But I haven’t found a very good way of doing it because the Python library wanted to check my API key.

A problem I have and that is not really Mathpix’s fault is that I don’t really know how to store the figures pictures to keep them with the text in a convenient way. I haven’t found a very satisfying strategy.

Anyway, keep up the good work!