I'm not sure why the comment is downvoted!
As far as I can tell, the OP did not specifically ask for open-source solutions; at least, that is how I read it.
Let me break it down!
As one of the commenters mentioned, they use four different tools to parse PDFs and cover the common parsing cases: tables, tables with images, OCR, layouts, handwriting, etc.
With LLMWhisperer, you don't need four separate tools for that.
Parsing is just one part of the problem. Engineers still need to figure out which LLMs work or are sufficient, reduce costs (tokens), keep performance acceptable at scale (parsing a million pages), and make the AI stack production-ready.
LLMWhisperer at least handles most parsing use cases and gets out of your way fast.
Also, LLMWhisperer is not open-source; its API is charged based on pages parsed.
https://unstract.com/llmwhisperer/
Try the demo in the playground: https://pg.llmwhisperer.unstract.com/
Quick tutorial: https://unstract.com/blog/extract-table-from-pdf/