Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is incredibly exciting. I've been pondering/experimenting on a hobby project that makes reading papers and textbooks easier and more effective. Unfortunately the OCR and figure extraction technology just wasn't there yet. This is a game changer.

Specifically, this allows you to associate figure references with the actual figure, which would allow me to build a UI that solves the annoying problem of looking for a referenced figure on another page, which breaks up the flow of reading.

It also allows a clean conversion to HTML, so you can add cool functionality like clicking on unfamiliar words for definitions, or inserting LLM generated checkpoint questions to verify understanding. I would like to see if I can automatically integrate Andy Matuschak's Orbit[0] SRS into any PDF.

Lots of potential here.

[0] https://docs.withorbit.com/



>a UI that solves the annoying problem of looking for a referenced figure on another page, which breaks up the flow of reading.

A tangent but this exact issue is what I was frustrated for a long time with pdf reader and reading science papers. Then I found sioyek that pops up a small window when you hover over links (references and equations and figures) and it solved it.

Granted, the pdf file must be in right format, so OCR could make this experience better. Just saying the UI component of that already exist

https://sioyek.info/


Zotero's PDF viewer also does this now. Being able to annotate PDFs and having a reference manager has been a life saver.


Thanks for the link! Good to know someone is working on something similar.


Wait does this deal with images?


The output includes images from the input. You can see that on one of the examples where a logo is cropped out of the source and included in the result.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: