Hacker Newsnew | past | comments | ask | show | jobs | submit | more superdocs1's commentslogin

I built Youtube Segment Clipper, a free Chrome extension to save parts of YouTube videos with timestamps and transcripts.

https://chromewebstore.google.com/detail/mkmcjgbighammmaoohb...


SEEKING WORK | Europe | Remote

I'm a freelance consultant specializing in document processing.

Currently building an app that extracts key information from PDFs and automatically highlights the source text for each extracted data point, allowing users to easily verify the accuracy of the extracted information.

I can help with:

  - LLMs, RAG, Structured Extraction
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
Email: superdocsio@gmail.com

Web: https://superdocs.io


Building an app that extracts key information from PDFs + highlights citations. You provide a PDF and a JSON schema defining what to extract, and it returns the extracted values, the citations and their precise locations in the document.

This is especially valuable in workflows where verification of LLM extracted information is critical (e.g. legal and finance). It can handle complex layouts like multiple columns, tables and also scanned documents.

Planning to offer this both as an API and a self-hosted option for organizations with strict data privacy requirements.

Screenshot: https://superdocs.io/highlight.png


SEEKING WORK | Europe | Remote

I'm a freelance consultant specializing in document processing.

Currently building an app that extracts key information from PDFs and automatically highlights the source text for each extracted data point, allowing users to easily verify the accuracy of the extracted information.

I can help with:

  - LLMs, RAG, Structured Extraction
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
Email: superdocsio@gmail.com

Web: https://superdocs.io


SEEKING WORK | Europe | Remote

I'm a freelance consultant specializing in document processing.

Currently building an app that extracts key information from PDFs and automatically highlights the source text for each extracted data point, allowing users to easily verify the accuracy of the extracted information.

I can help with:

  - LLMs, RAG, Structured Extraction
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
Email: superdocsio@gmail.com

Web: https://superdocs.io


Building an app that extracts key information from PDFs + highlights citations.

You provide a PDF and a JSON schema defining what to extract, and it returns the extracted values, the citations and their precise locations in the document.

This is especially valuable in workflows where verification of LLM extracted information is critical (e.g. legal and finance). It can handle complex layouts like multiple columns, tables and also scanned documents.

Planning to offer this both as an API and a self-hosted option for organizations with strict data privacy requirements.

Screenshot: https://superdocs.io/highlight.png

Feel free to get in touch for a demo.


I've been working on some structured OCR tools recently (in the context of reading resume pdfs and allowing much more useful search filters over them than our ATS system allows) and I've found Gemini with structured outputs capable of doing a fantastic job. I'm curious, do you have any rough pointers for how to do this self-hosted?


SEEKING WORK | Europe | Remote

I'm a freelance consultant specializing in document processing.

Currently building an app that extracts key information from PDFs and automatically highlights the source text for each extracted data point, allowing users to easily verify the accuracy of the extracted information.

I can help with:

  - LLMs, RAG, Structured Extraction
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
Email: superdocsio@gmail.com

Web: https://superdocs.io


Location: Europe

Remote: Yes

Willing to relocate: Open

Technologies: LLM, RAG, Python, Document Processing, OCR, Image Processing, NLP

Portfolio: https://superdocs.io

Email: superdocsio@gmail.com

Currently building an app that extracts key information from PDFs and automatically highlights the source text for each extracted data point, allowing users to easily verify the accuracy of the extracted information.

I can help with:

  - LLMs, RAG, Structured Extraction
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
  - AI assistants


SEEKING WORK | Europe | Remote

I'm a freelance consultant specializing in document processing.

Currently building an app that extracts key information from PDFs and automatically highlights the source text for each extracted data point, allowing users to easily verify the accuracy of the extracted information.

I can help with:

  - LLMs, RAG, Structured Extraction
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
Email: superdocsio@gmail.com

Web: https://superdocs.io


SEEKING WORK | Europe | Remote

I'm a freelance consultant specializing in document processing.

Currently building an app for data extraction from contracts.

I can help with:

  - LLMs
  - Data extraction from PDF documents
  - OCR projects
  - Table detection/recognition
  - Document classification
  - Document splitting
  - PDF workflow automation
Email: superdocsio@gmail.com

Web: https://superdocs.io


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: