Hacker Newsnew | past | comments | ask | show | jobs | submit | jlink's commentslogin

I would be happy to have your feedback. You can use it for free, no account is required. According your needs, which feature is missing or useless? Does it make a better job than your existing tool? Do you even use a tool? Would you be willing to give a try in your team?


Make me think of this cluster of 200 PS3 that was used in 2009 by EPFL to solve the elliptic curve discrete logarithm problem: https://www.epfl.ch/labs/lacal/articles/112bit_prime/


Initially, I wanted to go way further and add 3D avatars dancing like the connected users. This would use the webcam + bodypix to map the user dance with its 3D avatar. However, all of this requires just too much computation on client side to have something useable. Anyway, any suggestions on that lighter version?


... I didn't know she was a celebrity. Initially, I intended to make a youtube video with a compilation of random people on chatroulette trying to solve an integral that I was showing on the webcam. I've never finished that project but now 10 years later I retrieved that video rush and it made me smile.


haha nice one!


It works also with empty cells in the middle.


I didn't know about Tabula and i've given a try at the instant. Apparently it only extracts tables and ignores everything around. This might be good in some cases but it is a problem if you want to extract a form, a whole textbook, your bank statements or anything. Also, I noticed that Tabula has some slight troubles when columns are not drawn in the table. But overall it is a good tool for extracting only tables, that's true.


could be a nice feature but not easy task. I'll give a try, though.


Please update us/me when you do. I'm also working on the same problem, would love to chat.


During the development I compared my results with the ones of pdftotext utility and i obtained more or less similar results. The objective of my code was to have an equivalent tool easily embeddable in any java/android project and to learn more about apache pdfbox.


I imagine it's not an easy task guessing about proportionally spaced fonts, overlapping bounding boxes, columns, tables, wrapping, and so forth.


yes, definitely not easy but fortunately pdfbox offers a solid base to start with.


Happy to know it could help you. Good cooking to you!


Both I and my accountant thank you haha.


glad to hear that!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: