Semhash: Fast deduplication and dataset multitool in Python (minishlab.github.io)
3 points by stephantul 12 months ago | 1 comment


Hello,

Today we released Semhash! Here's a blog post on how it works. We did a Show HN yesterday, but that got deleted for some reason.

Let me know what you think!



