Hacker Newsnew | past | comments | ask | show | jobs | submit | mandatory's commentslogin

Yep, that's an example of what the automated scanning looks for. You can see a very similar example in the slides: https://media.defcon.org/DEF%20CON%2032/DEF%20CON%2032%20pre...


It's just because I did this talk and made FindThatMeme :) so not a popular method, just what I used to do large scale OCR.


Oh I completely missed that you're actually the same guy!


Yes I definitely want to improve the search to be better. It is currently very text heavy and I (only recently) got image similarity indexing working. Hoping to leverage this to do something like you mentioned!

I'd also like to figure out how to turn an image into a description of whats in it. My ML/tensorflow knowledge is very weak though, so I still have a lot to learn here.


The image similarity search is probably a blog post of its own.

Short TL;DR: It runs off my home server running a large vector database (opendistro): https://opendistro.github.io/for-elasticsearch-docs/docs/knn...


Nope, you can use it totally offline. No way of getting banned as far as I'm aware.


Yep, this is exactly what I'm running on the raspberry pi LB. Nginx makes it super easy!


Author here: KnowYourMeme is one of many sites that memes are continually ingested from (any site that has memes I try to ingest regularly) :)


Amazing work! Also, thank you for making that feed on the main page, been laughing for a while here :D


Also lost 20 minutes doom scrolling that feed. Add an upvote button and some ML and you could destroy some lives.


Thanks! Comment made my night.


Nice IPhone cluster.

Have you tried something based on deep-learning that uses Transformers : https://github.com/roatienza/deep-text-recognition-benchmark (available weights are for tasks that seem similar to OCR so there is a good chance you can use it out of the box). With a good gpu it should process hundreds to thousands image per seconds, so you likely can build your index in less than a day. (Maybe you can even port it to your iphone stack :) )

https://github.com/microsoft/GenerativeImage2Text (You'll probably have to train on your custom dataset that you have constituted)

There are tons of other freely available solutions that you can get with a search for things with keywords like "image to text ocr" "transformers" "visual transformers"...


You can do better than a general image-to-text model reading memes, because they all use the same fonts - so you want something trained off synthetic data made with that font.


Personally, I've been hunting for something that can extract both the text and the associated image. I've never seen anything that can do both.


All hail the memelord!


How do you ingest your social circle's in-group memes? Are they reliably posted to meme generator sites?


What about copyright?


What kind of app? I'm planning on releasing a developer API soon so people can integrate it into their own bots/services.


Thanks!


Well I just added it :) so you should be able to find it now. If you find any others feel free to upload them!


Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: