
mandatory · on Aug 11, 2024

Yep, that's an example of what the automated scanning looks for. You can see a very similar example in the slides: https://media.defcon.org/DEF%20CON%2032/DEF%20CON%2032%20pre...

mandatory · on Aug 11, 2024

It's only because I gave this talk and made FindThatMeme :) So it's not a popular method, just what I used to do large-scale OCR.

krackers · on Aug 11, 2024

Oh I completely missed that you're actually the same guy!

mandatory · on Jan 11, 2023

Yes, I definitely want to improve the search. It is currently very text-heavy, and I only recently got image similarity indexing working. Hoping to leverage this to do something like you mentioned!

I'd also like to figure out how to turn an image into a description of what's in it. My ML/TensorFlow knowledge is very weak though, so I still have a lot to learn here.

mandatory · on Jan 11, 2023

The image similarity search is probably a blog post of its own.

Short TL;DR: It runs off my home server running a large vector database (opendistro): https://opendistro.github.io/for-elasticsearch-docs/docs/knn...
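For anyone unfamiliar with what a k-NN vector index does, here is a minimal sketch of the idea, not the actual setup described above. It assumes each image has already been turned into an embedding vector by some model; the filenames, the three-dimensional vectors, and the function names are all made up for illustration. A real vector database typically uses approximate search to scale, whereas this is plain brute force.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def nearest_images(query, index, k=3):
    """Return the names of the k indexed images most similar to the query vector."""
    scored = [(cosine_similarity(query, vec), name) for name, vec in index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:k]]


# Hypothetical embeddings; real ones would have hundreds of dimensions.
index = {
    "cat_meme.png": [0.9, 0.1, 0.0],
    "cat_meme_cropped.png": [0.8, 0.2, 0.1],
    "dog_meme.png": [0.1, 0.9, 0.2],
}

matches = nearest_images([1.0, 0.0, 0.0], index, k=2)
print(matches)
```

Brute force compares the query against every stored vector, which is fine for a toy example but is exactly the cost a dedicated k-NN index is meant to avoid at large scale.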

mandatory · on Jan 11, 2023

Nope, you can use it totally offline. No way of getting banned as far as I'm aware.

mandatory · on Jan 11, 2023

Yep, this is exactly what I'm running on the Raspberry Pi LB. Nginx makes it super easy!

mandatory · on Jan 11, 2023

Author here: KnowYourMeme is one of many sites that memes are continually ingested from (any site that has memes I try to ingest regularly) :)

aemreunal · on Jan 11, 2023

Amazing work! Also, thank you for making that feed on the main page, been laughing for a while here :D

yojo · on Jan 11, 2023

Also lost 20 minutes doom scrolling that feed. Add an upvote button and some ML and you could destroy some lives.

mandatory · on Jan 11, 2023

Thanks! Comment made my night.

GistNoesis · on Jan 11, 2023

Nice iPhone cluster.

Have you tried something based on deep learning that uses Transformers: https://github.com/roatienza/deep-text-recognition-benchmark (available weights are for tasks that seem similar to OCR, so there is a good chance you can use it out of the box). With a good GPU it should process hundreds to thousands of images per second, so you can likely build your index in less than a day. (Maybe you can even port it to your iPhone stack :) )

https://github.com/microsoft/GenerativeImage2Text (You'll probably have to train it on the custom dataset you have put together)

There are tons of other freely available solutions that you can get with a search for things with keywords like "image to text ocr" "transformers" "visual transformers"...

astrange · on Jan 12, 2023

You can do better than a general image-to-text model reading memes, because they all use the same fonts, so you want something trained on synthetic data made with that font.

generalizations · on Jan 11, 2023

Personally, I've been hunting for something that can extract both the text and the associated image. I've never seen anything that can do both.

taneq · on Jan 11, 2023

All hail the memelord!

spiffytech · on Jan 12, 2023

How do you ingest your social circle's in-group memes? Are they reliably posted to meme generator sites?

counttheforks · on Jan 11, 2023

What about copyright?

mandatory · on Oct 18, 2022

What kind of app? I'm planning on releasing a developer API soon so people can integrate it into their own bots/services.

mandatory · on Oct 16, 2022

Thanks!

mandatory · on Oct 16, 2022

Well, I just added it :) so you should be able to find it now. If you find any others, feel free to upload them!