
You use bloom filtering to massively reduce the number of possible matching messages, then you do a regular text search on the remaining set.



But how do you actually index text with a single bloom filter?


You don't.

You put mail into buckets (using some hashes) and hand out the bloom filter for each bucket.

It is better than downloading all mail, but it is nowhere near the speed I want my email search to be.
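To make the bucket idea concrete, here is a rough Python sketch of how I read it. Everything in it is my own assumption rather than the actual implementation being discussed: the `BloomFilter` class, the filter size, the number of hashes, bucketing by a hash of the message id, and the names `build_index` and `search` are all hypothetical. Each bucket gets one filter over the words of its messages, a query skips every bucket whose filter rules the word out, and a regular text search runs on what is left.

```python
import hashlib
import re


class BloomFilter:
    """Minimal Bloom filter; the sizes here are illustrative assumptions."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        # False means "definitely absent"; True means "possibly present".
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def tokenize(text):
    """Lowercase the text and split it into a set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def bucket_of(message_id, num_buckets):
    """Assign a message to a bucket by hashing its id (an assumption)."""
    digest = hashlib.sha256(message_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def build_index(mailbox, num_buckets=4):
    """Return (buckets, filters): message ids per bucket, one filter each."""
    buckets = [[] for _ in range(num_buckets)]
    filters = [BloomFilter() for _ in range(num_buckets)]
    for message_id, body in mailbox.items():
        b = bucket_of(message_id, num_buckets)
        buckets[b].append(message_id)
        for word in tokenize(body):
            filters[b].add(word)
    return buckets, filters


def search(word, mailbox, buckets, filters):
    """Stage 1: drop buckets via their filters. Stage 2: exact word match."""
    word = word.lower()
    hits = []
    for ids, bloom in zip(buckets, filters):
        if not bloom.might_contain(word):
            continue  # the filter rules this bucket out, so skip it
        # In a real client this is where the bucket would be downloaded.
        for message_id in ids:
            if word in tokenize(mailbox[message_id]):
                hits.append(message_id)
    return sorted(hits)


mailbox = {
    "msg-1": "Quarterly invoice attached",
    "msg-2": "Lunch on Friday?",
    "msg-3": "Invoice overdue reminder",
}
buckets, filters = build_index(mailbox)
print(search("invoice", mailbox, buckets, filters))  # ['msg-1', 'msg-3']
```

The second stage is exact, so false positives from the filters only cost extra work and never produce wrong results. The cost is the one described above: every bucket the filter cannot rule out still has to be scanned in full.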


And there is a bucket for each word? That seems like a lot of bloom filters.



