Data scientist here. I'm surprised nobody here has yet mentioned the false-positive issue with trawling a large dataset: even a tiny false positive rate leads to a very high probability of convicting an innocent person if you mine a large enough database.