I didn't account for names entities or n-grams in the feature vector though. That's a very interesting idea.
@mattdeboard - what algorithm did you use to count the occurrence and size of clusters?
I didn't account for names entities or n-grams in the feature vector though. That's a very interesting idea.
@mattdeboard - what algorithm did you use to count the occurrence and size of clusters?