What's the Minhash key groups value used for in the MinhashDriver?  I mean, I 
see it is used for building up the key out of the hashed values, but what's the 
significance of different values for it?  The default is 2, what does it mean 
practically speaking if I choose, say, 10?  AFAICT, it would mean that I would 
have more clusters, assuming that we still meet the minimum cluster size 
imposed by the reducer?

Thanks,
Grant

Reply via email to