What's the Minhash key groups value used for in the MinhashDriver? I mean, I see it is used for building up the key out of the hashed values, but what's the significance of different values for it? The default is 2, what does it mean practically speaking if I choose, say, 10? AFAICT, it would mean that I would have more clusters, assuming that we still meet the minimum cluster size imposed by the reducer?
Thanks, Grant
