Hi,
I have a fields grouping based on 2 fields. I have 32 consumers for the
tuple, and most of the time, out of 64 bolts, the tuples for this grouping
land on only 8 of them. Of those 8, 2 receive more than 60% of the data. The
grouping fields can take 20 different combinations of values.

Do you know how the hash of the grouping fields is computed? One of the
mailing list posts describes the approach as:

*It calls "hashCode" on the list of selected values and mods it by the*
*number of consumer tasks. You can play around with that function to see if*
*something about your data is causing something degenerative to happen and*
*cause skew*

I looked at the Clojure code, but I am not sure how to read it.
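
To experiment with the quoted description, here is a minimal Java sketch of
what I understand it to mean. It assumes the hash of the selected values
behaves like java.util.List#hashCode and that the result is reduced to a
non-negative index. The class FieldGroupingSketch, its methods, and the sample
keys are hypothetical and are not taken from the Storm source.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not Storm's own code: it only mirrors the quoted
// description (hashCode of the selected values, mod the number of tasks).
final class FieldGroupingSketch {

    // Assumption: the list hash behaves like java.util.List#hashCode.
    // floorMod keeps the index non-negative when the hash is negative.
    static int targetTask(List<?> selectedValues, int numTasks) {
        return Math.floorMod(selectedValues.hashCode(), numTasks);
    }

    // Counts how many key combinations map to each task index.
    static int[] countByTask(List<? extends List<?>> keys, int numTasks) {
        int[] counts = new int[numTasks];
        for (List<?> key : keys) {
            counts[targetTask(key, numTasks)]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        // Placeholder combinations; substitute the real values of the two fields.
        List<List<?>> keys = new ArrayList<>();
        keys.add(Arrays.asList("region-1", "type-a"));
        keys.add(Arrays.asList("region-1", "type-b"));
        keys.add(Arrays.asList("region-2", "type-a"));
        keys.add(Arrays.asList("region-2", "type-b"));

        int[] counts = countByTask(keys, 32);
        for (int task = 0; task < counts.length; task++) {
            if (counts[task] > 0) {
                System.out.println("task " + task + " <- " + counts[task] + " key(s)");
            }
        }
    }
}
```

If that assumption holds, one thing to check: List#hashCode multiplies by 31,
and 31 is congruent to -1 modulo 32, so with 32 tasks and two fields the index
reduces to (1 - hash(first) + hash(second)) mod 32. Combinations whose two
field hashes differ by the same amount would then land on the same task, which
might be one source of the skew.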

Thanks
Kashyap
