Hi All, Can someone provide has any idea on my above question? Appreciate the help
On Thu, Apr 24, 2014 at 7:15 PM, krish ws <krisws.2...@gmail.com> wrote: > Hi, > I have a question related to hive table *bucketing* based on > multiple columns(*Clustered by* on a common set of columns). > > How would be the join performance if I am joining this table to itself > based on few columns that I have specified in *clustered by *condition(not > all)? > > Will the hashing differs based on few columns vs using all columns that I > specified in the *Clustered by* clause on a table? > > Regards > Krish >