On Feb 1, 2012, at 5:04 PM, Aleksandr Elbakyan wrote: > Hello All, > > I am trying to understand how does pig group partitioning work, I was not > able to find any documentation regarding what happen under the hood. > > > For example > > B = GROUP A BY age; > > Does pig partition data by age? Or it will partition by something else?
It partitions by the group by key (age in this case). Similarly for joins and order by, it partitions by the join key and the sort key. > > > Other question: > If I want to create custom partitioner can I pass fields I want data be > partition by or it will be the same as group by key? No. Alan. > > > Regards, > Aleksandr >
