This document should be helpful: https://wiki.apache.org/pig/PigSkewedJoinSpec

Best Regards,
Jeff Zhang

On 7/14/15, 4:56 AM, "Gagan Juneja" <gagandeepjun...@gmail.com> wrote:

>Hi Team,
>
>We use Pig intensively in our various projects. We are working on
>optimizations, and for that we want to understand how join works. We have
>already moved to skewed joins for some of our use cases.
>
>In many places the documentation mentions that in a join the data is
>streamed for the second table, but I was unable to identify how this fits
>into the MapReduce paradigm.
>
>1. Can anyone please clarify how a join happens in Pig?
>2. What is the meaning of streaming here? Are we loading the files
>directly in the reducers?
>
>Regards,
>Gagan