This document should be helpful for you

https://wiki.apache.org/pig/PigSkewedJoinSpec



Best Regard,
Jeff Zhang





On 7/14/15, 4:56 AM, "Gagan Juneja" <gagandeepjun...@gmail.com> wrote:

>Hi Team,
>
>We are using Pig intensively in our various projects. We are doing
>optimizations for that we wanted to know how join works. Though we have
>moved to Skewed joins for some of our use cases.
>
>At many places in the documentation this is mentioned that in join data is
>streamed for second table. But I was identify how this can fit in map
>reduce paradigm.
>
>1. Can anyone please clarify how join happens in pig.
>2. What is the meaning of Streaming here? Are we loading the files
>directly
>in the reducres?
>
>
>Regards,
>Gagan

Reply via email to