[ 
https://issues.apache.org/jira/browse/PIG-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771693#action_12771693
 ] 

Sriranjan Manjunath commented on PIG-1048:
------------------------------------------

I have also modified a skewed join test case to check if atleast one key is 
present in more than 1 partition instead of checking for all the keys being 
present in multiple partitions. Since the dataset was too small sampler with 
the RLR change did not detect these small keys causing the unit test to fail.

> inner join using 'skewed' produces multiple rows for keys with single row in 
> both input relations
> -------------------------------------------------------------------------------------------------
>
>                 Key: PIG-1048
>                 URL: https://issues.apache.org/jira/browse/PIG-1048
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Thejas M Nair
>            Assignee: Sriranjan Manjunath
>         Attachments: pig_1048.patch
>
>
> ${code}
> grunt> cat students.txt                           
> asdfxc  M       23      12.44
> qwer    F       21      14.44
> uhsdf   M       34      12.11
> zxldf   M       21      12.56
> qwer    F       23      145.5
> oiue    M       54      23.33
>  l1 = load 'students.txt';            
> l2 = load 'students.txt';                  
> j = join l1 by $0, l2 by $0 ; 
> store j into 'tmp.txt'             
> grunt> cat tmp.txt
> oiue    M       54      23.33   oiue    M       54      23.33
> oiue    M       54      23.33   oiue    M       54      23.33
> qwer    F       21      14.44   qwer    F       21      14.44
> qwer    F       21      14.44   qwer    F       23      145.5
> qwer    F       23      145.5   qwer    F       21      14.44
> qwer    F       23      145.5   qwer    F       23      145.5
> uhsdf   M       34      12.11   uhsdf   M       34      12.11
> uhsdf   M       34      12.11   uhsdf   M       34      12.11
> zxldf   M       21      12.56   zxldf   M       21      12.56
> zxldf   M       21      12.56   zxldf   M       21      12.56
> asdfxc  M       23      12.44   asdfxc  M       23      12.44
> asdfxc  M       23      12.44   asdfxc  M       23      12.44$
> ${code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to