[ 
https://issues.apache.org/jira/browse/PIG-4574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rohini Palaniswamy updated PIG-4574:
------------------------------------
    Attachment: PIG-4574-fix-testfailures.patch

Fixing unit test failures in 
https://builds.apache.org/job/Pig-trunk-commit/2232/testReport/

  - TestPigServerLocal.testSkipParseInRegisterForBatch
    For LOAD + ORDER BY, the LOAD is now repeated separately for the sampler 
    and the partitioner, as in MapReduce, so the number of LoadFunc 
    instantiations increases.
  - TestTezGraceParallelism.testJoinWithDifferentDepth
    The scope id changes because LOAD + ORDER BY now has a different plan.
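
For reference, a minimal Pig Latin sketch of the LOAD + ORDER BY shape 
discussed here. The paths, aliases and schema are hypothetical and are not 
taken from the tests; it only illustrates a script in which ORDER BY directly 
follows LOAD + FOREACH with no FILTER in between.

```pig
-- Hypothetical input path and schema, for illustration only.
A = LOAD 'input/data' AS (id:int, name:chararray);
-- Optional projection; no FILTER between the LOAD and the ORDER BY.
B = FOREACH A GENERATE id, name;
-- ORDER BY immediately after LOAD + FOREACH.
C = ORDER B BY id;
STORE C INTO 'output/sorted';
```

With a script of this shape, the same input is read once for sampling and 
once more for the partitioner, which is why more LoadFunc instances are 
created than before.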


> Eliminate identity vertex for order by and skewed join right after LOAD
> -----------------------------------------------------------------------
>
>                 Key: PIG-4574
>                 URL: https://issues.apache.org/jira/browse/PIG-4574
>             Project: Pig
>          Issue Type: Sub-task
>          Components: tez
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>         Attachments: PIG-4574-1.patch, PIG-4574-fix-testfailures.patch
>
>
> If ORDER BY or SKEWED JOIN is the operator immediately following LOAD+FOREACH 
> without any FILTER, then the data should be read again from HDFS for the 
> partitioner vertex instead of being written to an identity vertex and read 
> back from it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
