[ 
https://issues.apache.org/jira/browse/PIG-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13270903#comment-13270903
 ] 

Julien Le Dem commented on PIG-2659:
------------------------------------

Hi Daniel,
Yes my original motivation was to use this in the visualizer.
Note that I made the location transient for now because some tests compare 
serialized physical plans against saved binary versions, which is very 
brittle and imprecise: such a test fails whenever anything changes in the 
serialized representation of the physical plan.

Explanation of the data:
M: A[1,4],A[3,4],B[2,4] C: A[3,4],B[2,4] R: A[3,4]
M: the Mapper plan contains the following aliases
C: the Combiner plan contains the following aliases
R: the Reducer plan contains the following aliases
Each item is:
alias[line,offset]
This could also be split in 3 different properties:
pig.alias.location.mapper: A[1,4],A[3,4],B[2,4]
pig.alias.location.combiner: A[3,4],B[2,4]
pig.alias.location.reducer: A[3,4]
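
For illustration, a consumer such as a visualizer could read the combined 
value back roughly as follows. This is only a minimal Python sketch and not 
code from the patch: the function name parse_alias_location and both regular 
expressions are assumptions, and aliases are assumed to consist of plain word 
characters.

```python
import re

# Hypothetical helper, not part of Pig. It assumes the value looks like
# "M: A[1,4],A[3,4],B[2,4] C: A[3,4],B[2,4] R: A[3,4]".
SECTION = re.compile(r"([MCR]):\s*(.*?)(?=\s+[MCR]:|\s*$)")
ITEM = re.compile(r"(\w+)\[(\d+),(\d+)\]")


def parse_alias_location(value):
    """Map each plan marker (M, C, R) to a list of (alias, line, offset)."""
    plans = {}
    for plan, body in SECTION.findall(value):
        # Each item in the body has the form alias[line,offset].
        plans[plan] = [
            (alias, int(line), int(offset))
            for alias, line, offset in ITEM.findall(body)
        ]
    return plans


if __name__ == "__main__":
    example = "M: A[1,4],A[3,4],B[2,4] C: A[3,4],B[2,4] R: A[3,4]"
    for plan, items in parse_alias_location(example).items():
        print(plan, items)
```

With three separate properties, the section-splitting step would not be 
needed and only the per-item pattern would remain.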

Yes, we should also make sure the info of merged operators is maintained.

I will take care of the whitespace.

There is a test here:
test/org/apache/pig/newplan/logical/relational/TestLocationInPhysicalPlan.java


                
> add source location of the aliases in the physical plan
> -------------------------------------------------------
>
>                 Key: PIG-2659
>                 URL: https://issues.apache.org/jira/browse/PIG-2659
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>            Reporter: Julien Le Dem
>            Assignee: Julien Le Dem
>         Attachments: PIG-2659.patch
>
>
> The goal is to provide better information about what is actually running in a 
> job, in particular when alias names are being reused.
> For example with the following script:
> {code}
> A = LOAD 'foo' using PigStorage();
> B = GROUP A BY $0;
> A = FOREACH B GENERATE COUNT(A);
> STORE A INTO 'bar';
> {code}
> The job conf will contain the following information
> {code}
> pig.alias.location: M: A[1,4],A[3,4],B[2,4] C: A[3,4],B[2,4] R: A[3,4]
> {code}
> A caveat is that the Logical Plan Optimizer throws away the original 
> information when merging Logical Operators.
> This is already the case today with pig.alias.

        
