[ 
https://issues.apache.org/jira/browse/PIG-2587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bill Graham resolved PIG-2587.
------------------------------

       Resolution: Fixed
    Fix Version/s:     (was: 0.10.0)

Committed.
                
> Compute LogicalPlan signature and store in job conf
> ---------------------------------------------------
>
>                 Key: PIG-2587
>                 URL: https://issues.apache.org/jira/browse/PIG-2587
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Bill Graham
>            Assignee: Bill Graham
>             Fix For: 0.11
>
>         Attachments: pig-2587_1.patch
>
>
> We'd like to be able to uniquely identify a re-executed script (possibly with 
> different inputs/outputs) by creating a signature of the {{LogicalPlan}}. 
> Here's the proposal:
> # Add a new method {{LogicalPlan.getSignature()}} that returns a hash of its 
> {{LogicalPlanPrinter}} output.
> # In {{PigServer.execute()}} set the signature on the job conf after the LP 
> is compiled, but before it's executed.
> (1) would allow an impl of 
> {{PigProgressNotificationListener.setScriptPlan()}} to save the LP signature 
> with the script metadata. Upon subsequent runs (2) would allow an impl of 
> {{PigReducerEstimator}} (see PIG-2574) to retrieve the current LP signature 
> and fetch the historical data for the script. It could then use the previous 
> run data to better estimate the number of reducers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to