Mathias Herberts created PIG-2983:
-------------------------------------

             Summary: Enable storing/loading an execution plan
                 Key: PIG-2983
                 URL: https://issues.apache.org/jira/browse/PIG-2983
             Project: Pig
          Issue Type: Wish
          Components: impl
            Reporter: Mathias Herberts


I have no idea at the moment if this is feasible or not, but here is the idea.

When running Pig in production it happens quite frequently that the building of 
the plan takes a fair amount of time compared to the actual execution of the 
job on the Hadoop cluster.

For jobs which are run periodically without modification of the underlying pig 
script it would be nice to be able to store the execution plan when it is first 
built, then load that persisted plan for all other executions.

A unique signature of the script could be computed, for example by removing 
comments and blank lines and computing a digest, and the computed plan stored 
under that name with a signature mechanism to ensure it's correct.

This would speed up lots of executions of Pig.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to