Mathias Herberts created PIG-2983:
-------------------------------------
Summary: Enable storing/loading an execution plan
Key: PIG-2983
URL: https://issues.apache.org/jira/browse/PIG-2983
Project: Pig
Issue Type: Wish
Components: impl
Reporter: Mathias Herberts
I have no idea at the moment if this is feasible or not, but here is the idea.
When running Pig in production it happens quite frequently that the building of
the plan takes a fair amount of time compared to the actual execution of the
job on the Hadoop cluster.
For jobs which are run periodically without modification of the underlying pig
script it would be nice to be able to store the execution plan when it is first
built, then load that persisted plan for all other executions.
A unique signature of the script could be computed, for example by removing
comments and blank lines and computing a digest, and the computed plan stored
under that name with a signature mechanism to ensure it's correct.
This would speed up lots of executions of Pig.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira