[ https://issues.apache.org/jira/browse/HIVE-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12804848#action_12804848 ]
Zheng Shao commented on HIVE-1100: ---------------------------------- bq. Is it a timeout based approach? When does the cleanup happen if the client never issues a clear/continue ? Actually we also have this problem today - if Hive failed before it cleans up the temporary data, but with this feature the problem will be much more obvious. We can add the date and time into "ExecutionId", so that an external process can come in and clean things up. bq. Do we need to enhance explain plan or some other tool so that the client knows which tasks will the query execute ? That's a good idea. 8. Add the "done" information for each task to explain plan so that users can know the status of the query. > Make it possible for users to retry map-reduce jobs in a single Hive query > -------------------------------------------------------------------------- > > Key: HIVE-1100 > URL: https://issues.apache.org/jira/browse/HIVE-1100 > Project: Hadoop Hive > Issue Type: New Feature > Affects Versions: 0.6.0 > Reporter: Zheng Shao > Assignee: Zheng Shao > > Sometimes a single hive query get compiled into several map-reduce jobs, and > one of the jobs failed because of some transient errors. > Users would need to start from scratch. > We should allow the user to start from the point of failure to continue the > query. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.