[jira] Commented: (HIVE-1100) Make it possible for users to retry map-reduce jobs in a single Hive query

Zheng Shao (JIRA) Mon, 25 Jan 2010 18:00:57 -0800

    [ 
https://issues.apache.org/jira/browse/HIVE-1100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12804848#action_12804848
 ]


Zheng Shao commented on HIVE-1100:
----------------------------------

bq. Is it a timeout based approach? When does the cleanup happen if the client 
never issues a clear/continue ?

Actually we also have this problem today - if Hive failed before it cleans up 
the temporary data, but with this feature the problem will be much more obvious.
We can add the date and time into "ExecutionId", so that an external process 
can come in and clean things up.

bq. Do we need to enhance explain plan or some other tool so that the client 
knows which tasks will the query execute ?

That's a good idea. 

8. Add the "done" information for each task to explain plan so that users can 
know the status of the query.


> Make it possible for users to retry map-reduce jobs in a single Hive query
> --------------------------------------------------------------------------
>
>                 Key: HIVE-1100
>                 URL: https://issues.apache.org/jira/browse/HIVE-1100
>             Project: Hadoop Hive
>          Issue Type: New Feature
>    Affects Versions: 0.6.0
>            Reporter: Zheng Shao
>            Assignee: Zheng Shao
>
> Sometimes a single hive query get compiled into several map-reduce jobs, and 
> one of the jobs failed because of some transient errors.
> Users would need to start from scratch.
> We should allow the user to start from the point of failure to continue the 
> query.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-1100) Make it possible for users to retry map-reduce jobs in a single Hive query

Reply via email to