[ 
https://issues.apache.org/jira/browse/PIG-781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12706083#action_12706083
 ] 

Olga Natkovich commented on PIG-781:
------------------------------------

Hi Gunther,

The output looks good - this is exactly what we want.

This would solve issues for adhoc queries; however, we also need to make sure 
that users can detect this programatically. This has two part to it.

(1) The return code they see when a program partially successful. We need to 
add a new return code to 
http://wiki.apache.org/pig/PigErrorHandlingFunctionalSpecification for this.
(2) A per output done file either on DFS or on the local file system to 
indicate success.

I think, for now, we should at least do (1). (2) requires more though to make 
sure we don't leave done files behind forever.

> Error reporting for failed MR jobs
> ----------------------------------
>
>                 Key: PIG-781
>                 URL: https://issues.apache.org/jira/browse/PIG-781
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Gunther Hagleitner
>         Attachments: partial_failure.patch
>
>
> If we have multiple MR jobs to run and some of them fail the behavior of the 
> system is to not stop on the first failure but to keep going. That way jobs 
> that do not depend on the failed job might still succeed.
> The question is to how best report this scenario to a user. How do we tell 
> which jobs failed and which didn't?
> One way could be to tie jobs to stores and report which store locations won't 
> have data and which ones do.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to