[ 
https://issues.apache.org/jira/browse/PIG-4202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14151964#comment-14151964
 ] 

Bikas Saha commented on PIG-4202:
---------------------------------

To be clear, none of my above comment was for processors. It was for the 
OutputCommitter API in Tez which runs in the AM per vertex per output. The 
lifecycle is 
committer.initialize()->committer->setupOutputs()->committer->commitOutput(). 
In the absence of side effects commitOutput() is the equivalent of close(). We 
can add a new committer.close() which would be called immediately after 
committer.commitOutput(). So its value is not clear. Does PigOutputCommitter 
derive from OutputCommitter in Tez?

Questions to understand this better. Are there any issues in cleaning up within 
commitOutput() after commit operations have completed? Do multiple committers 
in a vertex (vertex with multiple HDFS outputs) or multiple outputs in 
different vertices work today? If yes, then why are the statics not causing a 
problem?

> UDFContext.isFrontend return true for OutputCommitter in local mode
> -------------------------------------------------------------------
>
>                 Key: PIG-4202
>                 URL: https://issues.apache.org/jira/browse/PIG-4202
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: 0.14.0
>
>         Attachments: PIG-4202-1.patch, PIG-4202-2.patch
>
>
> We switch to use UDFContext.isFrontend in PIG-4187. However, it does not work 
> as expected in local mode. This causes some local mode test fail 
> (TestOrcStoragePushdown.createInputData).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to