[ 
https://issues.apache.org/jira/browse/TEZ-1062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13978783#comment-13978783
 ] 

Hitesh Shah commented on TEZ-1062:
----------------------------------

bq. We should first go through all outputs to check if they are MROutput and if 
they need commit.

This is wrong IMO. Checking for specific classes should not be done. Either 
assume all outputs need a commit or query the output to check if it requires a 
commit. I believe currently pretty much all current outputs ( intermediate data 
outputs too ) that write to disk need a commit as this is a task level commit 
to handle cases such as speculative attempts. 

> Create SimpleProcessor for processors that only need to implement the run 
> method
> --------------------------------------------------------------------------------
>
>                 Key: TEZ-1062
>                 URL: https://issues.apache.org/jira/browse/TEZ-1062
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Bikas Saha
>            Assignee: Mohammad Kamrul Islam
>         Attachments: TEZ-1062.1.patch
>
>
> The SimpleProcessor could take care of all things like starting input, 
> committing outputs. It would handle no events, since simple processors dont 
> need to handle inputs. Thus the user would only need to implement their 
> custom task logic in a new execute() method.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to