[ 
https://issues.apache.org/jira/browse/ACCUMULO-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13270303#comment-13270303
 ] 

Scott Kuehn commented on ACCUMULO-488:
--------------------------------------

The default timeout is 10 minutes, so the iterators in the InputFormat's 
RecordReaders would have to take longer than that to return a single record 
(progress is reported implicitly when a mapper receives a record). I suppose 
AccumuloRowInputFormat is more of a risk if a row has lots of cells, but 10 
minutes still seems like plenty of time.
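
To make the reasoning concrete, here is a minimal sketch of the kind of check 
being described. It is a simplified model, not Hadoop's actual implementation: 
the class and method names (TaskTimeoutSketch, recordDelivered, isTimedOut) are 
hypothetical, and time is passed in explicitly so the behavior is easy to follow.

```java
import java.util.concurrent.TimeUnit;

/** Simplified model of a task-timeout check; NOT Hadoop's real code. */
final class TaskTimeoutSketch {
    /** The 10 minute default mentioned above, in milliseconds. */
    static final long DEFAULT_TIMEOUT_MS = TimeUnit.MINUTES.toMillis(10);

    private final long timeoutMs;
    private long lastProgressMs;

    TaskTimeoutSketch(long timeoutMs, long startMs) {
        this.timeoutMs = timeoutMs;
        this.lastProgressMs = startMs;
    }

    /** Each record handed to the mapper counts as implicit progress. */
    void recordDelivered(long nowMs) {
        lastProgressMs = nowMs;
    }

    /**
     * True only if the gap since the last record exceeds the timeout.
     * A timeout of 0 is treated as "never time out".
     */
    boolean isTimedOut(long nowMs) {
        return timeoutMs > 0 && nowMs - lastProgressMs > timeoutMs;
    }

    public static void main(String[] args) {
        TaskTimeoutSketch task = new TaskTimeoutSketch(DEFAULT_TIMEOUT_MS, 0L);

        // Nine minutes without a record: still inside the window.
        long nineMinutes = TimeUnit.MINUTES.toMillis(9);
        System.out.println("timed out after 9 min: " + task.isTimedOut(nineMinutes));

        // A record arrives, which resets the clock.
        task.recordDelivered(nineMinutes);

        // Eleven more minutes pass before the next record shows up.
        long later = nineMinutes + TimeUnit.MINUTES.toMillis(11);
        System.out.println("timed out after 11 min gap: " + task.isTimedOut(later));
    }
}
```

The point of the model is that only the gap between two consecutive records 
matters, not the total running time of the task.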

Since waiting more than 10 minutes for a single record is somewhat abnormal, 
users who anticipate this behavior could raise the timeout threshold by 
adjusting the 'mapreduce.task.timeout' property (in milliseconds) in their 
Job's configuration.  Set it to 0 and the task won't time out.
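
As a rough illustration, the property could be set on a Job like this. This is 
a sketch rather than a tested recipe: it needs the Hadoop client libraries on 
the classpath, the helper class TaskTimeoutConfig is hypothetical, and some 
older Hadoop releases spell the property 'mapred.task.timeout', so check the 
name against the version in use.

```java
import org.apache.hadoop.mapreduce.Job;

/** Hypothetical helper for adjusting the task timeout on a MapReduce job. */
final class TaskTimeoutConfig {
    /** Property name as given above; assumed to match the Hadoop version in use. */
    static final String TASK_TIMEOUT_KEY = "mapreduce.task.timeout";

    private TaskTimeoutConfig() {}

    /**
     * Sets the task timeout in milliseconds on the job's configuration.
     * Passing 0 disables the timeout entirely.
     */
    static void setTaskTimeout(Job job, long timeoutMs) {
        job.getConfiguration().setLong(TASK_TIMEOUT_KEY, timeoutMs);
    }
}
```

Calling TaskTimeoutConfig.setTaskTimeout(job, 0L) before submitting the job 
would correspond to the "set it to 0" case.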

Is there a scenario that I'm missing?  If not, I think this ticket could be 
closed out -- perhaps just mentioning the timeout var in the javadocs.
                
> InputFormats' RecordReaders should call Context.progress
> --------------------------------------------------------
>
>                 Key: ACCUMULO-488
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-488
>             Project: Accumulo
>          Issue Type: Bug
>          Components: client
>    Affects Versions: 1.4.0, 1.3.5
>            Reporter: John Vines
>              Labels: newbie
>             Fix For: 1.4.1
>
>
> The RecordReaders for both input formats never call Context.progress(). This 
> can leave long-running tasks timing out while still making progress.


        
