Infinite loop in CombineFileInputFormat#getMoreSplits(), with missing blocks
----------------------------------------------------------------------------
Key: MAPREDUCE-2862
URL: https://issues.apache.org/jira/browse/MAPREDUCE-2862
Project: Hadoop Map/Reduce
Issue Type: Bug
Reporter: Kazuki Ohta
Hi, we met the infinite loop on CombineFileInputFormat#getMoreSplits().
At first, we lost some blocks by mis-operation :-(. Then, one job tried to use
these missing blocks. At that time getMoreSplits() goes into the infinite loop.
>From our investigation, this List could be an empty array.
> https://github.com/apache/hadoop-mapreduce/blob/trunk/src/java/org/apache/hadoop/mapreduce/lib/input/CombineFileInputFormat.java#L363
Then 'for' loop just after that line does nothing, and entry is not removed
from 'blockToNodes'.
Finally this line goes into the infinite loop.
> https://github.com/apache/hadoop-mapreduce/blob/trunk/src/java/org/apache/hadoop/mapreduce/lib/input/CombineFileInputFormat.java#L348
We're now creating the patch against this problem...
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira