Progress reported for pipes tasks is incorrect.
-----------------------------------------------
Key: MAPREDUCE-1073
URL: https://issues.apache.org/jira/browse/MAPREDUCE-1073
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: pipes
Reporter: Sreekanth Ramakrishnan
Currently in pipes,
{{org.apache.hadoop.mapred.pipes.PipesMapRunner.run(RecordReader<K1, V1>,
OutputCollector<K2, V2>, Reporter)}} we do the following:
{code}
while (input.next(key, value)) {
downlink.mapItem(key, value);
if(skipping) {
downlink.flush();
}
}
{code}
This would result in consumption of all the records for current task and taking
task progress to 100% whereas the actual pipes application would be trailing
behind.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.