improving the logging during shuffling
--------------------------------------
Key: HADOOP-3332
URL: https://issues.apache.org/jira/browse/HADOOP-3332
Project: Hadoop Core
Issue Type: Improvement
Components: mapred
Reporter: Runping Qi
Below is an excerpt from the log file of a reducer.
A same set of of messages about fetching schedule is logged every second.
Yet, the critical information --- which hosts were slow --- was not there.
2008-05-01 00:33:13,215 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Need another 3 map output(s) where 1 is
already in progress
2008-05-01 00:33:14,216 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0: Got 0 new map-outputs & 0 obsolete
map-outputs from tasktracker and 0 map-outputs from previous failures
2008-05-01 00:33:14,216 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Got 2 known map output location(s);
scheduling...
2008-05-01 00:33:14,216 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Scheduled 0 of 2 known outputs (2 slow hosts
and 0 dup hosts)
2008-05-01 00:33:14,216 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Need another 3 map output(s) where 1 is
already in progress
2008-05-01 00:33:15,217 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0: Got 0 new map-outputs & 0 obsolete
map-outputs from tasktracker and 0 map-outputs from previous failures
2008-05-01 00:33:15,217 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Got 2 known map output location(s);
scheduling...
2008-05-01 00:33:15,217 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Scheduled 0 of 2 known outputs (2 slow hosts
and 0 dup hosts)
2008-05-01 00:33:15,217 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0 Need another 3 map output(s) where 1 is
already in progress
2008-05-01 00:33:16,218 INFO org.apache.hadoop.mapred.ReduceTask:
task_200804302255_0002_r_000720_0: Got 0 new map-outputs & 0 obsolete
map-outputs from tasktracker and 0 map-outputs from previous failures
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.