Greetings list,
I am trying to debug why my fetch process is dying on the reduce side -
I see a single reduce task out of 16 dying with the following message:

  Timed out.
  java.io.IOException: Task process exit with nonzero status.
      at org.apache.hadoop.mapred.TaskRunner.runChild

Which is
Shawn Gervais wrote:
When I have been at the terminal to observe the timed-out process before
it is reaped, I have seen that it continues to use 100% of a single
processor. Running strace on the Java process did not produce any usable leads.
When the reduce task is reassigned, either to the same