Hello all,
We were having some trouble with the tasktracker on one of our machines
while doing a fetch and had to restart the tasktracker. Is it possible
to restart the task that was being done? We were doing a rather large
fetch, and it was reducing when it errored (exception is below). Is it
possible to use the data that was already mapped and just restart the
reduce job? or are we going to have to re-do the entire fetch?
Exception:
060120 125418 task_r_24x406 copy failed: task_m_2hi2zg from 127.0.0.2:61640
java.io.IOException: timed out waiting for response
at org.apache.nutch.ipc.Client.call(Client.java:296)
at org.apache.nutch.ipc.RPC$Invoker.invoke(RPC.java:127)
at $Proxy2.getFile(Unknown Source)
at
org.apache.nutch.mapred.ReduceTaskRunner.prepare(ReduceTaskRunner.java:94)
at org.apache.nutch.mapred.TaskRunner.run(TaskRunner.java:62)
Thanks for any info.
-Matt Zytaruk
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general