Hello all,

We were having some trouble with the tasktracker on one of our machines while doing a fetch and had to restart the tasktracker. Is it possible to restart the task that was being done? We were doing a rather large fetch, and it was reducing when it errored (exception is below). Is it possible to use the data that was already mapped and just restart the reduce job? or are we going to have to re-do the entire fetch?

Exception:
060120 125418 task_r_24x406 copy failed: task_m_2hi2zg from 127.0.0.2:61640
java.io.IOException: timed out waiting for response
       at org.apache.nutch.ipc.Client.call(Client.java:296)
       at org.apache.nutch.ipc.RPC$Invoker.invoke(RPC.java:127)
       at $Proxy2.getFile(Unknown Source)
at org.apache.nutch.mapred.ReduceTaskRunner.prepare(ReduceTaskRunner.java:94)
       at org.apache.nutch.mapred.TaskRunner.run(TaskRunner.java:62)

Thanks for any info.

-Matt Zytaruk


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to