Hi list,

I downloaded the nightly build of nutch+hadoop, and have been trying to get it working on a small cluster of machines.

I have it working properly on a single machine, however when I try to have my map and reduce tasks run on the cluster slaves, I get the following exception:

060405 000219 SEVERE Can't open map output:/home2/nutch/filesystem/mapreduce/local/part-2.out/task_m_1v749p java.io.FileNotFoundException: /home2/nutch/filesystem/mapreduce/local/part-2.out/task_m_1v749p at org.apache.hadoop.fs.LocalFileSystem.openRaw(LocalFileSystem.java:114)
        (rest of stack trace snipped)

Oddly enough, this map task ran on the same machine which produced the above error message. This it the output from the map task on the same machine:

060405 000210 task_m_1v749p  Child starting
060405 000211 Server connection on port 50050 from 127.0.0.1: starting
060405 000211 task_m_1v749p  Client connection to 0.0.0.0:50050: starting
060405 000211 task_m_1v749p  Client connection to 10.10.0.3:9000: starting
060405 000211 task_m_1v749p Using URL normalizer: org.apache.nutch.net.BasicUrlNormalizer
060405 000211 Server connection on port 50050 from 127.0.0.1: starting
060405 000211 task_m_1v749p  Client connection to 0.0.0.0:50050: starting
060405 000211 task_m_1v749p 1.0% /user/nutch/urls/urllist.txt:2+2
060405 000211 Task task_m_1v749p is done.
        (parsing lines snipped for brevity)


All map tasks finish with the output above, however none of my reduce tasks are finishing. The problem exists when the map task, and the corresponding reduce task which depends on the map's output, are run on the same machine or different machines. In both cases I see an IPC timeout Exception being thrown, 1 minute (60,000 ms, as specified in the hadoop-default.xml file) after the above "FileNotFound" exception is generated.

Does anyone have any pointers as to where I should look to determine the reason the map output is not being generated, or is not able to be accessed?

Regards,
-Shawn


-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to