Hi All,
I am trying to setup Nutch0.8 on a single machine with Hadoop for
testing. I followed the setup instructions in the NutchHadoop tutorial,
and the setup seemed to work initially after making a change in the
log4j.properties file. (I used to see an error from the logger saying
that "Either file or DatePattern options are not set". I just changed
the hadoop.log.file param to /my/nutch/dir/logs/hadoop.log - that fixed
the problem).
However, after running a couple of small crawls, I seem to be having a
lot of rpc related problems. I am seeing the "rpc timed out" message
often when I try to start a crawl using the crawl tool. After running
turning the Debug logging, I saw the following exceptions in the log
file multiple times:
2006-07-28 09:36:29,917 DEBUG ipc.RPC - Call: pollForTaskWithClosedJob 1
2006-07-28 09:36:36,460 DEBUG util.ThreadedServer - IGNORED
java.net.SocketTimeoutException: Accept timed out
at java.net.PlainSocketImpl.socketAccept(Native Method)
at java.net.PlainSocketImpl.accept(PlainSocketImpl.java:353)
at java.net.ServerSocket.implAccept(ServerSocket.java:448)
at java.net.ServerSocket.accept(ServerSocket.java:419)
at
org.mortbay.util.ThreadedServer.acceptSocket(ThreadedServer.java:432)
at
org.mortbay.util.ThreadedServer$Acceptor.run(ThreadedServer.java:631)
2006-07-28 09:36:36,488 INFO ipc.Server - Server connection on port
9000 from 10.50.30.230: starting. Number of active conne
ctions: 3
2006-07-28 09:36:36,490 INFO ipc.Client - Client connection to
10.50.30.230:9000: starting
2006-07-28 09:36:36,490 DEBUG ipc.Client - Client connection to
10.50.30.230:9000 sending #0
I compiled and am running Nutch/Hadoop using j2sdk1.4.2_05.
I'd appreciate any help I can get in getting the setup to work.
Thanks!
-vishal.
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general