I've noticed some of these types of errors keeping the
fetcher from ever gracefully completing (or delaying
the completion for a while)

Looks like ftp:// should exclude robots.txt (not used)
and that the error should be trapped and gracefully
logged :)


040512 181158 STS FetchList is empty
040512 181210 java.net.ConnectException: Connection
timed out
040512 181210   
java.net.PlainSocketImpl.socketConnect(Native Method)
040512 181210   
java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:305)
040512 181210   
java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:171)
040512 181210   
java.net.PlainSocketImpl.connect(PlainSocketImpl.java:158)
040512 181210   
java.net.Socket.connect(Socket.java:452)
040512 181210   
java.net.Socket.connect(Socket.java:402)
040512 181210   
java.net.Socket.<init>(Socket.java:309)
040512 181210   
java.net.Socket.<init>(Socket.java:153)
040512 181210   
org.apache.commons.net.DefaultSocketFactory.createSocket(DefaultSocketFactory.java:66)
040512 181210   
org.apache.commons.net.SocketClient.connect(SocketClient.java:140)
040512 181210   
org.apache.commons.net.SocketClient.connect(SocketClient.java:230)
040512 181210   
net.nutch.net.protocols.ftp.FtpResponse.<init>(FtpResponse.java:173)
040512 181210   
net.nutch.net.protocols.ftp.Ftp.getRawResponse(Ftp.java:150)
040512 181210   
net.nutch.fetcher.FetcherThread.run(FetcherThread.java:139)
040512 181210 UNE ftp://ftp.dl.ac.uk/robots.txt
net.nutch.net.protocols.ftp.FtpException:
java.net.ConnectException: Connection timed out


-------------------------------------------------------
This SF.Net email is sponsored by: SourceForge.net Broadband
Sign-up now for SourceForge Broadband and get the fastest
6.0/768 connection for only $19.95/mo for the first 3 months!
http://ads.osdn.com/?ad_id=2562&alloc_id=6184&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to