This socket exception normally comes , if fetcher is not able to get the page to crawl?? I mean there is some problem with the server connection. if you r crawling for local stored pages, then check whether the server is started or not??
I have tested the same for my local crawl, but for internet specific crawl I don't have enough idea?? Ratnesh V2Solutions India cha wrote: > > HI ppl, > > when i crawl my website , it is giving me following error , though > crawling is doing fine. > > Can anyone tell me what the error is about?? Do i have to set anything in > nutch-site.xml?? > > Following are the error logs: > > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? java.net.SocketTimeoutException: > Read timed out > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.SocketInputStream.socketRead0(Native Method) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.SocketInputStream.read(Unknown Source) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.BufferedInputStream.read1(Unknown Source) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.BufferedInputStream.read(Unknown Source) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.FilterInputStream.read(Unknown Source) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.PushbackInputStream.read(Unknown Source) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.FilterInputStream.read(Unknown Source) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.HttpResponse.readPlainContent(HttpResponse.java:214) > > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:146) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.Http.getResponse(Http.java:63) > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:208) > > [2007-04-04 16:23:21,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:144) > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? java.net.SocketTimeoutException: > Read timed out > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.SocketInputStream.socketRead0(Native Method) > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.SocketInputStream.read(Unknown Source) > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.BufferedInputStream.read1(Unknown Source) > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.BufferedInputStream.read(Unknown Source) > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.FilterInputStream.read(Unknown Source) > [2007-04-04 16:23:22,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.PushbackInputStream.read(Unknown Source) > [2007-04-04 16:23:22,062] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.io.FilterInputStream.read(Unknown Source) > [2007-04-04 16:23:22,062] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.HttpResponse.readPlainContent(HttpResponse.java:214) > > [2007-04-04 16:23:22,062] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:146) > [2007-04-04 16:23:22,062] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.Http.getResponse(Http.java:63) > [2007-04-04 16:23:22,062] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:208) > > [2007-04-04 16:23:22,062] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:144) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? java.net.SocketTimeoutException: > connect timed out > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.PlainSocketImpl.socketConnect(Native Method) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.PlainSocketImpl.doConnect(Unknown Source) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.PlainSocketImpl.connectToAddress(Unknown Source) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.PlainSocketImpl.connect(Unknown Source) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > java.net.SocksSocketImpl.connect(Unknown Source) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at java.net.Socket.connect(Unknown > Source) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:94) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.Http.getResponse(Http.java:63) > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:208) > > [2007-04-04 16:23:32,218] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? at > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:144) > [2007-04-04 16:23:33,046] [FetcherThread] ERROR > org.apache.nutch.protocol.http.Http:? > > > Pls do reply me asap. > > Regards, > cha > > -- View this message in context: http://www.nabble.com/ERROR-org.apache.nutch.protocol.http.Http%3A-java.net.SocketTimeoutException%3A-Read-timed-out-tf3525172.html#a9835316 Sent from the Nutch - User mailing list archive at Nabble.com. ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
