Can you connect to it (telnet to it, for example) directly from the machine(s) where you are running Nutch? (this is a network issue, nothing to do with XML/parsing)
Maybe you need to go through some eBay proxy? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- > From: "Del Rio, Ann" <[EMAIL PROTECTED]> > To: [email protected] > Sent: Friday, May 30, 2008 6:24:01 PM > Subject: Indexing XML-based document format per DITA standard > > I added a new URL to index which is in a XML-based document format per > DITA standard and I get the following error. > > java.net.SocketException: Connection reset > 2008-05-27 17:56:58 ERROR Http at > java.net.SocketInputStream.read(SocketInputStream.java:168) > 2008-05-27 17:56:58 ERROR Http at > java.io.BufferedInputStream.fill(BufferedInputStream.java:218) > 2008-05-27 17:56:58 ERROR Http at > java.io.BufferedInputStream.read(BufferedInputStream.java:235) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpParser.readRawLine(HttpParser.java:77) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpParser.readLine(HttpParser.java:105) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpConnection.readLine(HttpConnection.jav > a:1115) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpCon > nectionAdapter.readLine(MultiThreadedHttpConnectionManager.java:1373) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpMethodBase.readStatusLine(HttpMethodBa > se.java:1832) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpMethodBase.readResponse(HttpMethodBase > .java:1590) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java > :995) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMe > thodDirector.java:397) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMetho > dDirector.java:170) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:3 > 96) > 2008-05-27 17:56:58 ERROR Http at > org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:3 > 24) > 2008-05-27 17:56:58 ERROR Http at > org.apache.nutch.protocol.httpclient.HttpResponse.(HttpResponse.ja > va:96) > 2008-05-27 17:56:58 ERROR Http at > org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:99) > 2008-05-27 17:56:58 ERROR Http at > org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j > ava:219) > 2008-05-27 17:56:58 ERROR Http at > org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:145) > 2008-05-27 17:56:58 INFO Fetcher fetch of > http://v4:10000/lib failed with: > java.net.SocketException: Connection reset > > i googled and found no solution so far... > > do i need to setup some config / host file to specify the ports? > the URL is an internal website. > > any response will be appreciated. > > Thanks, > Ann Del Rio > Senior Developer > eBay, Inc
