Can you connect to it (telnet to it, for example) directly from the machine(s) 
where you are running Nutch?
(this is a network issue, nothing to do with XML/parsing)


Maybe you need to go through some eBay proxy?

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch


----- Original Message ----
> From: "Del Rio, Ann" <[EMAIL PROTECTED]>
> To: [email protected]
> Sent: Friday, May 30, 2008 6:24:01 PM
> Subject: Indexing XML-based document format per DITA standard
> 
> I added a new URL to index which is in a XML-based document format per
> DITA standard and I get the following error.
> 
> java.net.SocketException: Connection reset
> 2008-05-27 17:56:58 ERROR Http                 at
> java.net.SocketInputStream.read(SocketInputStream.java:168)
> 2008-05-27 17:56:58 ERROR Http                 at
> java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
> 2008-05-27 17:56:58 ERROR Http                 at
> java.io.BufferedInputStream.read(BufferedInputStream.java:235)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpParser.readRawLine(HttpParser.java:77)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpParser.readLine(HttpParser.java:105)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpConnection.readLine(HttpConnection.jav
> a:1115)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpCon
> nectionAdapter.readLine(MultiThreadedHttpConnectionManager.java:1373)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpMethodBase.readStatusLine(HttpMethodBa
> se.java:1832)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpMethodBase.readResponse(HttpMethodBase
> .java:1590)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java
> :995)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMe
> thodDirector.java:397)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMetho
> dDirector.java:170)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:3
> 96)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.commons.httpclient.HttpClient.executeMethod(HttpClient.java:3
> 24)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.nutch.protocol.httpclient.HttpResponse.(HttpResponse.ja
> va:96)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.nutch.protocol.httpclient.Http.getResponse(Http.java:99)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
> ava:219)
> 2008-05-27 17:56:58 ERROR Http                 at
> org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:145)
> 2008-05-27 17:56:58 INFO  Fetcher              fetch of
> http://v4:10000/lib   failed with:
> java.net.SocketException: Connection reset
> 
> i googled and found no solution so far...
> 
> do i need to setup some config / host file to specify the ports?
> the URL is an internal website.
> 
> any response will be appreciated.
> 
> Thanks,
> Ann Del Rio
> Senior Developer
> eBay, Inc

Reply via email to