Hello Martin. I think that the problem is with httpclient protocol, i have this problem too and if i do parsechecker or indexchecker it happend also. In my context it occours when the certificate es self signed, yo can see http://mail-archives.apache.org/mod_mbox/nutch-user/201401.mbox/%[email protected]%3E you can make the changes and compile again this plugin and try one more time. I have and alternative using httpclient protocol of nutch 1.5.1 version, in this it not occours. Please any progress write to the list again.
----- Mensaje original ----- De: "Martin Krauss" <[email protected]> Para: [email protected] Enviados: Lunes, 23 de Febrero 2015 8:17:49 Asunto: [MASSMAIL]Error SSLHandshakeException Crawling sites with https Hello, when crawling some sites with https I get the error below: Other https sites work o.k. Pls help Martin fetching: https://www.sit.de Fetch failed with protocol status: exception(16), lastModified=0: javax.net.ssl.SSLHandshakeException: Remote host closed connection during handshake 2015-02-20 15:35:14,889 INFO parse.ParserChecker - fetching: https://www.sit.de 2015-02-20 15:35:15,520 INFO httpclient.Http - http.proxy.host = null 2015-02-20 15:35:15,521 INFO httpclient.Http - http.proxy.port = 8080 2015-02-20 15:35:15,522 INFO httpclient.Http - http.timeout = 10000 2015-02-20 15:35:15,522 INFO httpclient.Http - http.content.limit = 65536 2015-02-20 15:35:15,522 INFO httpclient.Http - http.agent = SIT_NUTCH_SPIDER/Nutch-1.8 2015-02-20 15:35:15,522 INFO httpclient.Http - http.accept.language = en-us,en-gb,en;q=0.7,*;q=0.3 2015-02-20 15:35:15,522 INFO httpclient.Http - http.accept = text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8 2015-02-20 15:35:16,232 ERROR httpclient.Http - Failed to get protocol output javax.net.ssl.SSLHandshakeException: Remote host closed connection during handshake at sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:869) at sun.security.ssl.SSLSocketImpl.performInitialHandshake(SSLSocketImpl.java:1190) at sun.security.ssl.SSLSocketImpl.writeRecord(SSLSocketImpl.java:657) at sun.security.ssl.AppOutputStream.write(AppOutputStream.java:108) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:82) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:140) at org.apache.commons.httpclient.HttpConnection.flushRequestOutputStream(HttpConnection.java:828) at org.apache.commons.httpclient.MultiThreadedHttpConnectionManager$HttpConnectionAdapter.flushRequestOutputStream(MultiThreadedHttpConnectionManager.java:1565) at org.apache.commons.httpclient.HttpMethodBase.writeRequest(HttpMethodBase.java:2116) at org.apache.commons.httpclient.HttpMethodBase.execute(HttpMethodBase.java:1096) at org.apache.commons.httpclient.HttpMethodDirector.executeWithRetry(HttpMethodDirector.java:398) at org.apache.commons.httpclient.HttpMethodDirector.executeMethod(HttpMethodDirector.java:171)Mit freundlichen Grüßen Martin Krauß Gottlieb-Daimler-Schule 2 mit Abteilung Akademie für Datenverarbeitung Böblinger Straße 73 71065 Sindelfingen Phone: +49 (0)7031 6117-135 Fax: +49 (0)7031 6117-119 E-Mail: [email protected]

