[
https://issues.apache.org/jira/browse/NUTCH-2447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16215055#comment-16215055
]
Markus Jelsma edited comment on NUTCH-2447 at 10/23/17 12:36 PM:
-----------------------------------------------------------------
Patch for master! Keep in mind, this only works for protocol-http!
was (Author: markus17):
Patch for master! Keep in mind, this only work for protocol-http!
> Work-around SSLProtocolException: handshake alert: unrecognized_name
> --------------------------------------------------------------------
>
> Key: NUTCH-2447
> URL: https://issues.apache.org/jira/browse/NUTCH-2447
> Project: Nutch
> Issue Type: Bug
> Components: protocol
> Affects Versions: 1.13
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Priority: Critical
> Fix For: 1.14
>
> Attachments: NUTCH-2447.patch
>
>
> Nutch is unable to crawl some websites, regardless of protocol plugin you are
> using. The work-around you frequently find (-Djsse.enableSNIExtension=false)
> does not work at all, so the internet is clearly lying to us!
> {code}
> 2017-10-23 12:43:52,911 INFO api.HttpRobotRulesParser - Couldn't get
> robots.txt for https://www.eidsiva.net/: javax.net.ssl.SSLProtocolException:
> handshake alert: unrecognized_name
> 2017-10-23 12:43:53,011 ERROR http.Http - Failed to get protocol output
> javax.net.ssl.SSLProtocolException: handshake alert: unrecognized_name
> at
> sun.security.ssl.ClientHandshaker.handshakeAlert(ClientHandshaker.java:1446)
> at sun.security.ssl.SSLSocketImpl.recvAlert(SSLSocketImpl.java:2016)
> at sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:1125)
> at
> sun.security.ssl.SSLSocketImpl.performInitialHandshake(SSLSocketImpl.java:1375)
> at
> sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1403)
> at
> sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1387)
> at
> org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:152)
> at org.apache.nutch.protocol.http.Http.getResponse(Http.java:72)
> at
> org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.java:271)
> at org.apache.nutch.fetcher.FetcherThread.run(FetcherThread.java:327)
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)