[
https://issues.apache.org/jira/browse/NUTCH-2273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15328015#comment-15328015
]
Lewis John McGibbney commented on NUTCH-2273:
---------------------------------------------
Thanks [~bmzhao] nice catch. I will put a patch together later unless you beat
me to it.
> Selenium and InteractiveSelenium Do Not Support HTTPS
> -----------------------------------------------------
>
> Key: NUTCH-2273
> URL: https://issues.apache.org/jira/browse/NUTCH-2273
> Project: Nutch
> Issue Type: Bug
> Components: plugin
> Affects Versions: 1.11
> Reporter: Brian Zhao
> Assignee: Lewis John McGibbney
>
> Both Selenium and InteractiveSelenium plugins do not have the https protocol
> specified in their plugin.xml, and will not fetch https links.
> To fix for the Selenium plugin you should add:
>
> <implementation id="org.apache.nutch.protocol.selenium.Http"
> class="org.apache.nutch.protocol.selenium.Http">
> <parameter name="protocolName" value="https"/>
> </implementation>
> to Selenium's plugin.xml (as a child element of the "extension" element)
> An implementation already exists in protocol-http HttpResponse.java, and I've
> merged it into selenium's HttpResponse.java here: http://pastebin.com/ZAPfwee4
> This should probably be similarly done for the InteractiveSelenium plugin.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)