[ 
https://issues.apache.org/jira/browse/NUTCH-2310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joey Hong updated NUTCH-2310:
-----------------------------
    Description: 
The protocol-selenium and protocol-interactiveselenium plugins raise errors 
whenever they are given a URL that uses the HTTPS scheme.

The source code for those plugins shows that HTTP is the only scheme 
currently accepted, which leaves Nutch unable to crawl HTTPS sites that rely 
on JavaScript using Selenium WebDriver. 

  was:The protocol-selenium and protocol-interactiveselenium plugins raise 
errors whenever there is a URL with the HTTPS protocol. From the source code 
for those plugins, we can see that HTTP is the only scheme currently accepted, 
which makes Nutch unable to crawl HTTPS sites with JS using Selenium 
Webdrivers. 


> Protocol-Selenium does not support HTTPS protocol
> -------------------------------------------------
>
>                 Key: NUTCH-2310
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2310
>             Project: Nutch
>          Issue Type: Bug
>          Components: protocol
>    Affects Versions: 1.12
>            Reporter: Joey Hong
>              Labels: easyfix
>             Fix For: 1.13
>
>   Original Estimate: 48h
>  Remaining Estimate: 48h
>
> The protocol-selenium and protocol-interactiveselenium plugins raise errors 
> whenever they are given a URL that uses the HTTPS scheme.
> The source code for those plugins shows that HTTP is the only scheme 
> currently accepted, which leaves Nutch unable to crawl HTTPS sites that rely 
> on JavaScript using Selenium WebDriver. 
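
To illustrate the kind of scheme check described in the issue, here is a minimal sketch of a helper that accepts both HTTP and HTTPS. The class name SchemeCheck and the method isSupportedScheme are hypothetical and are not taken from the Nutch source; the actual plugin code and the eventual fix may look quite different.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper for illustration only; not part of the Nutch code base.
final class SchemeCheck {

    // Assumption: the fix amounts to accepting "https" alongside "http".
    private static final Set<String> SUPPORTED = Set.of("http", "https");

    private SchemeCheck() {}

    // Returns true when the URL parses and its scheme is http or https
    // (compared case-insensitively); returns false for anything else.
    static boolean isSupportedScheme(String url) {
        if (url == null) {
            return false;
        }
        try {
            String scheme = new URI(url).getScheme();
            return scheme != null
                    && SUPPORTED.contains(scheme.toLowerCase(Locale.ROOT));
        } catch (URISyntaxException e) {
            // Malformed input is treated as unsupported rather than rethrown.
            return false;
        }
    }
}
```

With a check like this, SchemeCheck.isSupportedScheme("https://example.com/") would be accepted, whereas a check that compares only against "http" would reject it, which matches the behaviour reported above.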



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
