Hmm, this was true before we had decent URL normalization. It should run fine 
although you can encounter SSL issues. But those SSL issues might also be in 
protocol-http, which now also supports SSL. You should be fine with either 
plugin.
Markus
 
-----Original message-----
> From:Joseph Naegele <[email protected]>
> Sent: Tuesday 8th March 2016 16:27
> To: [email protected]
> Subject: protocol-http or protocol-httpclient?
> 
> I'm using Nutch 1.11. The "plugin.includes" section of nutch-default.xml
> still states that the protocol-httpclient plugin may present intermittent
> problems. Is this still the case? What are the problems?
> 
> There doesn't appear to be any problem crawling HTTPS using the
> protocol-http plugin. Why do I need to use protocol-httpclient for crawling
> via HTTPS?
> 
> In short, I want to use the "correct" plugin because I am extending it to
> perform a bit of extra work. "Correct" in this case means:
> - The "recommended" of the two
> - Whichever can crawl both HTTP and HTTPS connections
> - Whichever performs better
> 
> Thanks,
> Joe
> 
> 

Reply via email to