Hello Sid.
I use protocol-httpclient because, in my modest opinion, it handles https
websites better than protocol-http.
Since Java 1.7, with protocol-httpclient and Nutch 1.12, my problems with
self-signed certificates have gone away.
If you still have problems with websites that use self-signed certificates, you
may need to import those certificates into the Java keystore using the Portecle
tool, which you can download here: https://sourceforge.net/projects/portecle/
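
If you would rather script the import than use a GUI, here is a minimal sketch
of the same idea using only the standard java.security API. The class name
ImportCert, the alias, and the keystore path and password passed on the command
line are all placeholders of mine, not anything Nutch itself defines, and which
keystore your JVM actually consults depends on your setup.

```java
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.InputStream;
import java.io.OutputStream;
import java.security.KeyStore;
import java.security.cert.Certificate;
import java.security.cert.CertificateFactory;

// Hypothetical helper (not part of Nutch): adds one X.509 certificate
// to a Java keystore as a trusted entry.
public class ImportCert {

    /** Opens the keystore at path, or creates an empty one if path is null. */
    static KeyStore openKeyStore(String path, char[] password) throws Exception {
        KeyStore ks = KeyStore.getInstance(KeyStore.getDefaultType());
        if (path == null) {
            ks.load(null, password); // initialise an empty in-memory keystore
        } else {
            try (InputStream in = new FileInputStream(path)) {
                ks.load(in, password);
            }
        }
        return ks;
    }

    /** Reads a PEM- or DER-encoded certificate and stores it under alias. */
    static void addTrustedCert(KeyStore ks, String alias, InputStream certStream)
            throws Exception {
        CertificateFactory cf = CertificateFactory.getInstance("X.509");
        Certificate cert = cf.generateCertificate(certStream);
        ks.setCertificateEntry(alias, cert);
    }

    /** Writes the keystore to path, protected by password. */
    static void save(KeyStore ks, String path, char[] password) throws Exception {
        try (OutputStream out = new FileOutputStream(path)) {
            ks.store(out, password);
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 4) {
            System.err.println(
                "usage: ImportCert <keystore> <password> <alias> <cert-file>");
            return;
        }
        char[] password = args[1].toCharArray();
        KeyStore ks = openKeyStore(args[0], password);
        try (InputStream in = new FileInputStream(args[3])) {
            addTrustedCert(ks, args[2], in);
        }
        save(ks, args[0], password);
        System.out.println("Keystore now holds " + ks.size() + " entries");
    }
}
```

You would first export the site's certificate to a file (for example from your
browser) and then run the class against a copy of the keystore, so the original
stays untouched if something goes wrong.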

Best regards.



----- Original message -----
From: "Sadiki Latty" <[email protected]>
To: [email protected]
Sent: Tuesday, 28 November 2017 11:08:28
Subject: [MASSMAIL]Certificates

Hey all,

I have a question regarding self-signed certs. I will be using Nutch to crawl 
http and https sites, as well as using it to index to self-signed https Solr 
servers. I managed to add certificates to Solr and it fixed their inter-node 
communication, but I have yet to find where in Nutch I can do a similar 
configuration. I have seen articles saying that the protocol-httpclient plugin 
should be able to do it with some code modifications, but the caveat is that 
httpclient may have underlying bugs, so protocol-http is recommended. These 
articles were also almost 3 years old, so options may have evolved by now. Can 
someone provide some insight into what my next steps should be? Essentially, 
here are my questions:

1. Should I use protocol-http, protocol-httpclient or other?

2. Is there somewhere in a config file that I can tell Nutch to use a 
   java keystore file similar to Solr?

Thanks

Sid
