Hello.
I want to use nutch 1.9 but there are some things that i don´t understand 
because i was using nutch 1.5.1 before and some things are changed in nutch 1.9.
Sorry if is a basic things.
Some questions:

1- How i can do a crawl process without solr parameter like in nutch 1.5.1 that 
the spider jump this step if i don´t set solr parameter ?

2- It is possible to use topN or similar parameter in nutch 1.9 or every round 
include all link in crawldb ?

3- I have activated httpclient plugin and when i crawl a website that use https 
protocol i get this error in the output console 
*********************************
fetch of https://dragones.uci.cu/ failed with: 
javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: 
PKIX path building failed: 
sun.security.provider.certpath.SunCertPathBuilderException: unable to find 
valid certification path to requested target

parsechecker tool throw similar error.

Please any suggestion or advice will be appreciated.


---------------------------------------------------
XII Aniversario de la creación de la Universidad de las Ciencias Informáticas. 
12 años de historia junto a Fidel. 12 de diciembre de 2014.

Reply via email to