Hello,

updatedb in nutch-2.0 increases the fetch time of every page, regardless of 
whether the page has already been fetched.
For example, if updatedb is run at depth 1 and page A is fetched, its 
fetchTime is set to 30 days from now. Running updatedb again at depth 2 then 
moves page A's fetch time to 60 days from now, and so on.
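
To make the arithmetic concrete, here is a small Java sketch of what I am 
seeing versus what I expected. This is not Nutch code: the class and method 
names are made up for illustration, the 30-day interval is the one from the 
example, and it assumes both updatedb rounds run at about the same time.

```java
// Hypothetical illustration only; none of this is Nutch's actual updatedb code.
public class FetchTimeDrift {
    // Assumed fetch interval, taken from the 30-day example.
    static final int INTERVAL_DAYS = 30;

    // Behaviour I am seeing: every updatedb round adds the interval,
    // whether or not the page was fetched in that round.
    static int reported(boolean[] fetchedInRound) {
        int daysFromNow = 0;
        for (int i = 0; i < fetchedInRound.length; i++) {
            daysFromNow += INTERVAL_DAYS;
        }
        return daysFromNow;
    }

    // Behaviour I expected: the fetch time is only rescheduled in a round
    // where the page was actually fetched.
    static int expected(boolean[] fetchedInRound) {
        int daysFromNow = 0;
        for (boolean fetched : fetchedInRound) {
            if (fetched) {
                daysFromNow = INTERVAL_DAYS;
            }
        }
        return daysFromNow;
    }

    public static void main(String[] args) {
        // Page A: fetched at depth 1, not fetched again at depth 2.
        boolean[] pageA = {true, false};
        System.out.println("reported: " + reported(pageA) + " days"); // reported: 60 days
        System.out.println("expected: " + expected(pageA) + " days"); // expected: 30 days
    }
}
```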

Also, is it possible to use updatedb to remove pages that do not pass the 
filters from the HBase datastore?

Thanks.
Alex.
