Thank you for the reply. Does that mean it is not supported in the latest
stable release of Nutch?


-----Original Message-----
From: Markus Jelsma [mailto:[email protected]] 
Sent: 24 August 2012 17:21
To: [email protected]; Max Dzyuba
Subject: RE: recrawl a URL?

Hi,

Trunk has a feature for this: indexer.skip.notmodified
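
A minimal sketch of how this might be enabled, assuming the property is
set in conf/nutch-site.xml like any other Nutch override and that it is a
boolean which defaults to false. The description text below is a
paraphrase, not copied from nutch-default.xml:

```xml
<!-- conf/nutch-site.xml (assumed location for local overrides) -->
<configuration>
  <property>
    <name>indexer.skip.notmodified</name>
    <!-- assumption: true makes the indexer skip unmodified pages -->
    <value>true</value>
    <description>Skip records that the CrawlDb marks as not modified,
    so they are not re-submitted to the index.</description>
  </property>
</configuration>
```

This only helps if Nutch actually marks the page as not modified on the
refetch, so it is worth checking the page's status in the CrawlDb first.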

Cheers 
 
-----Original message-----
> From:Max Dzyuba <[email protected]>
> Sent: Fri 24-Aug-2012 17:19
> To: [email protected]
> Subject: recrawl a URL?
> 
> Hello everyone,
> 
> I run a crawl command every day, but I don't want Nutch to submit an 
> update to Solr if a particular page hasn't changed. How do I achieve 
> that? Right now the value of db.fetch.interval.default doesn't seem to 
> prevent the recrawl, since updates are submitted to Solr as if the 
> page had changed. I know for sure that the page has not changed. This 
> happens on every new crawl command.
> 
> Thanks in advance,
> 
> Max
> 
> 
