Hi,

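Trunk has a feature for this: the indexer.skip.notmodified property. When it is set to true, the indexer skips records whose CrawlDb status is db_notmodified, so unchanged pages are not resubmitted to Solr.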
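
A minimal sketch of how this could look in conf/nutch-site.xml, assuming you run a build from trunk where the property is available. The description text is my own wording, not copied from nutch-default.xml, and the setting only has an effect for pages that the CrawlDb actually marks as db_notmodified:

```xml
<!-- conf/nutch-site.xml: local overrides for nutch-default.xml -->
<configuration>
  <property>
    <name>indexer.skip.notmodified</name>
    <!-- assumed to default to false; true asks the indexer to skip unchanged records -->
    <value>true</value>
    <!-- paraphrased description, for illustration only -->
    <description>Skip records with a db_notmodified status when indexing.</description>
  </property>
</configuration>
```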

Cheers 
 
-----Original message-----
> From: Max Dzyuba <[email protected]>
> Sent: Fri 24-Aug-2012 17:19
> To: [email protected]
> Subject: recrawl a URL?
> 
> Hello everyone,
> 
> I run a crawl command every day, but I don't want Nutch to submit an update
> to Solr if a particular page hasn't changed. How do I achieve that? Right
> now the value of db.fetch.interval.default doesn't seem to prevent the
> recrawl: updates are submitted to Solr as if the page had changed, even
> though I know for sure that it has not. This happens on every new crawl
> command.
> 
> Thanks in advance,
> 
> Max
> 
> 
