Hi,

I'm using the AdaptiveFetchSchedule to crawl my sites. So far, it works as
expected: pages are re-crawled at different intervals depending on whether
they were modified since the last visit.

But I have trouble understanding the "db.fetch.schedule.adaptive.sync_delta"
setting. When that setting is false, the next fetch is always set to
fetchtime + interval. When it is true, the next fetch is set to
reftime + interval instead. The reftime is computed by subtracting a
percentage of the delta (fetch time - modified time) from the fetch time.
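
To make sure I am describing the behaviour correctly, here is a minimal
sketch of the computation as I understand it. It is not the Nutch source:
the class name, the method name and the syncDeltaRate parameter (my name for
the "percentage") are assumptions for illustration, and all times are in
milliseconds.

```java
// Sketch only: mirrors the description above, not the actual Nutch code.
// NextFetchSketch, nextFetch and syncDeltaRate are hypothetical names.
class NextFetchSketch {

    // All arguments are in milliseconds, except syncDeltaRate (0.0 to 1.0).
    static long nextFetch(long fetchTime, long modifiedTime, long interval,
                          boolean syncDelta, double syncDeltaRate) {
        long refTime = fetchTime;
        if (syncDelta) {
            // Time elapsed between the page change and the fetch.
            long delta = fetchTime - modifiedTime;
            // Move the reference point back towards the modification time.
            refTime = fetchTime - Math.round(delta * syncDeltaRate);
        }
        return refTime + interval;
    }

    public static void main(String[] args) {
        long fetchTime = 1_000_000L;
        long modifiedTime = 400_000L;
        long interval = 100_000L;

        // sync_delta off: fetchtime + interval = 1100000
        System.out.println(nextFetch(fetchTime, modifiedTime, interval, false, 0.25));
        // sync_delta on, 25% of the 600000 ms delta: 850000 + 100000 = 950000
        System.out.println(nextFetch(fetchTime, modifiedTime, interval, true, 0.25));
    }
}
```

Note that in the second case the computed next fetch (950000) is earlier
than the fetch that just happened (1000000), which is part of what confuses
me.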

Can someone explain to me in which cases this setting is useful?
-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/Adaptive-sync-with-the-time-of-page-change-tp870842p870842.html
Sent from the Nutch - User mailing list archive at Nabble.com.