Hi Fred,

Nutch does nothing "proactively", the crawl jobs must be explicitly called.
But you need no special command:
- let's say the you didn't change the defaults and
  db.fetch.interval.default == 30 days
- if you launch bin/crawl one month later, all pages are refetched,
  and optionally reindexed (404s removed)
- just to clarify: new segments will be created, old segments can be
  removed, except you need same to recover eg. if the index is lost

Best,
Sebastian

On 04/09/2018 09:13 PM, Fred Zimmerman wrote:
> When I run bin/crawl once and it generates a segment list with a bunch of
> fetch dates in the future, does nutch proactively run those fetches on
> those future dates, or do I have to do something to make that happen?
> 

Reply via email to