On Sun, Jan 13, 2013 at 12:47 PM, Tejas Patil <[email protected]> wrote:
>
> Well, if you know that the front page is updated frequently, set
> "db.fetch.interval.default" to lower value so that urls will be eligible
> for re-fetch sooner. By default, if a url is fetched successfully, it
> becomes eligible for re-fetching after 30 days

Very clear! In summary, Nutch cannot tell on its own whether a page has been
updated, so if a page is updated frequently we should set
"db.fetch.interval.default" to a lower value so that the page is re-fetched
sooner.

Thanks so much!

--
wassalam,
[bayu]
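
For anyone finding this thread later, here is a rough sketch of how that override might look in conf/nutch-site.xml. The value 86400 (one day) is only an illustrative assumption, not a recommendation. The interval is given in seconds, so the 30-day default corresponds to 2592000.

```xml
<!-- conf/nutch-site.xml: overrides values from nutch-default.xml -->
<configuration>
  <property>
    <name>db.fetch.interval.default</name>
    <!-- Assumed example value: 86400 seconds = 1 day (default is 2592000 = 30 days) -->
    <value>86400</value>
  </property>
</configuration>
```

Keep in mind that this is the default interval for every URL in the crawl db, not just the front page, so a lower value means the rest of the site also becomes eligible for re-fetch sooner.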

