Hi,

Does "Fetch time" in CrawlDatum really represent "Next fetch time"?

Example: The URL below was just fetched. After that, bin/nutch readdb was run:

$ bin/nutch readdb /user/foo/crawl/crawldb -url http://www.foobar.com/
URL: http://www.foobar.com/
Version: 6
Status: 6 (db_notmodified)
Fetch time: Fri May 09 17:17:31 EDT 2008    <---- NOTE: 30 days from now??
Modified time: Wed Dec 31 19:00:00 EST 1969
Retries since fetch: 0
Retry interval: 2592000 seconds (30 days)
Score: 3.955374E-8
Signature: f3ee31dcfde9ca40f4ed4a4e1bf66e24
Metadata: _pst_:temp_moved(13), lastModified=0: http://foobar.com/

Either the above "Fetch time" is off by 1 month, or it should really be labeled "Next fetch time". Looking at CrawlDatum, it looks like it's the latter. Is that so?

Thanks,
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
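
A quick sanity check of the arithmetic behind the question, as a minimal Python sketch. It is not Nutch code: it only assumes, hypothetically, that the reported "Fetch time" equals the actual fetch time plus the retry interval, and uses the two values copied from the readdb output above. The variable names are made up for illustration.

```python
from datetime import datetime, timedelta

# Values copied from the readdb output above (EDT wall-clock time, no timezone attached).
reported_fetch_time = datetime(2008, 5, 9, 17, 17, 31)   # "Fetch time"
retry_interval_seconds = 2592000                          # "Retry interval"

retry_interval = timedelta(seconds=retry_interval_seconds)

# Assumption being tested: reported "Fetch time" = actual fetch + retry interval,
# so subtracting the interval should give the moment the page was really fetched.
implied_last_fetch = reported_fetch_time - retry_interval

print("Retry interval in days:", retry_interval.days)
print("Implied last fetch:   ", implied_last_fetch.isoformat())
```

Under that assumption the interval works out to exactly 30 days and the implied fetch lands on 2008-04-09 at 17:17:31, which is consistent with a "Fetch time" that sits 30 days after a fetch that just happened.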
