Hi,

Does "Fetch time" in CrawlDatum really represent "Next fetch time"?

Example:
The URL below was just fetched.  After that bin/nutch readdb was run:

$ bin/nutch readdb /user/foo/crawl/crawldb -url http://www.foobar.com/

URL: http://www.foobar.com/
Version: 6
Status: 6 (db_notmodified)
Fetch time: Fri May 09 17:17:31 EDT 2008          <---- NOTE: 30 days from now??
Modified time: Wed Dec 31 19:00:00 EST 1969
Retries since fetch: 0
Retry interval: 2592000 seconds (30 days)
Score: 3.955374E-8
Signature: f3ee31dcfde9ca40f4ed4a4e1bf66e24
Metadata: _pst_:temp_moved(13), lastModified=0: http://foobar.com/
 

Either the above "Fetch time" is off by 1 month, or the above "Fetch time" 
should really be labeled "Next fetch fime".
Looking at CrawlDatum, it looks like it's the later.  Is that so?

Thanks,
Otis

--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch


Reply via email to