Jurian Broertjes created NUTCH-2242:
---------------------------------------
Summary: lastModified not always set
Key: NUTCH-2242
URL: https://issues.apache.org/jira/browse/NUTCH-2242
Project: Nutch
Issue Type: Bug
Components: crawldb
Affects Versions: 1.11
Reporter: Jurian Broertjes
Priority: Minor
I observed two issues:
- When using DefaultFetchSchedule, the CrawlDatum's modifiedTime field is not
updated on the first successful fetch.
- When a document modification is detected (by protocol or by signature),
modifiedTime is not updated either.
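The behaviour I would expect can be sketched roughly as follows. This is a
simplified, self-contained model and not Nutch's actual code: the class
ModifiedTimeSketch, the Datum and FetchState types, and the recordFetch method
are all hypothetical names, and using the fetch time as the modification time
is an assumption made for illustration.

```java
// Simplified, hypothetical model of the expected behaviour.
// This is NOT Nutch's CrawlDatum or FetchSchedule API.
public class ModifiedTimeSketch {

    /** Hypothetical result of comparing a fetch with the previous one. */
    enum FetchState { UNKNOWN, MODIFIED, NOT_MODIFIED }

    /** Minimal stand-in for a crawl record; 0 means "never set". */
    static class Datum {
        long fetchTime;
        long modifiedTime;
    }

    /**
     * Records a successful fetch at the given time.
     * modifiedTime is set when it has never been set before (first fetch)
     * or when the document was detected as modified. Using fetchTime as
     * the new modifiedTime is an assumption of this sketch.
     */
    static Datum recordFetch(Datum datum, long fetchTime, FetchState state) {
        boolean neverSet = datum.modifiedTime <= 0;
        if (neverSet || state == FetchState.MODIFIED) {
            datum.modifiedTime = fetchTime;
        }
        datum.fetchTime = fetchTime;
        return datum;
    }

    public static void main(String[] args) {
        Datum d = new Datum();

        recordFetch(d, 1000L, FetchState.UNKNOWN);       // first successful fetch
        System.out.println(d.modifiedTime);              // 1000

        recordFetch(d, 2000L, FetchState.NOT_MODIFIED);  // unchanged document
        System.out.println(d.modifiedTime);              // 1000

        recordFetch(d, 3000L, FetchState.MODIFIED);      // modification detected
        System.out.println(d.modifiedTime);              // 3000
    }
}
```

In this sketch both cases from the list above end up with a non-zero
modifiedTime, while an unmodified re-fetch leaves the earlier value in place.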
I can provide a patch later today.