[ https://issues.apache.org/jira/browse/NUTCH-1038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-1038: ----------------------------------- Attachment: NUTCH-1038v2.patch Hi Lewis, it's a problem of the patch: the fetch time of a WebPage (unlike CrawlDatum) must be explicitly set. Good catch! Attached improved patch. > Port IndexingFiltersChecker to 2.0 > ---------------------------------- > > Key: NUTCH-1038 > URL: https://issues.apache.org/jira/browse/NUTCH-1038 > Project: Nutch > Issue Type: New Feature > Affects Versions: nutchgora > Reporter: Markus Jelsma > Fix For: 2.2 > > Attachments: NUTCH-1038.patch, NUTCH-1038v2.patch > > -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira