Sebastian Nagel created NUTCH-1775:
--------------------------------------
Summary: IndexingFilter: document origin of passed CrawlDatum
Key: NUTCH-1775
URL: https://issues.apache.org/jira/browse/NUTCH-1775
Project: Nutch
Issue Type: Improvement
Components: indexer
Affects Versions: 1.8
Reporter: Sebastian Nagel
Priority: Trivial
Fix For: 1.9
Attachments: NUTCH-1775-trunk.patch
Only the fetch datum from the segment is passed to IndexingFilters; the datum
from the CrawlDb is not available to them. This should be documented because
there may be subtle differences between the fetch datum and the db datum (e.g.,
the fetch time).
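
To illustrate the point, here is a minimal, self-contained sketch. It does not use the Nutch API: Datum, Filter, and index are hypothetical stand-ins invented for this example, and the timestamps are arbitrary. It only models the behaviour described above, where a filter receives the fetch datum and never sees the db datum.

```java
import java.util.ArrayList;
import java.util.List;

public class FetchDatumSketch {

    /** Hypothetical stand-in for CrawlDatum: only the fields needed here. */
    static final class Datum {
        final String origin;   // "segment" (fetch datum) or "crawldb" (db datum)
        final long fetchTime;  // epoch milliseconds, illustrative values only

        Datum(String origin, long fetchTime) {
            this.origin = origin;
            this.fetchTime = fetchTime;
        }
    }

    /** Hypothetical stand-in for an indexing filter: it is given exactly one datum. */
    interface Filter {
        void filter(String url, Datum datum, List<String> fields);
    }

    /** Models the indexing step: both datums exist, but only the fetch datum is handed on. */
    static List<String> index(String url, Datum fetchDatum, Datum dbDatum, Filter filter) {
        List<String> fields = new ArrayList<>();
        filter.filter(url, fetchDatum, fields); // dbDatum is deliberately not passed
        return fields;
    }

    public static void main(String[] args) {
        Datum fetch = new Datum("segment", 1_000L);
        Datum db = new Datum("crawldb", 2_000L); // differs from the fetch datum

        List<String> fields = index("http://example.org/", fetch, db,
                (url, datum, out) -> out.add(datum.origin + ":" + datum.fetchTime));

        System.out.println(fields); // prints [segment:1000]
    }
}
```

A filter written against this contract that needs a value from the db datum would have to obtain it some other way, which is why the origin of the passed datum is worth stating in the documentation.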