[
https://issues.apache.org/jira/browse/NUTCH-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1775:
-----------------------------------
Attachment: NUTCH-1775-trunk.patch
> IndexingFilter: document origin of passed CrawlDatum
> ----------------------------------------------------
>
> Key: NUTCH-1775
> URL: https://issues.apache.org/jira/browse/NUTCH-1775
> Project: Nutch
> Issue Type: Improvement
> Components: indexer
> Affects Versions: 1.8
> Reporter: Sebastian Nagel
> Priority: Trivial
> Fix For: 1.9
>
> Attachments: NUTCH-1775-trunk.patch
>
>
> Only the fetch datum from segment is passed to IndexingFilters, the datum
> from CrawlDb is not available to IndexingFilters. This fact should be
> documented because there may be subtle differences between fetch and db datum
> (e.g., fetch time).
--
This message was sent by Atlassian JIRA
(v6.2#6252)