Sebastian Nagel created NUTCH-1775:
--------------------------------------

             Summary: IndexingFilter: document origin of passed CrawlDatum
                 Key: NUTCH-1775
                 URL: https://issues.apache.org/jira/browse/NUTCH-1775
             Project: Nutch
          Issue Type: Improvement
          Components: indexer
    Affects Versions: 1.8
            Reporter: Sebastian Nagel
            Priority: Trivial
             Fix For: 1.9
         Attachments: NUTCH-1775-trunk.patch

Only the fetch datum from the segment is passed to IndexingFilters; the datum 
from the CrawlDb is not available to them. This should be documented because 
the fetch datum and the db datum can differ in subtle ways (e.g., in fetch 
time).
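
A minimal sketch of the situation, for illustration only. It does not use the 
Nutch API: Datum, Filter and index() are hypothetical stand-ins, and the fetch 
times are made-up values. It only shows that a filter handed the fetch datum 
sees the segment's value, not the CrawlDb's.

```java
import java.util.HashMap;
import java.util.Map;

public class DatumOriginSketch {

    // Hypothetical stand-in for a crawl datum; only the fields needed here.
    static final class Datum {
        final String origin;   // where the datum came from
        final long fetchTime;  // made-up timestamp in milliseconds

        Datum(String origin, long fetchTime) {
            this.origin = origin;
            this.fetchTime = fetchTime;
        }
    }

    // Hypothetical stand-in for an indexing filter: it receives exactly one datum.
    interface Filter {
        void filter(Map<String, Object> doc, Datum datum);
    }

    // Mimics the behaviour described in this issue: both datums exist at
    // indexing time, but only the fetch datum is handed to the filter.
    static Map<String, Object> index(Datum fetchDatum, Datum dbDatum, Filter filter) {
        Map<String, Object> doc = new HashMap<>();
        filter.filter(doc, fetchDatum); // dbDatum is deliberately not passed on
        return doc;
    }

    public static void main(String[] args) {
        Datum fetchDatum = new Datum("segment", 1_000L);
        Datum dbDatum = new Datum("crawldb", 2_000L);

        Map<String, Object> doc = index(fetchDatum, dbDatum, (d, datum) -> {
            d.put("origin", datum.origin);
            d.put("fetchTime", datum.fetchTime);
        });

        // prints "segment 1000"
        System.out.println(doc.get("origin") + " " + doc.get("fetchTime"));
    }
}
```

A filter that needs a value from the CrawlDb therefore cannot assume that the 
datum it receives carries it.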



--
This message was sent by Atlassian JIRA
(v6.2#6252)