Markus Jelsma created NUTCH-2093:
------------------------------------

             Summary: Indexing filters have no signature in CrawlDatum if 
crawled via FreeGenerator
                 Key: NUTCH-2093
                 URL: https://issues.apache.org/jira/browse/NUTCH-2093
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 1.10
            Reporter: Markus Jelsma
            Priority: Minor
             Fix For: 1.11
         Attachments: NUTCH-2093.patch

In IndexerMapReduce, a fetchDatum is passed to the indexing filters. However, 
when this fetchDatum was created via FreeGenerator, it has no signature 
attached, and indexing filters don't see it.

This patch copies the signature from the dbDatum just before passed to indexing 
filters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to