Avoid cloning CrawlDatum in CrawlDbReducer
--
Key: NUTCH-761
URL: https://issues.apache.org/jira/browse/NUTCH-761
Project: Nutch
Issue Type: Improvement
Reporter: Julien Nioche
[
https://issues.apache.org/jira/browse/NUTCH-761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-761:
Attachment: optiCrawlReducer.patch
Avoid cloning CrawlDatum in CrawlDbReducer
[
https://issues.apache.org/jira/browse/NUTCH-762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-762:
Attachment: NUTCH-762-MultiGenerator.patch
Patch for the MultiGenerator
Alternative Generator
Alternative Generator which can generate several segments in one parse of the
crawlDB
-
Key: NUTCH-762
URL: https://issues.apache.org/jira/browse/NUTCH-762
Project: