DeleteDuplicates based on crawlDB only
---
Key: NUTCH-656
URL: https://issues.apache.org/jira/browse/NUTCH-656
Project: Nutch
Issue Type: Wish
Components: indexer
Reporter: julien
[
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-442:
Component/s: searcher
indexer
Fix Version/s: 1.0.0
Integrate Solr/Nutch
[
https://issues.apache.org/jira/browse/NUTCH-656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
julien nioche reopened NUTCH-656:
-
I suppose that the SOLR dedup mechanism is valid on a single instance. If the
documents are
[
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney reassigned NUTCH-442:
---
Assignee: Doğacan Güney
Integrate Solr/Nutch
Key:
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/595/changes
--
started
Building remotely on lucene.zones.apache.org
ERROR: svn: timed out waiting for server
svn: OPTIONS request failed on '/repos/asf/lucene/nutch/trunk'