[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508748
]
Hudson commented on NUTCH-498:
--
Integrated in Nutch-Nightly #131 (See
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508505
]
Doğacan Güney commented on NUTCH-498:
-
I tested creating a linkdb from ~6M urls:
Combine input records
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508506
]
Andrzej Bialecki commented on NUTCH-498:
-
+1.
Use Combiner in LinkDb to increase speed of linkdb generation
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508508
]
Sami Siren commented on NUTCH-498:
--
+1
Use Combiner in LinkDb to increase speed of linkdb generation
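For readers unfamiliar with the idea in the issue title, here is a rough, plain-Java sketch of what a combiner buys you. It does not use the Hadoop or Nutch APIs, and the class name CombinerSketch, the methods merge and mapOutput, and the sample URLs are all made up for illustration. It assumes the merge step is a plain set union of inlinks per target URL, which is what would make it safe to run the same function both map-side (as a combiner) and reduce-side, as suggested elsewhere in this thread.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

public class CombinerSketch {

    // Merge (target URL -> inlink sources) records that share a target.
    // Set union is associative, commutative and idempotent, so this one
    // function can serve as both the combiner and the reducer.
    // (Assumption: the real LinkDb merge may do more than a plain union.)
    static Map<String, Set<String>> merge(List<Map<String, Set<String>>> parts) {
        Map<String, Set<String>> out = new TreeMap<>();
        for (Map<String, Set<String>> part : parts) {
            for (Map.Entry<String, Set<String>> e : part.entrySet()) {
                out.computeIfAbsent(e.getKey(), k -> new TreeSet<>()).addAll(e.getValue());
            }
        }
        return out;
    }

    // Hypothetical map output: one single-inlink record per {target, source} pair.
    static List<Map<String, Set<String>>> mapOutput(String[][] pairs) {
        List<Map<String, Set<String>>> records = new ArrayList<>();
        for (String[] p : pairs) {
            Map<String, Set<String>> rec = new TreeMap<>();
            rec.put(p[0], new TreeSet<>(Arrays.asList(p[1])));
            records.add(rec);
        }
        return records;
    }

    public static void main(String[] args) {
        // Two simulated map tasks with made-up URLs.
        List<Map<String, Set<String>>> task1 = mapOutput(new String[][] {
            {"http://b/", "http://a/1"}, {"http://b/", "http://a/2"}, {"http://c/", "http://a/1"}});
        List<Map<String, Set<String>>> task2 = mapOutput(new String[][] {
            {"http://b/", "http://a/3"}, {"http://b/", "http://a/1"}});

        // Without a combiner, every map output record goes through the shuffle.
        int shuffledWithout = task1.size() + task2.size();

        // With a combiner, each map task merges its own records first.
        Map<String, Set<String>> combined1 = merge(task1);
        Map<String, Set<String>> combined2 = merge(task2);
        int shuffledWith = combined1.size() + combined2.size();

        // The reduce side applies the same merge to the partial results.
        Map<String, Set<String>> result = merge(Arrays.asList(combined1, combined2));

        System.out.println("records shuffled without combiner: " + shuffledWithout);
        System.out.println("records shuffled with combiner: " + shuffledWith);
        System.out.println(result);
    }
}
```

In this toy input the shuffle drops from 5 records to 3 while the final inlink sets stay the same. How much a real job gains depends on how often a target URL repeats within a single map task.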
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505454
]
Doğacan Güney commented on NUTCH-498:
-
Currently there is no difference, indeed. The version in LinkDb.reduce is
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505197
]
Doğacan Güney commented on NUTCH-498:
-
Why can't we just set combiner class as LinkDb? AFAICS, you are not doing
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505242
]
Espen Amble Kolstad commented on NUTCH-498:
---
Yes, you're right.
I forgot I added a new class just to get
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505249
]
Doğacan Güney commented on NUTCH-498:
-
After examining the code better, I am a bit confused. We have a
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12505302
]
Andrzej Bialecki commented on NUTCH-498:
-
Currently there is no difference, indeed. The version in