[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12741225#action_12741225
]
Doğacan Güney commented on NUTCH-721:
-
Thanks for the analysis, Julien! Can you make a
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12741082#action_12741082
]
Julien Nioche commented on NUTCH-721:
-
I had another look at this issue after applying
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12741092#action_12741092
]
Andrzej Bialecki commented on NUTCH-721:
-
+1. Current defaults are sub-optimal due
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12730242#action_12730242
]
Steven Denny commented on NUTCH-721:
I've done some testing on this and looked at the
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12730402#action_12730402
]
Doğacan Güney commented on NUTCH-721:
-
Steven, if you have time/hardware, can you retry
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12712492#action_12712492
]
Otis Gospodnetic commented on NUTCH-721:
Questions:
Has anyone tried profiling this?
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12712494#action_12712494
]
Otis Gospodnetic commented on NUTCH-721:
Ken's thoughts:
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12712506#action_12712506
]
Roger Dunk commented on NUTCH-721:
--
My tests were done on a segment with only 1 URL per
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695277#action_12695277
]
Doğacan Güney commented on NUTCH-721:
-
Wow, 53 min vs 3 min !?
Thanks a lot for testing
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695298#action_12695298
]
Roger Dunk commented on NUTCH-721:
--
I did a -topN 5000, so only a subset of the attached,
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695394#action_12695394
]
Julien Nioche commented on NUTCH-721:
-
The message about the Aborted hung threads looks
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695600#action_12695600
]
Roger Dunk commented on NUTCH-721:
--
Julien, yes, fetcher.threads.per.host.by.ip was set to
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12694986#action_12694986
]
Doğacan Güney commented on NUTCH-721:
-
I've committed nutch 0.9 fetcher as OldFetcher.
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695170#action_12695170
]
Roger Dunk commented on NUTCH-721:
--
For the following tests I've used the same segment
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12695233#action_12695233
]
Hudson commented on NUTCH-721:
--
Integrated in Nutch-trunk #772 (See
[
https://issues.apache.org/jira/browse/NUTCH-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12694708#action_12694708
]
Doğacan Güney commented on NUTCH-721:
-
OK, there is clearly a problem with the new
16 matches
Mail list logo