[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108633#comment-13108633
]
Julien Nioche commented on NUTCH-1052:
--
I like the original idea and agree that
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108641#comment-13108641
]
Markus Jelsma commented on NUTCH-1052:
--
Thanks for your comments! Just to make sure i
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108701#comment-13108701
]
Julien Nioche commented on NUTCH-1052:
--
Yep, that's the idea.
The class will have to
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108731#comment-13108731
]
Markus Jelsma commented on NUTCH-1052:
--
I see. I did a quick modification and came up
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108757#comment-13108757
]
Julien Nioche commented on NUTCH-1052:
--
{quote}
Julien, will it break on Hadoop
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108763#comment-13108763
]
Markus Jelsma commented on NUTCH-1052:
--
Thanks, I already did :) I now write the
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13097796#comment-13097796
]
Markus Jelsma commented on NUTCH-1052:
--
Perhaps an even better solution is to keep
[
https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13093746#comment-13093746
]
Markus Jelsma commented on NUTCH-1052:
--
Updating the CrawlDB is a tedious process and