[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108633#comment-13108633 ] Julien Nioche commented on NUTCH-1052: I like the original idea and agree that

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108641#comment-13108641 ] Markus Jelsma commented on NUTCH-1052: Thanks for your comments! Just to make sure I

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108701#comment-13108701 ] Julien Nioche commented on NUTCH-1052: Yep, that's the idea. The class will have to

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108731#comment-13108731 ] Markus Jelsma commented on NUTCH-1052: I see. I did a quick modification and came up

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108757#comment-13108757 ] Julien Nioche commented on NUTCH-1052: {quote} Julien, will it break on Hadoop

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108763#comment-13108763 ] Markus Jelsma commented on NUTCH-1052: Thanks, I already did :) I now write the

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-09-06 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13097796#comment-13097796 ] Markus Jelsma commented on NUTCH-1052: Perhaps an even better solution is to keep

[jira] [Commented] (NUTCH-1052) Multiple deletes of the same URL using SolrClean

2011-08-30 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13093746#comment-13093746 ] Markus Jelsma commented on NUTCH-1052: Updating the CrawlDB is a tedious process and