[jira] Created: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment

2007-07-24 Thread Vishal Shah (JIRA)
DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment - Key: NUTCH-525 URL:

[jira] Updated: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment

2007-07-24 Thread Vishal Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vishal Shah updated NUTCH-525: -- Attachment: deleteDups.patch Patch for the bug attached here. DeleteDuplicates generates

[jira] Commented: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment

2007-07-24 Thread Vishal Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12514910 ] Vishal Shah commented on NUTCH-525: --- Hi, I'll add a unit test. For the undelete thing, the need could arise

[jira] Updated: (NUTCH-525) DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment

2007-07-24 Thread Vishal Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vishal Shah updated NUTCH-525: -- Attachment: RededupUnitTest.patch I have modified the existing junit test for DeleteDuplicates to test

[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-22 Thread Vishal Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507144 ] Vishal Shah commented on NUTCH-503: --- Hi Emmanuel, Can you please dump the contents of your crawldb after

[jira] Created: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-21 Thread Vishal Shah (JIRA)
Generator exits incorrectly for small fetchlists - Key: NUTCH-503 URL: https://issues.apache.org/jira/browse/NUTCH-503 Project: Nutch Issue Type: Bug Components: generator

[jira] Updated: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-21 Thread Vishal Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vishal Shah updated NUTCH-503: -- Attachment: emptyfetchlist.patch Hi, The previous patch is missing a header line. I've reattached