[jira] Commented: (NUTCH-606) Refactoring of Generator, run all urls through checks

2008-02-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12568423#action_12568423 ] Hudson commented on NUTCH-606: -- Integrated in Nutch-trunk #360 (See

[jira] Commented: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

2008-02-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12568421#action_12568421 ] Hudson commented on NUTCH-608: -- Integrated in Nutch-trunk #360 (See

[jira] Commented: (NUTCH-605) Change deprecated configuration methods for Hadoop

2008-02-12 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12568422#action_12568422 ] Hudson commented on NUTCH-605: -- Integrated in Nutch-trunk #360 (See

[jira] Updated: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16

2008-02-12 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes updated NUTCH-611: --- Attachment: NUTCH-611-1-20080212.patch This patch upgrades the jar and native libraries, also fixes

[jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s)

2008-02-12 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes updated NUTCH-609: --- Attachment: NUTCH-609-1-20080212.patch Rough first draft of patch. After research I determined

[jira] Created: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16

2008-02-12 Thread Dennis Kubes (JIRA)
Upgrade Nutch to use Hadoop 0.16 Key: NUTCH-611 URL: https://issues.apache.org/jira/browse/NUTCH-611 Project: Nutch Issue Type: Improvement Affects Versions: 1.0.0 Environment: All

[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s)

2008-02-12 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12568207#action_12568207 ] Dennis Kubes commented on NUTCH-609: Well, as it turns out I haven't found a way to put

[jira] Resolved: (NUTCH-605) Change deprecated configuration methods for Hadoop

2008-02-12 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-605. Resolution: Fixed Committed. Change deprecated configuration methods for Hadoop

[jira] Closed: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

2008-02-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann closed NUTCH-608. --- - Patch applied to trunk: http://svn.apache.org/viewvc?rev=620811view=rev Thanks for the

[jira] Resolved: (NUTCH-608) Upgrade nutch to use released apache-tika-0.1-incubating

2008-02-12 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-608. - Resolution: Fixed - added MimeUtil facade class to insulate Nutch from underlying mime

[jira] Resolved: (NUTCH-606) Refactoring of Generator, run all urls through checks

2008-02-12 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-606. Resolution: Fixed Committed. Refactoring of Generator, run all urls through checks

[jira] Updated: (NUTCH-603) Add more default url normalizations

2008-02-12 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes updated NUTCH-603: --- Attachment: NUTCH-603-2-20080212.patch This patch comments out the default page removal (i.e