[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-691: -- Remaining Estimate: 0.25h Original Estimate: 0.25h > Update jakarta poi jars to the most re

[jira] Commented: (NUTCH-591) StringIndexOutOfBoundsException when extracting text from a Word document.

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674478#action_12674478 ] Dmitry Lihachev commented on NUTCH-591: --- can be resolved via NUTCH-691 > StringIndexO

[jira] Issue Comment Edited: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674468#action_12674468 ] dmitry.lihachev edited comment on NUTCH-691 at 2/17/09 9:39 PM: --

[jira] Commented: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674468#action_12674468 ] Dmitry Lihachev commented on NUTCH-691: --- Steps to reproduce NUTCH-591 (you must have t

[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-691: -- Attachment: NUTCH-691-v1-test.patch > Update jakarta poi jars to the most relevant version > ---

[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-691: -- Attachment: (was: NUTCH-691-v1-test.patch) > Update jakarta poi jars to the most relevant ve

[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-691: -- Attachment: NUTCH-691-v1-poi.patch > Update jakarta poi jars to the most relevant version >

[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-691: -- Attachment: NUTCH-691-v1-test.patch cd nutch; > Update jakarta poi jars to the most relevant v

[jira] Updated: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-691: -- Comment: was deleted > Update jakarta poi jars to the most relevant version > --

[jira] Created: (NUTCH-691) Update jakarta poi jars to the most relevant version

2009-02-17 Thread Dmitry Lihachev (JIRA)
Update jakarta poi jars to the most relevant version Key: NUTCH-691 URL: https://issues.apache.org/jira/browse/NUTCH-691 Project: Nutch Issue Type: Improvement Components: fetche

Hudson build is back to normal: Nutch-trunk #728

2009-02-17 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/728/changes

[jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links

2009-02-17 Thread Peter Sparks (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Sparks updated NUTCH-689: --- Attachment: parse-swf.patch Here is the patch. > Swf parser doesn't seem to handle relative links > -

[jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links

2009-02-17 Thread Peter Sparks (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Sparks updated NUTCH-689: --- Attachment: (was: SWFParser.java) > Swf parser doesn't seem to handle relative links > ---

[jira] Commented: (NUTCH-689) Swf parser doesn't seem to handle relative links

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674360#action_12674360 ] Sami Siren commented on NUTCH-689: -- about development: check url http://wiki.apache.org/nu

[jira] Created: (NUTCH-690) bug in DomContentUtils.shouldThrowAwayLink?

2009-02-17 Thread Peter Sparks (JIRA)
bug in DomContentUtils.shouldThrowAwayLink? --- Key: NUTCH-690 URL: https://issues.apache.org/jira/browse/NUTCH-690 Project: Nutch Issue Type: Bug Affects Versions: 0.9.0 Reporter: Pete

[jira] Updated: (NUTCH-689) Swf parser doesn't seem to handle relative links

2009-02-17 Thread Peter Sparks (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Sparks updated NUTCH-689: --- Attachment: SWFParser.java I'm new to both open source and nutch so I'm not sure of the right way to

[jira] Created: (NUTCH-689) Swf parser doesn't seem to handle relative links

2009-02-17 Thread Peter Sparks (JIRA)
Swf parser doesn't seem to handle relative links Key: NUTCH-689 URL: https://issues.apache.org/jira/browse/NUTCH-689 Project: Nutch Issue Type: Bug Affects Versions: 0.9.0 Repo

[jira] Updated: (NUTCH-310) Review Log Levels

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-310: - Fix Version/s: (was: 1.0.0) 1.1 pushing to 1.1 > Review Log Levels > -

[jira] Updated: (NUTCH-249) black- white list url filtering

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-249: - Fix Version/s: (was: 1.0.0) 1.1 pushing to 1.1 > black- white list url filtering >

[jira] Updated: (NUTCH-309) Uses commons logging Code Guards

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-309: - Fix Version/s: (was: 1.0.0) 1.1 pushing this to 1.1 > Uses commons logging Code Gu

[jira] Updated: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-469: - Fix Version/s: (was: 1.0.0) 1.1 pushing this to 1.1 > changes to geoPosition plugi

[jira] Updated: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s)

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-609: - Fix Version/s: (was: 1.0.0) 1.1 pushing this to 1.1, feel free to put back if there

[jira] Updated: (NUTCH-86) LanguageIdentifier API enhancements

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-86: Fix Version/s: (was: 1.0.0) removing from 1.0 queue since there has been no activity lately > LanguageId

[jira] Resolved: (NUTCH-582) Add missing type parameters

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren resolved NUTCH-582. -- Resolution: Fixed yep, all of this has been committed > Add missing type parameters > -

[jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException

2009-02-17 Thread hasan (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674244#action_12674244 ] hasan commented on NUTCH-631: - I had this problem after I install NUTCH-631.patch, the problem h

[jira] Commented: (NUTCH-609) Allow Plugins to be Loaded from Jar File(s)

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674226#action_12674226 ] Sami Siren commented on NUTCH-609: -- I think the direction in general is good, I know that w

[jira] Resolved: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren resolved NUTCH-631. -- Resolution: Fixed Assignee: Sami Siren (was: Chris A. Mattmann) committed, thanks > MoreIndexing

[jira] Commented: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException

2009-02-17 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674219#action_12674219 ] Chris A. Mattmann commented on NUTCH-631: - Sami, +1. Sorry I didn't have time to get

[jira] Created: (NUTCH-688) Fix missing/wrong headers in source files

2009-02-17 Thread Sami Siren (JIRA)
Fix missing/wrong headers in source files - Key: NUTCH-688 URL: https://issues.apache.org/jira/browse/NUTCH-688 Project: Nutch Issue Type: Bug Reporter: Sami Siren Assignee: Sam

[jira] Updated: (NUTCH-687) Add RAT

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-687: - Attachment: NUTCH-687.patch > Add RAT > --- > > Key: NUTCH-687 > URL:

[jira] Created: (NUTCH-687) Add RAT

2009-02-17 Thread Sami Siren (JIRA)
Add RAT --- Key: NUTCH-687 URL: https://issues.apache.org/jira/browse/NUTCH-687 Project: Nutch Issue Type: Improvement Reporter: Sami Siren Assignee: Sami Siren Priority: Minor Attachments: NU

[jira] Updated: (NUTCH-631) MoreIndexingFilter fails with NoSuchElementException

2009-02-17 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated NUTCH-631: - Attachment: NUTCH-631.patch Attaching a patch that fixes the problem as proposed, If there are no objecti