[jira] [Updated] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-29 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1078: Attachment: NUTCH-1078-trunk-20110929.patch The attached patch changes the LogUtil

[jira] [Updated] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-09-29 Thread Lewis John McGibbney (Updated) (JIRA)
Fix For: 1.4, 2.0 Attachments: 0001-NUTCH-672-allow-junit-tests-to-be-run-from-bin-nutc.patch, NUTCH-672-junit-test-commandline.patch, NUTCH-672-trunk-1.4-20110929.patch In development it's handy to be able to run a single test case easily. You can do it with ant

[jira] [Updated] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-09-29 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-672: --- Attachment: NUTCH-672-nutchgora-20110929.patch patch attachment for nutchgora. In my

[jira] [Closed] (NUTCH-623) Change plugin source directory languageidentifier to language-identifier

2011-09-29 Thread Lewis John McGibbney (Closed) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-623. -- Change plugin source directory languageidentifier to language-identifier

Re: Prepare for 1.4 release?

2011-09-29 Thread lewis john mcgibbney
Hi, On the JIRA I see 32 unresolved issues for 1.4... Is it possible for us to agree some kind of programme for establishing what we would like to be in the 1.4 release? I am keen to focus on something which we all have a common interest in progressing. Thanks On Wed, Sep 28, 2011 at 10:13 AM,

[jira] [Commented] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-09-29 Thread Julien Nioche (Commented) (JIRA)
-nutchgora-20110929.patch, NUTCH-672-trunk-1.4-20110929.patch In development it's handy to be able to run a single test case easily. You can do it with ant -Dtestcase=foo test, but that's slow since it still checks all the plugins for changes, rebuilds jars, etc. This patch adds a command

[jira] [Updated] (NUTCH-1046) Add tests for indexing to SOLR

2011-09-29 Thread Julien Nioche (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1046: - Affects Version/s: (was: 1.4) (was: 2.0) Fix Version/s:

[jira] [Updated] (NUTCH-1064) o.a.n.util.MimeUtil uses deprecated Tika methods

2011-09-29 Thread Julien Nioche (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1064: - Fix Version/s: (was: 1.4) 1.5 Postpone to 1.5. Should have new Tika

Re: Prepare for 1.4 release?

2011-09-29 Thread Julien Nioche
BTW I have renamed the version '2.0' in JIRA into 'nutchgora' to reflect the location of the code + have deleted the version '2.1' which was empty On 29 September 2011 12:08, lewis john mcgibbney lewis.mcgibb...@gmail.comwrote: Hi, On the JIRA I see 32 unresolved issues for 1.4... Is it

[jira] [Updated] (NUTCH-1061) Migrate MoreIndexingFilter from Apache ORO to java.util.regex

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1061: - Fix Version/s: (was: 1.4) (was: nutchgora) 1.5

[jira] [Updated] (NUTCH-1084) ReadDB url throws exception

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1084: - Affects Version/s: (was: 1.4) 1.3 Fix Version/s: (was:

[jira] [Updated] (NUTCH-1021) Migrate OutlinkExtractor from Apache ORO to java.util.regex

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1021: - Fix Version/s: (was: 1.4) (was: nutchgora) 1.5

[jira] [Updated] (NUTCH-1041) Not reading mime-type correctly

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1041: - Affects Version/s: (was: 1.4) 1.3 Fix Version/s: (was:

[jira] [Updated] (NUTCH-1060) URL filters to produce regexes to be used by OutlinkExtractor.

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1060: - Fix Version/s: (was: 1.4) (was: nutchgora) 1.5

[jira] [Updated] (NUTCH-1017) Exception getting mime type by name

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1017: - Fix Version/s: (was: 1.4) (was: nutchgora) 1.5

[jira] [Updated] (NUTCH-1014) Migrate from Apache ORO to java.util.regex

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1014: - Fix Version/s: (was: 1.4) (was: nutchgora) 1.5

[jira] [Updated] (NUTCH-1063) OutlinkExtractor test generates an exception but does not fail

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1063: - Fix Version/s: (was: 1.4) 1.5 OutlinkExtractor test generates an

[jira] [Updated] (NUTCH-1087) Deprecate crawl command and replace with example script

2011-09-29 Thread Markus Jelsma (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1087: - Fix Version/s: (was: 1.4) 1.5 Deprecate crawl command and replace

[jira] [Updated] (NUTCH-208) http: proxy exception list:

2011-09-29 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-208: --- Fix Version/s: (was: 1.4) 1.5 http: proxy exception list:

[jira] [Commented] (NUTCH-629) Detect slow and timeout servers and drop their URLs

2011-09-29 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117234#comment-13117234 ] Lewis John McGibbney commented on NUTCH-629: What is the situation with this

[jira] [Commented] (NUTCH-629) Detect slow and timeout servers and drop their URLs

2011-09-29 Thread Markus Jelsma (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117236#comment-13117236 ] Markus Jelsma commented on NUTCH-629: - I think this can be can be marked as won't fix.

[jira] [Commented] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-29 Thread Markus Jelsma (Commented) (JIRA)
, NUTCH-1078-branch-1.4-20110824-v2.patch, NUTCH-1078-branch-1.4-20110911-v3.patch, NUTCH-1078-branch-1.4-20110916-v4.patch, NUTCH-1078-trunk-20110929.patch Whilst working on another issue, I noticed that some classes still import and use commons logging for example HttpBase.java {code

[jira] [Resolved] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-29 Thread Lewis John McGibbney (Resolved) (JIRA)
For: 1.4 Attachments: NUTCH-1078-branch-1.4-20110816.patch, NUTCH-1078-branch-1.4-20110824-v2.patch, NUTCH-1078-branch-1.4-20110911-v3.patch, NUTCH-1078-branch-1.4-20110916-v4.patch, NUTCH-1078-trunk-20110929.patch Whilst working on another issue, I noticed that some classes still import

[jira] [Closed] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-29 Thread Lewis John McGibbney (Closed) (JIRA)
-trunk-20110929.patch Whilst working on another issue, I noticed that some classes still import and use commons logging for example HttpBase.java {code} import java.util.*; // Commons Logging imports import org.apache.commons.logging.Log; import org.apache.commons.logging.LogFactory; // Nutch

[jira] [Commented] (NUTCH-609) Allow Plugins to be Loaded from Jar File(s)

2011-09-29 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117469#comment-13117469 ] Lewis John McGibbney commented on NUTCH-609: Not been activity on this one for

[jira] [Commented] (NUTCH-896) Gora-based tests need to have their own config files

2011-09-29 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117654#comment-13117654 ] Lewis John McGibbney commented on NUTCH-896: This has taken some time to get

[jira] [Updated] (NUTCH-1136) Ant pmd target is broken

2011-09-29 Thread Lewis John McGibbney (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1136: Affects Version/s: nutchgora Fix Version/s: nutchgora Ant pmd target

[jira] [Commented] (NUTCH-965) Skip parsing for truncated documents

2011-09-29 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117668#comment-13117668 ] Lewis John McGibbney commented on NUTCH-965: This would be great to get into

[jira] [Commented] (NUTCH-1091) Remove commons logging dependency from Nutch branch and trunk

2011-09-29 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117670#comment-13117670 ] Lewis John McGibbney commented on NUTCH-1091: - any issues with me committing

[jira] [Commented] (NUTCH-1058) Upgrade Solr schema to version 1.4

2011-09-29 Thread Lewis John McGibbney (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117672#comment-13117672 ] Lewis John McGibbney commented on NUTCH-1058: - It is important that we address

[jira] [Commented] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-09-29 Thread Hudson (Commented) (JIRA)
-commandline.patch, NUTCH-672-nutchgora-20110929.patch, NUTCH-672-trunk-1.4-20110929.patch In development it's handy to be able to run a single test case easily. You can do it with ant -Dtestcase=foo test, but that's slow since it still checks all the plugins for changes, rebuilds jars, etc

Build failed in Jenkins: Nutch-nutchgora #21

2011-09-29 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-nutchgora/21/changes Changes: [lewismc] commit to address NUTCH-672 to update to changes.txt -- [...truncated 2520 lines...] resolve-default: [ivy:resolve] :: loading settings :: file =

[jira] [Commented] (NUTCH-1091) Remove commons logging dependency from Nutch branch and trunk

2011-09-29 Thread Sami Siren (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13117862#comment-13117862 ] Sami Siren commented on NUTCH-1091: --- +1, go ahead Lewis Remove commons

[jira] [Commented] (NUTCH-1078) Upgrade all instances of commons logging to slf4j (with log4j backend)

2011-09-29 Thread Hudson (Commented) (JIRA)
-v3.patch, NUTCH-1078-branch-1.4-20110916-v4.patch, NUTCH-1078-trunk-20110929.patch Whilst working on another issue, I noticed that some classes still import and use commons logging for example HttpBase.java {code} import java.util.*; // Commons Logging imports import

[jira] [Commented] (NUTCH-672) allow unit tests to be run from bin/nutch

2011-09-29 Thread Hudson (Commented) (JIRA)
: Todd Lipcon Assignee: Lewis John McGibbney Priority: Minor Fix For: 1.4, nutchgora Attachments: 0001-NUTCH-672-allow-junit-tests-to-be-run-from-bin-nutc.patch, NUTCH-672-junit-test-commandline.patch, NUTCH-672-nutchgora-20110929.patch, NUTCH-672