Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
That was not intented. Just that am on holidays, it's raining and the children were either asleep or playing nicely :-) On 15 June 2012 18:19, Mattmann, Chris A (388J) < chris.a.mattm...@jpl.nasa.gov> wrote: > OK you are just making us all look bad now Juls ;) > > Super fast! > > Cheers, > Chris

[jira] [Commented] (NUTCH-1330) OutlinkDB to preserve back up

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295799#comment-13295799 ] Hudson commented on NUTCH-1330: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1386) Headings filter not to add empty values

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295802#comment-13295802 ] Hudson commented on NUTCH-1386: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295801#comment-13295801 ] Hudson commented on NUTCH-1352: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1024) Dynamically set fetchInterval by MIME-type

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295797#comment-13295797 ] Hudson commented on NUTCH-1024: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1398) Upgrade to Hadoop 1.0.3

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295800#comment-13295800 ] Hudson commented on NUTCH-1398: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling.

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295803#comment-13295803 ] Hudson commented on NUTCH-1356: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1319) HostNormalizer

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295804#comment-13295804 ] Hudson commented on NUTCH-1319: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[jira] [Commented] (NUTCH-1300) Indexer to normalize URL's

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295798#comment-13295798 ] Hudson commented on NUTCH-1300: --- Integrated in Nutch-trunk #1869 (See [https://builds.apach

[VOTE] Apache Nutch 2.0 RC2

2012-06-15 Thread lewis john mcgibbney
Hi Everyone, A candidate for the Apache Nutch 2.0 RC2 is available at: http://people.apache.org/~lewismc/apache-nutch-2.0rc2 The release candidate is a src.zip and src.tar.gz ONLY archive of the sources in: http://svn.apache.org/repos/asf/nutch/tags/release-2.0rc2 We release Nutch 2.0 in this

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Mattmann, Chris A (388J)
OK you are just making us all look bad now Juls ;) Super fast! Cheers, Chris On Jun 15, 2012, at 2:54 AM, Julien Nioche wrote: > see https://issues.apache.org/jira/browse/NUTCH-1396 > > On 15 June 2012 10:43, Julien Nioche wrote: > Before you do, could you check that NutchGora passes ant tes

[jira] [Commented] (NUTCH-1392) -force and -resume arguments being ignored in ParserJob

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295767#comment-13295767 ] Hudson commented on NUTCH-1392: --- Integrated in Nutch-nutchgora #281 (See [https://builds.ap

[jira] [Commented] (NUTCH-1396) Upgrade to Tika 1.1

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295766#comment-13295766 ] Hudson commented on NUTCH-1396: --- Integrated in Nutch-nutchgora #281 (See [https://builds.ap

[jira] [Commented] (NUTCH-1397) language-identifier incorrectly handles double-barreled language properties

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295740#comment-13295740 ] Lewis John McGibbney commented on NUTCH-1397: - Aye Julien. I was just this min

[jira] [Commented] (NUTCH-1397) language-identifier incorrectly handles double-barreled language properties

2012-06-15 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295738#comment-13295738 ] Julien Nioche commented on NUTCH-1397: -- Lewis, the language identification is a combi

[jira] [Closed] (NUTCH-1081) ant tests fail

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1081. --- > ant tests fail > --- > > Key: NUTCH-1081 >

[jira] [Resolved] (NUTCH-1081) ant tests fail

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1081. - Resolution: Fixed Fix Version/s: (was: 2.1) nutchgor

[jira] [Commented] (NUTCH-1398) Upgrade to Hadoop 1.0.3

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295734#comment-13295734 ] Lewis John McGibbney commented on NUTCH-1398: - We have a rather optimistic tic

[jira] [Comment Edited] (NUTCH-1397) language-identifier incorrectly handles double-barreled language properties

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295732#comment-13295732 ] Lewis John McGibbney edited comment on NUTCH-1397 at 6/15/12 3:52 PM: --

[jira] [Commented] (NUTCH-1397) language-identifier incorrectly handles double-barreled language properties

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295732#comment-13295732 ] Lewis John McGibbney commented on NUTCH-1397: - Hi KEN, this is exactly what fl

[jira] [Commented] (NUTCH-1398) Upgrade to Hadoop 1.0.3

2012-06-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295718#comment-13295718 ] Hudson commented on NUTCH-1398: --- Integrated in nutch-trunk-maven #314 (See [https://builds.

[jira] [Commented] (NUTCH-1397) language-identifier incorrectly handles double-barreled language properties

2012-06-15 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295683#comment-13295683 ] Ken Krugler commented on NUTCH-1397: Should this issue be filed against Tika, versus N

[jira] [Commented] (NUTCH-1081) ant tests fail

2012-06-15 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295676#comment-13295676 ] Ferdy Galema commented on NUTCH-1081: - Yes this one should be closed.

[jira] [Commented] (NUTCH-1398) Upgrade to Hadoop 1.0.3

2012-06-15 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295674#comment-13295674 ] Julien Nioche commented on NUTCH-1398: -- trunk : Committed revision 1350630. will wait

[jira] [Created] (NUTCH-1398) Upgrade to Hadoop 1.0.3

2012-06-15 Thread Julien Nioche (JIRA)
Julien Nioche created NUTCH-1398: Summary: Upgrade to Hadoop 1.0.3 Key: NUTCH-1398 URL: https://issues.apache.org/jira/browse/NUTCH-1398 Project: Nutch Issue Type: Improvement Affects Ver

[jira] [Commented] (NUTCH-1081) ant tests fail

2012-06-15 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13295672#comment-13295672 ] Julien Nioche commented on NUTCH-1081: -- The tests for nutchgora seem to work fine now

[jira] [Closed] (NUTCH-1396) Upgrade to Tika 1.1

2012-06-15 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche closed NUTCH-1396. Assignee: Julien Nioche Thanks Lewis > Upgrade to Tika 1.1 > --- >

[jira] [Created] (NUTCH-1397) language-identifier incorrectly handles double-barreled language properties

2012-06-15 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1397: --- Summary: language-identifier incorrectly handles double-barreled language properties Key: NUTCH-1397 URL: https://issues.apache.org/jira/browse/NUTCH-1397

[jira] [Resolved] (NUTCH-1396) Upgrade to Tika 1.1

2012-06-15 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1396. - Resolution: Fixed Perfect Julien. Tested locally against test suite and within sm

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
see https://issues.apache.org/jira/browse/NUTCH-1396 On 15 June 2012 10:43, Julien Nioche wrote: > Before you do, could you check that NutchGora passes ant test > successfully. I just tried and got an error related to the parse-tika > tests. Am about to open a JIRA to update to the latest versio

[jira] [Updated] (NUTCH-1396) Upgrade to Tika 1.1

2012-06-15 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1396: - Attachment: NUTCH-1396.patch > Upgrade to Tika 1.1 > --- > >

[jira] [Created] (NUTCH-1396) Upgrade to Tika 1.1

2012-06-15 Thread Julien Nioche (JIRA)
Julien Nioche created NUTCH-1396: Summary: Upgrade to Tika 1.1 Key: NUTCH-1396 URL: https://issues.apache.org/jira/browse/NUTCH-1396 Project: Nutch Issue Type: Bug Affects Versions: nutch

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
Before you do, could you check that NutchGora passes ant test successfully. I just tried and got an error related to the parse-tika tests. Am about to open a JIRA to update to the latest version of Tika for NutchGora which should fix the problem and put it at the same level as trunk J On 15 June

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Lewis John Mcgibbney
I'll push this in an hour or so guys. Thanks for the input. Lewis On Fri, Jun 15, 2012 at 9:39 AM, Julien Nioche < lists.digitalpeb...@gmail.com> wrote: > +1 > > > On 15 June 2012 09:00, Ferdy Galema wrote: > >> Agree with only releasing src. >> >> >> On Thu, Jun 14, 2012 at 11:32 PM, Mattmann

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Julien Nioche
+1 On 15 June 2012 09:00, Ferdy Galema wrote: > Agree with only releasing src. > > > On Thu, Jun 14, 2012 at 11:32 PM, Mattmann, Chris A (388J) < > chris.a.mattm...@jpl.nasa.gov> wrote: > >> Or just not ship a bin release at all. Src is the only thing we really >> VOTE on legally though bin is

Re: VOTE Apache Nutch 2.0 RC1

2012-06-15 Thread Ferdy Galema
Agree with only releasing src. On Thu, Jun 14, 2012 at 11:32 PM, Mattmann, Chris A (388J) < chris.a.mattm...@jpl.nasa.gov> wrote: > Or just not ship a bin release at all. Src is the only thing we really > VOTE on legally though bin is provided for convenience purposes. Will type > more on this l