[jira] [Created] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-05-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1370: --- Summary: Expose exact number of urls injected @runtime Key: NUTCH-1370 URL: https://issues.apache.org/jira/browse/NUTCH-1370 Project: Nutch

Re: svn commit: r1341365 - /nutch/trunk/ivy/mvn.template

2012-05-22 Thread Lewis John Mcgibbney
Hi Julian. Trivial mistake in this commit. It seems that Ferdy's email has changed to mine!!! Thanks Lewis On Tue, May 22, 2012 at 10:10 AM, jnio...@apache.org wrote: Author: jnioche Date: Tue May 22 09:10:00 2012 New Revision: 1341365 URL:

Re: 1.5 RC2

2012-05-22 Thread Lewis John Mcgibbney
OK doke this sounds fine to me then. I will make the relevant commits to the 1.5 branch then work at it later this evening. I'll make a new thread when the stuff is sorted out and we are ready to VOTE on the new RC. Thanks Lewis On Tue, May 22, 2012 at 10:15 AM, Julien Nioche

[jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime

2012-05-22 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1370: - Priority: Minor (was: Major) Running in pseudo-distributed mode gives you more information if

Re: svn commit: r1341365 - /nutch/trunk/ivy/mvn.template

2012-05-22 Thread Julien Nioche
cut and paste :-) Ferdy wasn't there at all etc... Fixed! Thanks On 22 May 2012 10:33, Lewis John Mcgibbney lewis.mcgibb...@gmail.comwrote: Hi Julian. Trivial mistake in this commit. It seems that Ferdy's email has changed to mine!!! Thanks Lewis On Tue, May 22, 2012 at 10:10 AM,

[jira] [Created] (NUTCH-1371) Replace Ivy with Maven Ant tasks

2012-05-22 Thread Julien Nioche (JIRA)
Julien Nioche created NUTCH-1371: Summary: Replace Ivy with Maven Ant tasks Key: NUTCH-1371 URL: https://issues.apache.org/jira/browse/NUTCH-1371 Project: Nutch Issue Type: Improvement

[jira] [Updated] (NUTCH-1371) Replace Ivy with Maven Ant tasks

2012-05-22 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1371: - Attachment: NUTCH-1371.patch Preliminary version. Needs maven-ant-tasks-2.1.3.jar in ivy dir +

[jira] [Created] (NUTCH-1372) Improve execution of normalisers

2012-05-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1372: --- Summary: Improve execution of normalisers Key: NUTCH-1372 URL: https://issues.apache.org/jira/browse/NUTCH-1372 Project: Nutch Issue Type:

[jira] [Created] (NUTCH-1373) Implement consistent execution of normalising and filtering in Generator

2012-05-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1373: --- Summary: Implement consistent execution of normalising and filtering in Generator Key: NUTCH-1373 URL: https://issues.apache.org/jira/browse/NUTCH-1373

[jira] [Resolved] (NUTCH-1372) Improve execution of normalisers

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1372. - Resolution: Invalid accidental entry Improve execution of

[jira] [Closed] (NUTCH-1372) Improve execution of normalisers

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-1372. --- Improve execution of normalisers

Re: 1.5 RC2

2012-05-22 Thread Lewis John Mcgibbney
Hi, As I say, I am able to stick time in tonight to roll this RC, however does anyone have a problem with me rolling the 2.0 RC tonight after the 1.5RC2? I would like to get them out the way saving me time during this week if possible. Thanks Lewis On Tue, May 22, 2012 at 10:35 AM, Lewis John

Re: 1.5 RC2

2012-05-22 Thread Mattmann, Chris A (388J)
+1 happy for Lewis to try I've been swamped! Sent from my iPhone On May 22, 2012, at 2:16 AM, Julien Nioche lists.digitalpeb...@gmail.commailto:lists.digitalpeb...@gmail.com wrote: Hi Lewis, I am sure that Chris will have no problem with you doing the RC2. Chris? It would be a good thing to

Re: 1.5 RC2

2012-05-22 Thread Mattmann, Chris A (388J)
+1 Sent from my iPhone On May 22, 2012, at 4:43 AM, Lewis John Mcgibbney lewis.mcgibb...@gmail.commailto:lewis.mcgibb...@gmail.com wrote: Hi, As I say, I am able to stick time in tonight to roll this RC, however does anyone have a problem with me rolling the 2.0 RC tonight after the 1.5RC2?

[jira] [Created] (NUTCH-1375) extract main content of a html file

2012-05-22 Thread behnam nikbakht (JIRA)
behnam nikbakht created NUTCH-1375: -- Summary: extract main content of a html file Key: NUTCH-1375 URL: https://issues.apache.org/jira/browse/NUTCH-1375 Project: Nutch Issue Type: Bug

[jira] [Updated] (NUTCH-1375) extract main content of a html file

2012-05-22 Thread behnam nikbakht (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] behnam nikbakht updated NUTCH-1375: --- Attachment: NUTCH-1375.patch extract main content of a html file

[jira] [Commented] (NUTCH-1375) extract main content of a html file

2012-05-22 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13281024#comment-13281024 ] Julien Nioche commented on NUTCH-1375: -- your patch generates noise (a +

Re: 1.5 RC2

2012-05-22 Thread Lewis John Mcgibbney
As you will have seen the staging repository is closed so we are half way there. I'm now about to sign then upload the generated artifacts to my Apache area for us to view and VOTE, however I am struggling to produce anything other than the nutch-1.5.tar.gz and the nutch-1.5.zip e.g. no

[jira] [Updated] (NUTCH-879) URL-s getting lost

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-879: --- bump to 2.1 URL-s getting lost --

[jira] [Updated] (NUTCH-879) URL-s getting lost

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-879: --- Fix Version/s: (was: nutchgora) 2.1 URL-s getting lost

[jira] [Updated] (NUTCH-1369) Improve ParserChecker in Nutchgora

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1369: Fix Version/s: (was: nutchgora) 2.1 set and classify for

Re: 1.5 RC2

2012-05-22 Thread Julien Nioche
we'd need to duplicate the tasks tar and zip so that they operate on what package-bin produces + rename the output of the standard package into nutch-X-src. The modif I made to build.xml does not deal with that :-( On 22 May 2012 19:43, Lewis John Mcgibbney lewis.mcgibb...@gmail.comwrote: As

[jira] [Updated] (NUTCH-1301) Index job resume switch to resume a failed job

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1301: Attachment: NUTCH-1301-v2.patch patch for nutchgora but I haven't tested and think

[jira] [Updated] (NUTCH-1301) Index job resume switch to resume a failed job

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1301: Fix Version/s: (was: nutchgora) 2.1 Index job resume

[jira] [Updated] (NUTCH-1294) IndexClean job with solr implementation.

2012-05-22 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1294: Fix Version/s: (was: nutchgora) 2.1 Still not tested

Re: 1.5 RC2

2012-05-22 Thread Lewis John Mcgibbney
Done. The artifacts will be signed and uploaded in a couple of mins. Thanks On Tue, May 22, 2012 at 8:22 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: I've been working on it, so I'll try to commit and do it ASAP. Thanks Julien Lewis On Tue, May 22, 2012 at 8:17 PM, Julien

Apache Nutch release 1.5 RC2

2012-05-22 Thread Lewis John Mcgibbney
Good Evening Everyone, A candidate for the Apache Nutch 1.5 release is available at: http://people.apache.org/~lewismc/apache-gora-0.2/rc1/ The release candidate is a src.zip, bin.zip, src.tar.gz and bin.tar.gz archive of the sources in:

Re: 1.5 RC2

2012-05-22 Thread Julien Nioche
Brilliant! Thanks Lewis On 22 May 2012 20:30, Lewis John Mcgibbney lewis.mcgibb...@gmail.comwrote: Done. The artifacts will be signed and uploaded in a couple of mins. Thanks On Tue, May 22, 2012 at 8:22 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: I've been working on

Re: Apache Nutch release 1.5 RC2

2012-05-22 Thread Julien Nioche
Read http://people.apache.org/~lewismc/nutch-1.5-rc2/ :-) On 22 May 2012 20:59, Lewis John Mcgibbney lewis.mcgibb...@gmail.comwrote: Good Evening Everyone, A candidate for the Apache Nutch 1.5 release is available at: http://people.apache.org/~lewismc/apache-gora-0.2/rc1/ The release

[jira] [Created] (NUTCH-1376) Add description parameter to every ant task

2012-05-22 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1376: --- Summary: Add description parameter to every ant task Key: NUTCH-1376 URL: https://issues.apache.org/jira/browse/NUTCH-1376 Project: Nutch

Re: 1.5 RC2

2012-05-22 Thread Lewis John Mcgibbney
An update on 2.0 RC. The maven stuff has been done, but I've run into a problemm generating the project artifacts, which I can't seem to solve so I've headed over to user@ant to see if they can help me out. Hopefully I can spin the RC for 2.0 on Thursday as I won't be able to do it tomorrow.

RE: Apache Nutch release 1.5 RC2

2012-05-22 Thread Markus Jelsma
Great! My +1 for a new release based on the state of the codebase. -Original message- From:Julien Nioche lists.digitalpeb...@gmail.com Sent: Tue 22-May-2012 22:19 To: dev@nutch.apache.org Cc: u...@nutch.apache.org Subject: Re: Apache Nutch release 1.5 RC2 Read