Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-05-01 Thread Phil Barnett
On Sat, May 1, 2010 at 2:34 AM, Mattmann, Chris A (388J) < chris.a.mattm...@jpl.nasa.gov> wrote: > > Sure, hopefully you'll find the answer you're looking for. In the > meanwhile, > it's my job to keep cutting release candidates as the RM, that at least > pass > the basic criteria for release and

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-30 Thread Mattmann, Chris A (388J)
Hi Phil, Thanks for your comments. Mine below: >> Unfortunately some parts of the documentation on Nutch (namely the >> tutorial, >> and other parts of the static site) have been out of date for a while. This >> has occurred really independent of the releases, and independent of the >> wiki >> [1

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-30 Thread Phil Barnett
Oh yeah, I built a presentation and gave it to our local Linux User Group meeting. You might find it useful: http://leap-cf.org/presentations/nutch/NutchWebCrawler.odp On Sat, May 1, 2010 at 2:10 AM, Phil Barnett wrote: > > > On Wed, Apr 28, 2010 at 10:27 AM, matthew a. grisius > wrote: > >> I

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-30 Thread Phil Barnett
On Wed, Apr 28, 2010 at 10:27 AM, matthew a. grisius wrote: > I also share many of Phil's sentiments. I really want the project > (bin/nutch crawl) to work for me as well and I want to help somehow. I > would like to share a 5gb 'intranet' web site with ~50 people. And I > have not graduated to ma

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-30 Thread Phil Barnett
On Wed, Apr 28, 2010 at 11:01 AM, Mattmann, Chris A (388J) < chris.a.mattm...@jpl.nasa.gov> wrote: > > Unfortunately some parts of the documentation on Nutch (namely the > tutorial, > and other parts of the static site) have been out of date for a while. This > has occurred really independent of t

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-28 Thread Mattmann, Chris A (388J)
Hi Matthew, Thanks for your feedback. If you have any specific updates/improvements/actionable items based on your comments below, we'd love to have you contribute them back in the form of contributions to the community. Otherwise, we will take your feedback, put it into the queue of other item

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-28 Thread Mattmann, Chris A (388J)
Hi Phil, Thanks very much for the feedback. I'd like to take a second to address your points: > > How do you test to see if Nutch works like the documentation says it works? > I still find major differences between how existing documentation tells me, > a newcomer to the project, how to get it r

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-28 Thread matthew a. grisius
I also share many of Phil's sentiments. I really want the project (bin/nutch crawl) to work for me as well and I want to help somehow. I would like to share a 5gb 'intranet' web site with ~50 people. And I have not graduated to making the 'deepcrawl' script work yet either, as I'm thinking that may

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-28 Thread Phil Barnett
On Mon, Apr 26, 2010 at 1:55 AM, Mattmann, Chris A (388J) < chris.a.mattm...@jpl.nasa.gov> wrote: > > Please vote on releasing these packages as Apache Nutch 1.1. The vote is > open for the next 72 hours. > How do you test to see if Nutch works like the documentation says it works? I still find m

Re: Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Andrzej Bialecki
On 2010-04-26 17:30, Mattmann, Chris A (388J) wrote: > Hey Andrzej, > >> Actually, we don't have a build target (yet) that produces a binary-only >> distribution that we can ship and which you can run out of the box (not >> counting the build/nutch.job alone, because it needs the Hadoop >> infrast

Re: Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
Hey Andrzej, > Actually, we don't have a build target (yet) that produces a binary-only > distribution that we can ship and which you can run out of the box (not > counting the build/nutch.job alone, because it needs the Hadoop > infrastructure to run). I thought ant tar did this? That's what it

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
Hi Grant, Thanks. I think it actually makes sense to finish off 1.1, and since there is overlap with the Nutch PMC and the Lucene PMC and since the thread started in Lucene before the TLP, I think it would be great e.g., if Andrzej, and Sami could check the release and that way we still have th

Re: Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Andrzej Bialecki
On 2010-04-26 16:24, David M. Cole wrote: > At 10:55 PM -0700 4/25/10, Mattmann, Chris A (388J) wrote: >> Most folks that use Nutch are likely >> familiar with running ant IMHO. > > I guess then I fall into the category of "not most folks." Have been > running Nutch for about 14 months and I haven

Re: Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
Hi David, Thanks. In fact, running ant is probably simpler than running Nutch. The steps would be: * what OS are you on (Ant is available for all of them to my knowledge)? * if you need ant, grab a distro from ant.apache.org, otherwise, I'll assume that you've got ant installed and calla

Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread David M. Cole
At 10:55 PM -0700 4/25/10, Mattmann, Chris A (388J) wrote: Most folks that use Nutch are likely familiar with running ant IMHO. I guess then I fall into the category of "not most folks." Have been running Nutch for about 14 months and I haven't a clue how to run ant. If there's a place to vo

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Grant Ingersoll
Might I suggest, that since Nutch is now a TLP that you delay this release by a few weeks and have the vote done under the auspices of the Nutch PMC? Cheers, Grant On Apr 26, 2010, at 1:55 AM, Mattmann, Chris A (388J) wrote: > Hi Folks, > > I have posted an updated candidate for the Apache Nut

[VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-25 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc2/ The major difference between this release and rc #1 is the application of NUTCH-812 - Crawl.java incorrectly uses the Generator API resul