[ANNOUNCE] New Nutch Committer: Julien Nioche

2009-12-24 Thread Mattmann, Chris A (388J)
All, A little while ago I nominated Julien Nioche to be Nutch committer based on his contributions to the Nutch project (10+ patches in this release alone, and all the mailing list help and thoughtful design discussion). I'm happy to announce that the Lucene PMC has voted to make Julien a Nutch co

1.1 release?

2010-03-09 Thread Mattmann, Chris A (388J)
Hey Guys, I have some extra time this weekend and early next week. Want me to be the RM and push out a 1.1 release? Any blockers? I'm happy to do it just let me know. Cheers, Chris ++ Chris Mattmann, Ph.D. Senior Computer Scientist

Re: [DISCUSS] Nutch as a top level project (TLP)?

2010-03-20 Thread Mattmann, Chris A (388J)
Hey Andrzej, I'd be +1 for Nutch being a TLP. I don't think it'll change much (other than to provide more visibility/etc., and to allow more focused decision making by the folks in the Nutch community). The infrastructure moves required to move to TLP status are moving mailing lists, moving JIR

Re: 1.1 release?

2010-03-31 Thread Mattmann, Chris A (388J)
Hey Guys, OK I'm finally getting around to this: I am going to push all the current 1.1 JIRA issues "out" and set their fix version to nil. Once I'm done with this, I'll wait 48 hrs to see if there is anything that anyone really wants to get into 1.1. So, please, take a look here [1] and make s

Re: [VOTE] Apache Tika 0.7 Release Candidate #1

2010-04-02 Thread Mattmann, Chris A (388J)
to include the sha1 of the src archive from jzitting. Will do on both, going forward. * +1 for having a direct link to tika-app on the website. Cheers, Chris On 4/1/10 11:41 PM, "Jukka Zitting" wrote: > Hi, > > On Wed, Mar 31, 2010 at 10:01 PM, Mattmann, Chris A (388J)

Re: [VOTE] Apache Tika 0.7 Release Candidate #1

2010-04-02 Thread Mattmann, Chris A (388J)
2, 2010 at 4:14 PM, Mattmann, Chris A (388J) wrote: > +1s, so technically we could still do the 72 hrs and still be OK, but I'm > fine with giving folks some more time to take a look I'm fine with closing the vote already at 72 hours since the p.a.o outage only see

Question: Nutch 0.8.2 and Nutch 0.7.3?

2010-04-03 Thread Mattmann, Chris A (388J)
Hey Guys, Question. I see 2 releases that haven't been cut in JIRA: 0.8.2: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&pid=106 80&fixfor=12312064 0.7.3: https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&pid=106 80&fixfor=12312176 I'm happy to cut 0.

Re: Question: Nutch 0.8.2 and Nutch 0.7.3?

2010-04-04 Thread Mattmann, Chris A (388J)
Hey Andrzej, >> http://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8/ > > That's the code that was intended to become 0.8.2 ... > > However, I'm not sure whether there's any benefit in releasing either of > these. Those who really had the need to track this branch (or 0.7) > likely u

Re: release of 1.1?

2010-04-06 Thread Mattmann, Chris A (388J)
Thanks Julien! OK, I'll cut the RC at some point today. Thanks! Cheers, Chris On 4/6/10 4:47 AM, "Julien Nioche" wrote: Chris, Just to let you know that I have committed https://issues.apache.org/jira/browse/NUTCH-810 which was the last open issue before the release of 1.1 Thanks Julien

[VOTE] Apache Nutch 1.1 Release Candidate #1

2010-04-06 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted a candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc1/ See the included CHANGES.txt file for details on release contents and latest changes. The release was made using the Nutch release process, docume

Re: [VOTE] Apache Nutch 1.1 Release Candidate #1

2010-04-06 Thread Mattmann, Chris A (388J)
Oh, per usual, forgot to throw in my +1. So, +1! Cheers, Chris On 4/7/10 1:14 AM, "Mattmann, Chris A (388J)" wrote: Hi Folks, I have posted a candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc1/ See th

Re: [DISCUSS] Board resolution for Nutch as TLP

2010-04-09 Thread Mattmann, Chris A (388J)
Hi Andrzej, +1, with the following amendment: > > RESOLVED, that all responsibilities pertaining to the Apache > Lucene Nutch sub-project encumbered upon the > Apache Nutch Project are hereafter discharged. This should read: > RESOLVED, that all responsibilities pertaining to the Apache > Luce

Re: Adding jpeg parser to nutch

2010-04-10 Thread Mattmann, Chris A (388J)
Hi David, The latest Nutch release candidate (1.1, http://svn.apache.org/repos/asf/lucene/nutch/tags/1.1) includes the tika-parser plugin, which provides a JpegParser (see here: http://bit.ly/b0zRX8) that hopefully can suit your needs. Let me know what you think. Cheers, Chris On 4/10/10 6:

Re: [DISCUSS] Board resolution for Nutch as TLP

2010-04-11 Thread Mattmann, Chris A (388J)
Hi Dogacan, +1 to calling it a "web search platform", since I agree, it’s not just a crawler. Cheers, Chris On 4/11/10 11:40 AM, "Doğacan Güney" wrote: > Hi, > > On Sat, Apr 10, 2010 at 16:32, Jukka Zitting wrote: >> Hi, >> >> On Fri, Apr 9, 2010 at 6:52 PM, Andrzej Bialecki wrote: >>> WH

Re: [VOTE 2] Board resolution for Nutch as TLP

2010-04-12 Thread Mattmann, Chris A (388J)
+1, thanks for pushing this forward Andrzej! Cheers, Chris On 4/12/10 4:32 AM, "Doğacan Güney" wrote: On Mon, Apr 12, 2010 at 14:08, Andrzej Bialecki wrote: > Hi, > > Take two, after s/crawling/search/ ... > > Following the discussion, below is the text of the proposed Board > Resolution to v

Re: [VOTE] Apache Nutch 1.1 Release Candidate #1

2010-04-15 Thread Mattmann, Chris A (388J)
*nudge* Hi guys, so far we have 2 +1 votes on this RC from myself and Andrzej -- another PMC member review would be great so I can push this release out... Thanks! Cheers, Chris On 4/9/10 9:19 AM, "Andrzej Bialecki" wrote: > On 2010-04-07 07:14, Mattmann, Chris A (388J) wrote

Re: [VOTE] Apache Nutch 1.1 Release Candidate #1

2010-04-16 Thread Mattmann, Chris A (388J)
Hi Sami, > I did not yet have time to functionally review the package but I > spotted couple of things: > > -I ran rat (this should really be integrated to the build) and fixed > few java source files that were lacking license headers. Saw that, thanks. I can cut a new RC later today with your

Re: [VOTE 2] Board resolution for Nutch as TLP

2010-04-16 Thread Mattmann, Chris A (388J)
w00t! On 4/16/10 1:12 PM, "Andrzej Bialecki" wrote: On 2010-04-12 13:08, Andrzej Bialecki wrote: > Hi, > > Take two, after s/crawling/search/ ... > > Following the discussion, below is the text of the proposed Board > Resolution to vote upon. > > [] +1. Request the Board make Nutch a TLP > []

[VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-25 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc2/ The major difference between this release and rc #1 is the application of NUTCH-812 - Crawl.java incorrectly uses the Generator API resul

Re: Running ANT; was -- Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
bove. Cheers, Chris On 4/26/10 7:24 AM, "David M. Cole" wrote: At 10:55 PM -0700 4/25/10, Mattmann, Chris A (388J) wrote: >Most folks that use Nutch are likely >familiar with running ant IMHO. I guess then I fall into the category of "not most folks." Have been run

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
P that you delay this release by a few weeks and have the vote done under the auspices of the Nutch PMC? Cheers, Grant On Apr 26, 2010, at 1:55 AM, Mattmann, Chris A (388J) wrote: > Hi Folks, > > I have posted an updated candidate for the Apache Nutch 1.1 release. The > sourc

Re: [VOTE] Apache Nutch 1.1 Release Candidate #2

2010-04-26 Thread Mattmann, Chris A (388J)
Hey Andrzej, Okey dokey, np! Let's get the patch in first :) I can cut as many RCs as needed. Cheers, Chris On 4/26/10 11:30 AM, "Andrzej Bialecki" wrote: On 2010-04-26 17:19, Mattmann, Chris A (388J) wrote: > Hi Grant, > > Thanks. I think it actually makes sense to f

[VOTE] Apache Nutch 1.1 Release Candidate #3

2010-05-08 Thread Mattmann, Chris A (388J)
Hi Folks, I have posted an updated candidate for the Apache Nutch 1.1 release. The source code is at: http://people.apache.org/~mattmann/apache-nutch-1.1/rc3/ The major differences between this release and rc #2 are the application of: NUTCH-816, NUTCH-732, NUTCH-815, NUTCH-814, and NUTCH-812 ba