Release planning

Andrzej Bialecki Tue, 04 Jan 2011 12:28:48 -0800

Hi users & devs,

As you probably know, there are currently two active lines of development for Nutch:

* Nutch trunk, a.k.a. Nutch 2.0: this is based on a completely redesigned storage layer that uses Apache Gora, which in turn can use various storage implementations such as HBase, Cassandra, and MySQL. This branch is still largely experimental and unstable, but work is progressing, and at the current pace I think a release should be possible within the next ~6 months. Another important addition on this branch is a REST API that allows using Nutch as a black-box crawling service.

* Nutch branch-1.3: this started as a snapshot of Nutch trunk just before merging with nutchbase (i.e. switching to Gora as a storage layer). This branch is still largely similar to the previous versions of Nutch, and uses Hadoop MapFile/SequenceFile and "segments". As compared with release 1.2 it does NOT ship with any search infrastructure, because all search functionality has been delegated to Solr (via SolrIndexer). This is BTW also true about Nutch trunk.

Regarding branch-1.2 (which is a maintenance branch after release 1.2), there have been pretty much no updates there, if any. Nutch committer resources are very limited (when it comes to active committers), so I don't expect any maintenance release from this branch to happen...

I think that, considering the relatively remote release date for Nutch 2.0, it would make sense to roll out a 1.3 release based on branch-1.3, after making sure that all critical patches from trunk have been merged in there.


What do you think?

--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
