Thanks Julien for the prompt response.

Actually, since the 1.9 model is entirely plugin-based, I shouldn't expect an
ivy.xml to carry an Elasticsearch config as it does in 2.x. So please ignore
that comment.

Yes, I meant HDFS (I'm new to big data and Hadoop). Isn't HBase the default
one for 1.9 too?

Based on your clarification, perhaps this article is a bit misleading:
http://www.infoq.com/articles/nioche-apache-nutch2
Maybe there should be a follow-up to that article.

Thanks,
Iqbal Shaikh
________________________________________
From: Julien Nioche [[email protected]]
Sent: 29 August 2014 12:41
To: [email protected]
Subject: Re: Nutch Confusion

Hi Iqbal,

> Am doing a POC to help decide if we should be using Nutch 1.9 or 2.2.1
> version.
>
> We would be indexing our crawled data in ElasticSearch 1.x version.
>
> I know the 2.2.1 version provides OTB support for Elastic 0.x version but
> to use 2.x I need to change the code (ElasticWriter.java). This means it's a
> customised Nutch installation, which I would rather avoid.
>
> However even though 1.9 doesn't provide Elastic as default it does support
> 1.x OTB which means no code change at all. And this is a big advantage.
>

what do you mean by '1.9 doesn't provide ES by default'?


>
> I don't really need the flexibility provided by GORA as we're ok to use
> HBase.


do you mean HDFS?


> Also 2.x doesn't seem to have periodic commits compared to 1.9
>
> Therefore I was wondering what others think, as I am not sure about the
> roadmap going forward: are we going to cease 1.x at some point and migrate
> the missing functionality to 2.x, or are we going to continue to have two
> parallel versions?
>

more likely two parallel versions. 2.x is not making much progress. IMHO of
the two versions 1.x is not the one which is going to die first ;-)


>
> Any suggestion to help me make my decision please?
>

See discussion on this list (
http://www.mail-archive.com/[email protected]/msg12550.html). 1.x is
more robust, faster and more actively maintained. Since it sounds like you
don't need any features specific to 2.x, I'd recommend using 1.x.

HTH

Julien




>
> Thanks,
>
> Iqbal Shaikh
> Transform is a trading division of Engine Partners UK LLP, a limited
> liability partnership registered in England & Wales with registered number
> OC365812.
> Our registered office is at 60 Great Portland Street, London  W1W 7RT,
> United Kingdom.
> A list of our members is open for inspection at our registered office.




--

Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble

