> > @julien > i just started with latest version, > [big sigh] it used to be called NutchGora which was probably a better name for it. People (reasonnably) expect a 2.x version to be better than the 1.x one and the de-facto version to go for.
2.x is not as stable as 1.x, it lacks some of the 1.x features, is a lot slower, is more complex to install and configure.... but it can do some things that 1.x can't (e.g resumable fetch and parse steps). So unless you have a specific reason to use 2.x, my advice would be to go for 1.x. Of course if you want to contribute and make 2.x better, that would be great! It needs to be used in order to get improved. > will check the 1.8 version, > Pull the code from trunk instead - we've fixed loads of bugs recently > can you suggest me some tutorials, so that i can quickly start with custom > Indexwriter. > Look at the indexer-solr and indexer-elastic plugins + the nutch index command. Julien > > > -yeshwanth > > > > On Tue, Jul 15, 2014 at 4:12 PM, Talat Uyarer <[email protected]> wrote: > > > Hi Yeshwanth, > > > > Our last stable relase (Nutch 2.2.1) doesn't support higher version > > than 0.90 version of hbase. If you use Nutch 2.x trunk version you can > > use Hbase 0.94 version with gora 0.4. Unfortunately Gora 0.4 release > > doesn't support Hadoop 2.x. If you want to use your own structure you > > should use gora trunk version and you should change nutch ivy > > dependecy for using your gora trunk version. > > > > Talat > > > > > > 2014-07-15 13:31 GMT+03:00 yeshwanth kumar <[email protected]>: > > > hi , > > > > > > i am using hbase 0.94.10 on top of hadoop 2.2. > > > > > > now i need to crawl the websites and store the results in hbase. > > > i saw that nutch doesn't have integration with gora 0.4 and higher > > versions > > > of hbase. > > > > > > i went through nutch java api documentation for the possibility of > > crawling > > > through custom code. > > > where i found the nutch is totally dependent on gora. > > > i don't see any other possible ways here. > > > > > > can someone suggest me a way to store the crawled data using Nutch > into > > > hbase > > > > > > > > > thanks, > > > yeshwanth > > > > > > > > -- > > Talat UYARER > > Websitesi: http://talat.uyarer.com > > Twitter: http://twitter.com/talatuyarer > > Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304 > > > -- Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

