thanks for the valuable suggestions julien
On Tue, Jul 15, 2014 at 4:59 PM, Julien Nioche < [email protected]> wrote: > > > > @julien > > i just started with latest version, > > > > [big sigh] it used to be called NutchGora which was probably a better name > for it. People (reasonnably) expect a 2.x version to be better than the 1.x > one and the de-facto version to go for. > > 2.x is not as stable as 1.x, it lacks some of the 1.x features, is a lot > slower, is more complex to install and configure.... but it can do some > things that 1.x can't (e.g resumable fetch and parse steps). So unless you > have a specific reason to use 2.x, my advice would be to go for 1.x. Of > course if you want to contribute and make 2.x better, that would be great! > It needs to be used in order to get improved. > > > > will check the 1.8 version, > > > > Pull the code from trunk instead - we've fixed loads of bugs recently > > > > can you suggest me some tutorials, so that i can quickly start with > custom > > Indexwriter. > > > > Look at the indexer-solr and indexer-elastic plugins + the nutch index > command. > > Julien > > > > > > > > > -yeshwanth > > > > > > > > On Tue, Jul 15, 2014 at 4:12 PM, Talat Uyarer <[email protected]> wrote: > > > > > Hi Yeshwanth, > > > > > > Our last stable relase (Nutch 2.2.1) doesn't support higher version > > > than 0.90 version of hbase. If you use Nutch 2.x trunk version you can > > > use Hbase 0.94 version with gora 0.4. Unfortunately Gora 0.4 release > > > doesn't support Hadoop 2.x. If you want to use your own structure you > > > should use gora trunk version and you should change nutch ivy > > > dependecy for using your gora trunk version. > > > > > > Talat > > > > > > > > > 2014-07-15 13:31 GMT+03:00 yeshwanth kumar <[email protected]>: > > > > hi , > > > > > > > > i am using hbase 0.94.10 on top of hadoop 2.2. > > > > > > > > now i need to crawl the websites and store the results in hbase. > > > > i saw that nutch doesn't have integration with gora 0.4 and higher > > > versions > > > > of hbase. > > > > > > > > i went through nutch java api documentation for the possibility of > > > crawling > > > > through custom code. > > > > where i found the nutch is totally dependent on gora. > > > > i don't see any other possible ways here. > > > > > > > > can someone suggest me a way to store the crawled data using Nutch > > into > > > > hbase > > > > > > > > > > > > thanks, > > > > yeshwanth > > > > > > > > > > > > -- > > > Talat UYARER > > > Websitesi: http://talat.uyarer.com > > > Twitter: http://twitter.com/talatuyarer > > > Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304 > > > > > > > > > -- > > Open Source Solutions for Text Engineering > > http://digitalpebble.blogspot.com/ > http://www.digitalpebble.com > http://twitter.com/digitalpebble >

