Re: Nutch Integration with hbase 94.x and hadoop 2.2

yeshwanth kumar Tue, 15 Jul 2014 05:16:32 -0700

Thanks for the valuable suggestions, Julien.


On Tue, Jul 15, 2014 at 4:59 PM, Julien Nioche <
[email protected]> wrote:

> >
> > @julien
> > i just started with latest version,
> >
>
> [big sigh] it used to be called NutchGora which was probably a better name
> for it. People (reasonably) expect a 2.x version to be better than the 1.x
> one and the de-facto version to go for.
>
> 2.x is not as stable as 1.x, it lacks some of the 1.x features, is a lot
> slower, is more complex to install and configure.... but it can do some
> things that 1.x can't (e.g. resumable fetch and parse steps). So unless you
> have a specific reason to use 2.x, my advice would be to go for 1.x. Of
> course if you want to contribute and make 2.x better, that would be great!
> It needs to be used in order to get improved.
>
>
> > will check the 1.8 version,
> >
>
> Pull the code from trunk instead - we've fixed loads of bugs recently
>
>
> > can you suggest me some tutorials, so that i can quickly start with a
> > custom IndexWriter.
> >
>
> Look at the indexer-solr and indexer-elastic plugins + the nutch index
> command.
>
> Julien
>
>
>
> >
> >
> > -yeshwanth
> >
> >
> >
> > On Tue, Jul 15, 2014 at 4:12 PM, Talat Uyarer <[email protected]> wrote:
> >
> > > Hi Yeshwanth,
> > >
> > > Our last stable release (Nutch 2.2.1) doesn't support HBase versions
> > > higher than 0.90. If you use the Nutch 2.x trunk version you can
> > > use HBase 0.94 with Gora 0.4. Unfortunately the Gora 0.4 release
> > > doesn't support Hadoop 2.x. If you want to keep your own setup you
> > > should build the Gora trunk version and change the Nutch ivy
> > > dependency so that it uses your Gora trunk build.
> > >
> > > Talat
> > >
> > >
> > > 2014-07-15 13:31 GMT+03:00 yeshwanth kumar <[email protected]>:
> > > > hi ,
> > > >
> > > > i am using hbase 0.94.10 on top of hadoop 2.2.
> > > >
> > > > now i need to crawl the websites and store the results in hbase.
> > > > i saw that nutch doesn't have integration with gora 0.4 and higher
> > > > versions of hbase.
> > > >
> > > > i went through nutch java api documentation for the possibility of
> > > > crawling through custom code.
> > > > where i found the nutch is totally dependent on gora.
> > > > i don't see any other possible ways here.
> > > >
> > > > can someone suggest me a way to store the crawled data using Nutch
> > > > into hbase
> > > >
> > > >
> > > > thanks,
> > > > yeshwanth
> > >
> > >
> > >
> > > --
> > > Talat UYARER
> > > Websitesi: http://talat.uyarer.com
> > > Twitter: http://twitter.com/talatuyarer
> > > Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
> > >
> >
>
>
>
> --
>
> Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
> http://twitter.com/digitalpebble
>
