Re: Nutch Integration with hbase 94.x and hadoop 2.2

Julien Nioche Tue, 15 Jul 2014 04:30:07 -0700

>
> @julien
> i just started with latest version,
>

[big sigh] it used to be called NutchGora, which was probably a better name
for it. People (reasonably) expect a 2.x version to be better than the 1.x
one and to be the de facto version to go for.


2.x is not as stable as 1.x, it lacks some of the 1.x features, is a lot
slower, and is more complex to install and configure... but it can do some
things that 1.x can't (e.g. resumable fetch and parse steps). So unless you
have a specific reason to use 2.x, my advice would be to go for 1.x. Of
course if you want to contribute and make 2.x better, that would be great!
It needs to be used in order to improve.


> will check the 1.8 version,
>

Pull the code from trunk instead - we've fixed loads of bugs recently.


> can you suggest some tutorials, so that i can quickly get started with a
> custom IndexWriter?
>

Look at the indexer-solr and indexer-elastic plugins + the nutch index
command.
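
For orientation only, here is a rough, self-contained sketch of the open / write / commit / close lifecycle that an index writer of this kind typically follows. Everything in it is an assumption made for illustration: `DocWriter` is a hypothetical stand-in interface, not Nutch's own `IndexWriter`, and `BatchingDocWriter`, `storedCount` and `pendingCount` are made-up names. The real method signatures differ between Nutch versions, so take them from the indexer-solr source rather than from this sketch.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for an index-writer contract. This is NOT the Nutch
// interface; it only mirrors the general open/write/commit/close lifecycle.
interface DocWriter {
    void open();
    void write(Map<String, String> doc);
    void commit();
    void close();
}

// Buffers documents and flushes them in batches, as a writer talking to a
// remote store might. The "stored" list stands in for the real backend.
class BatchingDocWriter implements DocWriter {
    private final int batchSize;
    private final List<Map<String, String>> buffer = new ArrayList<>();
    private final List<Map<String, String>> stored = new ArrayList<>();
    private boolean open;

    BatchingDocWriter(int batchSize) {
        if (batchSize < 1) {
            throw new IllegalArgumentException("batchSize must be >= 1");
        }
        this.batchSize = batchSize;
    }

    @Override
    public void open() {
        open = true;
    }

    @Override
    public void write(Map<String, String> doc) {
        if (!open) {
            throw new IllegalStateException("writer is not open");
        }
        buffer.add(new LinkedHashMap<>(doc)); // defensive copy of the fields
        if (buffer.size() >= batchSize) {
            commit(); // flush as soon as a full batch has accumulated
        }
    }

    @Override
    public void commit() {
        stored.addAll(buffer);
        buffer.clear();
    }

    @Override
    public void close() {
        commit(); // flush whatever is still pending before shutting down
        open = false;
    }

    int storedCount() {
        return stored.size();
    }

    int pendingCount() {
        return buffer.size();
    }
}

class IndexWriterSketch {
    public static void main(String[] args) {
        BatchingDocWriter writer = new BatchingDocWriter(2);
        writer.open();
        String[] urls = {"http://a.example/", "http://b.example/", "http://c.example/"};
        for (String url : urls) {
            Map<String, String> doc = new LinkedHashMap<>();
            doc.put("id", url);
            doc.put("title", "page at " + url);
            writer.write(doc);
        }
        System.out.println(writer.storedCount() + " stored, "
                + writer.pendingCount() + " pending");
        writer.close();
        System.out.println(writer.storedCount() + " stored after close");
    }
}
```

A real plugin would replace the in-memory list with calls to the target store and would be registered through the plugin's descriptor, which this sketch leaves out.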

Julien



>
>
> -yeshwanth
>
>
>
> On Tue, Jul 15, 2014 at 4:12 PM, Talat Uyarer <[email protected]> wrote:
>
> > Hi Yeshwanth,
> >
> > Our last stable release (Nutch 2.2.1) doesn't support HBase versions
> > higher than 0.90. If you use the Nutch 2.x trunk version you can
> > use HBase 0.94 with Gora 0.4. Unfortunately the Gora 0.4 release
> > doesn't support Hadoop 2.x. If you want to use your own setup you
> > should use the Gora trunk version and change the Nutch Ivy
> > dependency to point at your Gora trunk build.
> >
> > Talat
> >
> >
> > 2014-07-15 13:31 GMT+03:00 yeshwanth kumar <[email protected]>:
> > > hi ,
> > >
> > > i am using hbase 0.94.10 on top of hadoop 2.2.
> > >
> > > now i need to crawl the websites and store the results in hbase.
> > > i saw that nutch doesn't have integration with gora 0.4 and higher
> > > versions of hbase.
> > >
> > > i went through nutch java api documentation for the possibility of
> > > crawling through custom code.
> > > where i found the nutch is totally dependent on gora.
> > > i don't see any other possible ways here.
> > >
> > > can someone suggest a way to store the crawled data using Nutch
> > > into hbase?
> > >
> > >
> > > thanks,
> > > yeshwanth
> >
> >
> >
> > --
> > Talat UYARER
> > Websitesi: http://talat.uyarer.com
> > Twitter: http://twitter.com/talatuyarer
> > Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
> >
>



-- 

Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble
