Excellent, I'll take a look and see what we can do. Thanks!

Alex


On Thu, May 2, 2013 at 3:57 AM, Julien Nioche <[email protected]
> wrote:

> See https://issues.apache.org/jira/browse/NUTCH-1047 which is in trunk for
> writing indexing plugins. You will have the same issues with the versions
> of HBase if you use GORA within your plugin so in your case a  more direct
> approach might be more appropriate. It would be good to help the GORA
> people with upgrading their version of HBase then using GORA within your
> custom indexing plugin, which would make it more generic and would indeed
> be a nice contribution.
>
> J.
>
>
> On 2 May 2013 04:55, AC Nutch <[email protected]> wrote:
>
> > Thanks a lot for the suggestion Julien, I suspected that might be the
> case
> > and I really appreciate the recommendation of using 1.x for robustness.
> >
> > Also that sounds like a wonderful idea regarding extending the indexer. I
> > think that's exactly what we'll do! Is this something you all would be
> > interested in having as part of the 1.x code base? We would be glad to
> > contribute it back to you all once we have done this.
> >
> > Alex
> >
> >
> > On Wed, May 1, 2013 at 4:25 PM, Julien Nioche <
> > [email protected]
> > > wrote:
> >
> > > Nutch 1.x is definitely more tested and robust than 2.x. Loads of work
> is
> > > done for the latter but the former is probably a safer option in
> > > production. You could use the pluggable indexer and send the documents
> to
> > > HBase (ideally via GORA)? This would be an elegant way of migrating
> from
> > > 1.x to 2.x BTW.
> > >
> > >
> > > On 1 May 2013 19:41, AC Nutch <[email protected]> wrote:
> > >
> > > > Hello All,
> > > >
> > > > Has anyone gotten the latest version of HBase 0.94.6 to work with
> Nutch
> > > 2.1
> > > > on Ubuntu with Hadoop >= 1.0.X. I keep getting the error:
> > > >
> > > > Exception in thread "main" org.apache.gora.util.GoraException:
> > > > java.lang.IllegalArgumentException: Not a host:port pair:
> > > >
> > > > Googling around I saw the suggestion to replace the hbase-0.90.4 jar
> > with
> > > > the hbase-0.94.6.jar from my hbase distro (btw I understand I'm
> trying
> > to
> > > > do something that is unsupported by using the latest hbase version).
> > The
> > > > suggestion didn't appear to work - I get the same error. Has anyone
> > > gotten
> > > > the latest HBase to work with Nutch 2.1 and if so, how did you get
> > around
> > > > this error?
> > > >
> > > > As a little bit of background, the overall problem I'm trying to
> solve
> > is
> > > > that I really want to use Nutch 2.1 as opposed to the 1.6 branch for
> > what
> > > > will become a production application. However, I have the requirement
> > of
> > > > using at least Hadoop 1.0.X which, as I understand it, is not
> supported
> > > by
> > > > HBase 0.90.x. On the other hand, Nutch 2.1 (or rather GORA) doesn't
> > > support
> > > > later HBase versions, which leaves me in quite the pickle - it seems
> > that
> > > > either I use an older Hadoop (which I can't do) or I use Nutch 1.6
> > > (which I
> > > > don't want to do). Any suggestions?
> > > >
> > >
> > >
> > >
> > > --
> > > *
> > > *Open Source Solutions for Text Engineering
> > >
> > > http://digitalpebble.blogspot.com/
> > > http://www.digitalpebble.com
> > > http://twitter.com/digitalpebble
> > >
> >
>
>
>
> --
> *
> *Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
> http://twitter.com/digitalpebble
>

Reply via email to