Bill,  I have done all that both you and Jameson have suggested and still
get the same error.

I can telnet into the zookeeper.  I have also used the zkClient.sh and can
look at /hbase/rs to see the regionservers.
Should I be able to see anything at /hbase/root-region-server?

thanks,
daniel


On Tue, Apr 12, 2011 at 11:58 AM, Bill Graham <[email protected]> wrote:

> Yes, Pig's HBaseStorage using the HBase client to read/write directly
> to HBase from within a MR job, but chains to other Pig-generated MR
> jobs as needed to transform.
>
> Daniel, check that you have defined HBASE_CONF_DIR properly, or that
> you have hbase-site.xml in your classpath. Then try to telnet to the
> defined zookeeper host from the machine where the exception is being
> generated. There is some communication from Pig to HBase/ZK from the
> node that the client runs on before the MR jobs start on the cluster
> FYI.
>
>
> On Tue, Apr 12, 2011 at 8:40 AM, Jameson Lopp <[email protected]> wrote:
> > I'm by no means an expert, but I think it's the latter. My rudimentary
> > understanding is that pig uses HBaseStorage to load the data from hbase
> and
> > passes the input splits along to hadoop/MR. Feel free to correct me if
> I'm
> > wrong.
> > --
> > Jameson Lopp
> > Software Engineer
> > Bronto Software, Inc.
> >
> > On 04/12/2011 10:50 AM, Daniel Eklund wrote:
> >>
> >> As a follow-up to my own question, which accurately describes the
> >> component
> >> call-stack of the pig script I included in my post?
> >>
> >> pig ->  mapreduce/hadoop ->  Hbase
> >> pig  ->  Hbase ->  mapreduce/hadoop
> >>
> >>
> >>
> >> On Tue, Apr 12, 2011 at 9:53 AM, Daniel Eklund<[email protected]>
>  wrote:
> >>
> >>> This question might be better diagnosed as an Hbase issue, but since
> it's
> >>> ultimately a Pig script I want to use, I figure someone on this group
> >>> could
> >>> help me out. I tried asking the IRC channel, but I think it was in a
> >>> lull.
> >>>
> >>> My scenario:  I want to use Pig to call an HBase store.
> >>> My installs:   Apache Pig version 0.8.0-CDH3B4  --- hbase version:
> >>> hbase-0.90.1-CDH3B4.
> >>> My sample script:
> >>>
> >>> -----------
> >>> A = load 'passwd' using PigStorage(':');
> >>> rawDocs = LOAD 'hbase://daniel_product'
> >>>         USING
> >>> org.apache.pig.backend.hadoop.hbase.HBaseStorage('base:testCol1');
> >>> vals = foreach rawDocs generate $0 as val;
> >>> dump vals;
> >>> store vals into 'daniel.out';
> >>> -----------
> >>>
> >>> I am consistently getting a
> >>> Failed Jobs:
> >>> JobId   Alias   Feature Message Outputs
> >>> N/A     rawDocs,vals    MAP_ONLY        Message:
> >>> org.apache.pig.backend.executionengine.ExecException: ERROR 2118: Timed
> >>> out
> >>> trying to locate root region
> >>>         at
> >>>
> >>>
> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:280)
> >>>
> >>>
> >>> Googling shows me similar issues:
> >>>
> >>>
> >>>
> http://search-hadoop.com/m/RPLkD1bmY4l&subj=Re+Cannot+connect+HBase+to+Pig
> >>>
> >>> My current understanding is that somewhere in the interaction between
> >>> Pig,
> >>> Hadoop, HBase, and Zookeper, there is a configuration file that needs
> to
> >>> be
> >>> included in a classpath or a configuration directory somewhere.  I have
> >>> tried various combinations of making hadoop aware of Hbase and
> >>> vice-versa.
> >>> I have tried ZK running on its own, and also managed by HBase.
> >>>
> >>> Can someone explain the dependencies here?  Any insight as to what I am
> >>> missing?  What would your diagnosis of the above message be?
> >>>
> >>> thanks,
> >>> daniel
> >>>
> >>>
> >>>
> >>>
> >>
> >
>

Reply via email to