As a follow-up to my own question, which accurately describes the component call-stack of the pig script I included in my post?
pig -> mapreduce/hadoop -> Hbase pig -> Hbase -> mapreduce/hadoop On Tue, Apr 12, 2011 at 9:53 AM, Daniel Eklund <[email protected]> wrote: > This question might be better diagnosed as an Hbase issue, but since it's > ultimately a Pig script I want to use, I figure someone on this group could > help me out. I tried asking the IRC channel, but I think it was in a lull. > > My scenario: I want to use Pig to call an HBase store. > My installs: Apache Pig version 0.8.0-CDH3B4 --- hbase version: > hbase-0.90.1-CDH3B4. > My sample script: > > ----------- > A = load 'passwd' using PigStorage(':'); > rawDocs = LOAD 'hbase://daniel_product' > USING > org.apache.pig.backend.hadoop.hbase.HBaseStorage('base:testCol1'); > vals = foreach rawDocs generate $0 as val; > dump vals; > store vals into 'daniel.out'; > ----------- > > I am consistently getting a > Failed Jobs: > JobId Alias Feature Message Outputs > N/A rawDocs,vals MAP_ONLY Message: > org.apache.pig.backend.executionengine.ExecException: ERROR 2118: Timed out > trying to locate root region > at > org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:280) > > > Googling shows me similar issues: > > http://search-hadoop.com/m/RPLkD1bmY4l&subj=Re+Cannot+connect+HBase+to+Pig > > My current understanding is that somewhere in the interaction between Pig, > Hadoop, HBase, and Zookeper, there is a configuration file that needs to be > included in a classpath or a configuration directory somewhere. I have > tried various combinations of making hadoop aware of Hbase and vice-versa. > I have tried ZK running on its own, and also managed by HBase. > > Can someone explain the dependencies here? Any insight as to what I am > missing? What would your diagnosis of the above message be? > > thanks, > daniel > > > >
