Hi Stack, thanks for your reply.

I'm running on small instances. It's a custom single thread data loader, no MR.

You're right, and the the hadoop  dfs -fs hdfs://domU-12-31-39-00-E9-23:50001/ 
-lsr /hbase command worked and showed all the hbase files, so that looks 
better. I doubled the lease times for master and region servers, but still get 
a reliable timeout. The exact error is:

java.net.SocketTimeoutException: timed out waiting for rpc response
        at org.apache.hadoop.ipc.Client.call(Client.java:514)
        at 
org.apache.hadoop.hbase.ipc.HbaseRPC$Invoker.invoke(HbaseRPC.java:210)
        at $Proxy1.batchUpdate(Unknown Source)
        at org.apache.hadoop.hbase.HTable$8.call(HTable.java:766)
        at org.apache.hadoop.hbase.HTable$8.call(HTable.java:764)
        at 
org.apache.hadoop.hbase.HTable.getRegionServerWithRetries(HTable.java:1037)
        at org.apache.hadoop.hbase.HTable.commit(HTable.java:763)
        at org.apache.hadoop.hbase.HTable.commit(HTable.java:744)





----- Original Message ----
From: stack <[EMAIL PROTECTED]>
To: [email protected]
Sent: Friday, July 25, 2008 11:59:56 AM
Subject: Re: failure after importing 42million rows

Mark Snow wrote:
> I'm running a hbase data import on 0.1.3. After 42million rows, the import 
> fails with an RPC timeout exception. I've tried twice- once on a 2 node 
> cluster and once on a 10 node cluster (ec2 with the same configuration) and 
> it failed both times in the same spot, somewhere between 42 and 43 million 
> rows. 
Small, medium, or X-large instances?

> Where should I look to debug this?
>
> >From the hbase shell, I can query the table and see the rows have been 
> >inserted, but when I do a 'hadoop dfs -ls' I don't see the /hbase dir I 
> >specified, so I'm suspicious it's not storing the data into dfs, and unsure 
> >where it is storing this data.
>  

The $HADOOP_HOME that you are running the 'hadoop dfs -ls' under has in 
its conf file hdfs://domU-12-31-39-00-E9-23:5001/ as the  fs.default.name?

Perhaps 'hadoop  dfs -fs hdfs://domU-12-31-39-00-E9-23:50001/ -lsr 
/hbase' works?

Otherwise, nothing untoward in what you sent in email.  Whats the RPC 
error you're seeing?  Try things like upping your lease periods.    Try 
doubling hbase.regionserver.lease.period and hbase.master.lease.period.  
Are you loading via MR or via a custom script?  If the former, are 
TaskTrackers running on all nodes beside Regionservers and Datanodes?

St.Ack


      

Reply via email to