HBASE garbage collection problem

2012-09-12 Thread Amlan Roy
Hi, I was doing some load testing on my cluster. I am writing to HBase (version 0.92.0) from 20 threads simultaneously. After running the program for some time, one of my machines got unresponsive. I checked the GC log and found occurrences of both concurrent mode failure and promotion failed
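A quick way to see how often each failure shows up in a GC log is to count the two phrases quoted in the post. The following is only a minimal sketch: the function name `count_gc_failures` and the assumption that each event appears as a plain substring on a log line are illustrative, not taken from the thread.

```python
import sys

# The two failure messages named in the post. Assumption: they appear
# verbatim as substrings of lines in the GC log.
MARKERS = ("concurrent mode failure", "promotion failed")


def count_gc_failures(lines):
    """Return a dict mapping each marker to the number of lines containing it."""
    counts = {marker: 0 for marker in MARKERS}
    for line in lines:
        for marker in MARKERS:
            if marker in line:
                counts[marker] += 1
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python gc_scan.py <path-to-gc-log> (the script name is hypothetical)
    with open(sys.argv[1], errors="replace") as log:
        for marker, n in count_gc_failures(log).items():
            print(f"{marker}: {n}")
```

A line that contains both phrases is counted once for each of them.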

HBase MapReduce - Using multiple tables as source

2012-08-06 Thread Amlan Roy
Hi, While writing a MapReduce job for HBase, can I use multiple tables as input? I think TableMapReduceUtil.initTableMapperJob() takes a single table as parameter. For my requirement, I want to specify multiple tables and scan instances. I read about MultiTableInputCollection in the document

RE: HBase MapReduce - Using multiple tables as source

2012-08-06 Thread Amlan Roy
, Mohammad Tariq wrote: Hello Amlan, Issue is still unresolved...Will get fixed in 0.96.0. Regards, Mohammad Tariq On Mon, Aug 6, 2012 at 5:01 PM, Amlan Roy amlan@cleartrip.com wrote: Hi, While writing a MapReduce job for HBase, can I use multiple tables as input? I think

RE: Hbase bkup options

2012-07-23 Thread Amlan Roy
to a different cluster. hadoop fs -copyToLocal will let you copy off the cluster. You could write your own code, but you don't get much gain over existing UNIX/Linux tools. On Jul 23, 2012, at 7:52 AM, Amlan Roy wrote: Hi, Is it feasible to do disk or tape backup for Hbase tables? I have

Reporting tool for Hbase

2012-07-12 Thread Amlan Roy
Hi, I am looking for a reporting tool that can use Hbase data as input. Any recommendation? I am using Pentaho PDI because it can use Hbase data as input. But I am getting a strange error. My cluster is running, I can access data from my client program. But Pentaho is giving the following

RE: Reporting tool for Hbase

2012-07-12 Thread Amlan Roy
? For reporting over Hbase, you can also take a look at Crux at http://github.com/sonalgoyal/crux Best Regards, Sonal [1] Crux: Reporting for HBase https://github.com/sonalgoyal/crux Nube Technologies http://www.nubetech.co http://in.linkedin.com/in/sonalgoyal On Sun, Jul 8, 2012 at 12:15 PM, Amlan Roy

RE: Recommended Production setup

2012-07-08 Thread Amlan Roy
is generally sensible except you can 'overlay' the zookeeper nodes on any of the other ones, no need to use dedicated machines for them. 2012/7/6 Jean-Daniel Cryans jdcry...@apache.org On Fri, Jul 6, 2012 at 3:01 AM, Amlan Roy amlan@cleartrip.com wrote: 1. I have used Hbase-0.92.0

Recommended Production setup

2012-07-06 Thread Amlan Roy
Hi All, We want to use Hbase in Production for our new Analytics module. I need some suggestions before I setup the production cluster. 1. I have used Hbase-0.92.0 with Hadoop-1.0.0 for our POC. Are these versions production ready? 2. Initial setup I have thought is: a. One