RE: Scan addFamily vs FamilyFilter(EQUAL, ...)

2012-05-31 Thread Anoop Sam John
Hi, As per my understanding of the Scan code in your scenario where you want to go with scanning of some CFs ( not all) You go with Scan#addFamily. The FamilyFilter also doing the same thing. But there is a difference in the performance. When one specify the CFs in the scan, the scanner

Re: Scan addFamily vs FamilyFilter(EQUAL, ...)

2012-05-31 Thread Stack
On Wed, May 30, 2012 at 11:18 PM, Anoop Sam John anoo...@huawei.com wrote: @Stack One thing I ran into when using the Scan.addFamily / Scan.addColumn is that those two methods overwrite each other. In the Scan#addColumn javadoc it is clearly telling about this overwrites...   So this seems

Re: Problems with scan after lot of Puts

2012-05-31 Thread Ondřej Stašek
Hallo J-D. Thanks for reply. I've modified my code to use scanner copies - table.getScanner(new Scan(scan)) and run it again. Even after that I got an error: 12/05/31 10:42:39 INFO hbase.TestPutScan: Run 5 put 100 rows 12/05/31 10:44:09 INFO hbase.TestPutScan: Run 5 scan + del every

RE: Scan addFamily vs FamilyFilter(EQUAL, ...)

2012-05-31 Thread Ramkrishna.S.Vasudevan
Just to add on. The java doc clearly says in FamilyFilter that * If an already known column family is looked for, use {@link org.apache.hadoop.hbase.client.Get#addFamily(byte[])} * directly rather than a filter. So addFamily should be better. Regards Ram -Original Message- From:

Re: performance of a hbase map/reduce job

2012-05-31 Thread Kevin O'dell
It only matters if you are using them all. On Wed, May 30, 2012 at 12:15 AM, Ey-Chih chow eyc...@gmail.com wrote: By the way, the number of maximal client connections is set to default value, i.e. 60. Does this matter? Thanks. Ey-Chih Chow On May 29, 2012, at 11:48 PM, Ey-Chih chow

Re: hbase data

2012-05-31 Thread Andrew Nguyen
I am very interested in this also. I posed the question somewhere a couple years ago and hadn't heard anything. We decided to go with hbase to store a working set of the data - data that we would want to view with low latency and relatively randomly. Then, we store everything else to HDFS

Re: HBase Hush Application

2012-05-31 Thread Richipal
I had a similar problem, I fixed it. The reason for the problem was in my client code library I was referring a different version of hadoop-core and hbase, and running a different version, I changed my code libraries to have the same version as I was running and that took care of the problem.

HBase 0.94 security configurations

2012-05-31 Thread Amit Sela
Hi all, I'm upgrading our cluster with the following versions: HBase 0.90.2 to 0.94.0 Hadoop 0.20.3 to 1.0.3 zookeeper 3.3.2 to 3.4.3 As a first step, I'm trying to run some tests on my PC and I get a SecurityException from zookeeper: SecurityException: java.lang.SecurityException: Unable to

Re: HBase 0.94 security configurations

2012-05-31 Thread Andrew Purtell
See https://cwiki.apache.org/ZOOKEEPER/zookeeper-and-sasl.html For a fully baked (but simple) example configuration, have a look at https://github.com/apurtell/tm-ec2-demo/ . Start with https://github.com/apurtell/tm-ec2-demo/blob/master/bin/image/tarball/setup-remote On Thu, May 31, 2012 at

Task tracker timeout with filtered table scan

2012-05-31 Thread Bryan Keller
I have a large table that I am running a map reduce job on. The job scans for a particular column value in the table using a TableInputFormat with a filter on the scan. This value only matches a few rows, so most of the rows are filtered out. The problem is that the TableInputFormat will not

Re: HBase 0.94 security configurations

2012-05-31 Thread Amit Sela
I still don't understand if it is optional to use the security or not ? if i'll set the following in hbase-site.xml: property namehbase.zookeeper.property.requireClientAuthScheme/name value/value descriptionProperty from ZooKeeper's config zoo.cfg.

Re: Problems with scan after lot of Puts

2012-05-31 Thread Jean-Daniel Cryans
There's concurrent thread on the mailing list that refers to atomicity issues in 0.90 and issues with scans, may I suggest you run the test on 0.92.1 or 0.94.0? I did my testing on 0.94 and didn't get any issues after fixing the scanner. J-D On Thu, May 31, 2012 at 3:05 AM, Ondřej Stašek

Re: HBase 0.94 security configurations

2012-05-31 Thread Amit Sela
I'm trying to run a test for HBase (some think we wrote, internal) on my laptop - runs perfectly with the old versions of Hadoop, HBase and ZooKeeeper. After deploying the new versions and re-compiling our code, I run the test. When I try to instantiate new HBaseAdmin(getConfiguration()) -

Re: HBase 0.94 security configurations

2012-05-31 Thread Andrew Purtell
Security is optional. If you don't set any SASL protected ACLs on znodes, then the client doesn't need to authenticate, you should not add any security options to the site file like you currently are, and the message on startup about the state of JAAS configuration is informative only and is

Re: HBase 0.94 security configurations

2012-05-31 Thread Andrew Purtell
Server null usually means you haven't configured hbase.zookeeper.quorum in your client's hbase-site.xml file. And that is usually because you are using a Configuration not created by HBaseConfiguration.create() If so the JAAS warning is a red herring. On May 31, 2012, at 8:52 PM, Amit Sela

Re: HBase 0.94 security configurations

2012-05-31 Thread Andrew Purtell
I mean of course server null means that hbase.zookeeper.quorum config property is unset. And the two most common reasons are: 1. Not defined in the site file 2. Configuration object not created with HBaseConfiguration.create() I hope this is clearer. On May 31, 2012, at 8:59 PM, Andrew

Re: hosts unreachables

2012-05-31 Thread Cyril Scetbon
I'm still having region nodes that crash :( You can see reasons on 2 region nodes : *node1 :* 2012-05-31 17:57:03,937 INFO org.apache.hadoop.hbase.regionserver.HRegion: Starting compaction on region ise,ID-TST-100-1Rw6ue7L8i

Re: HBase 0.94 security configurations

2012-05-31 Thread Amit Sela
I did some debug and the code does calls HBaseConfiguration.create() - since my test extends HBaseTestingUtility. and conf.properties.get(hbase.zookeeper.quorum) returns localhost. Is that properly set, or should it be something else ? keep in my it's a test running on my laptop, so it seems OK

Re: HBase 0.94 security configurations

2012-05-31 Thread Andrew Purtell
Great, now remove any security related Zookeeper properties that you added in hbase-site.xml. Only keep hbase.zookeeper.quorum. On May 31, 2012, at 9:52 PM, Amit Sela am...@infolinks.com wrote: I did some debug and the code does calls HBaseConfiguration.create() - since my test extends

Re: hosts unreachables

2012-05-31 Thread Jean-Daniel Cryans
What I'm seeing looks more like GC issues. Start reading this: http://hbase.apache.org/book.html#gc J-D On Thu, May 31, 2012 at 12:10 PM, Cyril Scetbon cyril.scet...@free.fr wrote: I'm still having region nodes that crash :( You can see reasons on 2 region nodes : *node1 :* 2012-05-31

Re: hosts unreachables

2012-05-31 Thread Cyril Scetbon
On 5/31/12 11:00 PM, Jean-Daniel Cryans wrote: What I'm seeing looks more like GC issues. Start reading this: http://hbase.apache.org/book.html#gc J-D Hi, Really not sure cause I've enabled gcc's verbose option and I don't see anything taking a long time. Maybe I can check again on one node.

Re: hosts unreachables

2012-05-31 Thread Jean-Daniel Cryans
Both, also you could bigger log snippets (post them on something like pastebin.com) and we could see more evidence of the issue. J-D On Thu, May 31, 2012 at 2:09 PM, Cyril Scetbon cyril.scet...@free.fr wrote: On 5/31/12 11:00 PM, Jean-Daniel Cryans wrote: What I'm seeing looks more like GC

Re: EC2 AMI (from cloudera??) for hbase?

2012-05-31 Thread Elliott Clark
Whirr( http://whirr.apache.org/ ) has the ability to spin up an hbase/hadoop cluster. I haven't played with it in a while though; let us know what you find. On Thu, May 31, 2012 at 7:37 PM, Yang tedd...@gmail.com wrote: I'm trying to kick up a hbase + hadoop cluster really easily (I am a