Re: Where to run Thrift

2012-07-31 Thread syed kather
Eric, why are you trying to run Thrift on all the servers? Why don't you run it only on the master machine? After seeing your post, I also wondered whether we need a separate Thrift setup or not. Is it enough to run Thrift on a single machine? Thanks and Regards, S SYED

How to query by rowKey-infix

2012-07-31 Thread Christian Schäfer
Hello there, I designed a row key for queries that need the best performance (~100 ms), which looks like this: userId-date-sessionId. These queries (scans) are always based on a userId and sometimes additionally on a date. That's no problem with the key above. However, another kind of queries
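A minimal sketch of how the prefix scans described above (by userId, or by userId plus date) map onto a start/stop row pair. It does not use any HBase client API: the table is simulated with a sorted Python list, and the helper names `prefix_scan_bounds` and `scan` as well as the sample keys are hypothetical. The stop row is computed the usual way, by incrementing the last byte of the prefix.

```python
from bisect import bisect_left


def prefix_scan_bounds(prefix):
    """Return (start_row, stop_row) covering every key that begins with prefix.

    stop_row is exclusive; an empty stop_row means "scan to the end".
    """
    stop = bytearray(prefix)
    # Trailing 0xFF bytes cannot be incremented, so drop them first.
    while stop and stop[-1] == 0xFF:
        stop.pop()
    if stop:
        stop[-1] += 1
    return prefix, bytes(stop)


def scan(sorted_keys, prefix):
    """Simulate a range scan over an in-memory, lexicographically sorted key list."""
    start, stop = prefix_scan_bounds(prefix)
    lo = bisect_left(sorted_keys, start)
    hi = bisect_left(sorted_keys, stop) if stop else len(sorted_keys)
    return sorted_keys[lo:hi]


# Hypothetical keys in the userId-date-sessionId layout.
keys = sorted([
    b"user1-20120730-s1",
    b"user1-20120731-s2",
    b"user1-20120731-s3",
    b"user10-20120731-s4",
    b"user2-20120731-s5",
])

# Including the trailing "-" keeps user10 out of the user1 scan.
by_user = scan(keys, b"user1-")
by_user_and_date = scan(keys, b"user1-20120731-")
```

Both lookups are contiguous ranges because the filtered fields form a prefix of the key. A filter on sessionId alone is not a prefix, which is why it cannot be expressed as a single start/stop pair.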

Re: How to query by rowKey-infix

2012-07-31 Thread Jerry Lam
Hi Chris: I'm thinking about building a secondary index for primary key lookup, then querying using the primary keys in parallel. I'm interested to see if there are other options too. Best Regards, Jerry On Tue, Jul 31, 2012 at 11:27 AM, Christian Schäfer syrious3...@yahoo.de wrote: Hello there,

Re: How to query by rowKey-infix

2012-07-31 Thread Matt Corgan
When deciding between a table scan vs secondary index, you should try to estimate what percent of the underlying data blocks will be used in the query. By default, each block is 64KB. If each user's data is small and you are fitting multiple users per block, then you're going to need all the
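One rough way to make the block estimate above concrete, as a back-of-the-envelope sketch only. It assumes matching users are spread independently and uniformly across blocks, which real data may not satisfy. The function name and the sample sizes (1 KB and 64 KB per user, 10% of users matching) are hypothetical; only the 64 KB block size comes from the message.

```python
def fraction_of_blocks_touched(block_bytes, bytes_per_user, match_fraction):
    """Estimate the fraction of data blocks holding at least one matching user.

    Assumes each user matches independently with probability match_fraction.
    """
    users_per_block = max(1, block_bytes // bytes_per_user)
    # A block is skipped only if none of its users match.
    return 1.0 - (1.0 - match_fraction) ** users_per_block


BLOCK_BYTES = 64 * 1024  # default block size mentioned above

# Small users: about 64 users share one block.
small_users = fraction_of_blocks_touched(BLOCK_BYTES, 1024, 0.10)

# Large users: roughly one user per block.
large_users = fraction_of_blocks_touched(BLOCK_BYTES, 64 * 1024, 0.10)
```

Under this model, a query that matches 10% of users touches well over 99% of blocks when users are small, but only about 10% of blocks when each user fills a block.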

Re: Parallel scans

2012-07-31 Thread Tom Brown
I think you could do it manually by looking up all the different regions and starting a separate scan for each region. Not quite as handy as the built-in multi get, but essentially the same. Of course, that leaves the question of processing-- If you're processing it in a single-threaded
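The manual approach described above (one scan per region, run concurrently) can be sketched as follows. This is an illustration under stated assumptions, not HBase client code: `TABLE`, `REGIONS`, `scan_region`, and `parallel_scan` are hypothetical stand-ins, with the table simulated by a sorted list and region boundaries given as (start_row, stop_row) pairs.

```python
from bisect import bisect_left
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a table: a sorted list of row keys.
TABLE = sorted(b"row%03d" % i for i in range(100))

# Hypothetical region boundaries; b"" means open-ended on that side.
REGIONS = [
    (b"", b"row025"),
    (b"row025", b"row050"),
    (b"row050", b"row075"),
    (b"row075", b""),
]


def scan_region(bounds):
    """Scan one region: start row inclusive, stop row exclusive."""
    start, stop = bounds
    lo = bisect_left(TABLE, start)
    hi = bisect_left(TABLE, stop) if stop else len(TABLE)
    return TABLE[lo:hi]


def parallel_scan(regions, max_workers=4):
    """Run one scan per region concurrently and concatenate the results."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() returns results in region order, so the output stays sorted.
        chunks = list(pool.map(scan_region, regions))
    return [row for chunk in chunks for row in chunk]


rows = parallel_scan(REGIONS)
```

The threads here only illustrate the fan-out. Whether the downstream processing can keep up with several concurrent scans is a separate question, as the message notes.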

Re: Retrieve Put timestamp

2012-07-31 Thread Asaf Mesika
What do you mean by using TS as version? Are you determining the ts long value before and then setting it in the Put object? If so, I think you can use a specific cell as a counter (Sequence in Oracle language, or Auto Increment column in MySQL). In that case of course you need the value of the

Re: Retrieve Put timestamp

2012-07-31 Thread Stack
On Mon, Jul 30, 2012 at 11:13 PM, Pablo Musa pa...@psafe.com wrote: Hey guys, in my application the HBase timestamp is used as version in my logic. I would like to know what is the best way to insert a new record and get its timestamp. I have come up with two possibilities: /* I could

Re: Where to run Thrift

2012-07-31 Thread Stack
On Tue, Jul 31, 2012 at 12:32 PM, Eric eric.x...@gmail.com wrote: I'm currently running thrift on all region server nodes. The reasoning is that you can run jobs on this cluster and these jobs, when using thrift, can connect to localhost. The drawback is that I'm running lots of thrift daemons

RE: Retrieve Put timestamp

2012-07-31 Thread Pablo Musa
What do you mean by using TS as version? My application uses versioning between client and server. So I am using the timestamp as a version. Are you determining the ts long value before and then setting it in the Put object? If so, I think you can use a specific cell as a counter. In that

Re: Null row key

2012-07-31 Thread Michael Segel
Which release? On Jul 31, 2012, at 5:13 PM, Mohit Anchlia mohitanch...@gmail.com wrote: I am seeing null row key and I am wondering how I got the nulls in there. Is it possible when using HBaseClient that a null row might have got inserted?

Re: Null row key

2012-07-31 Thread Mohit Anchlia
HBase 90.4 On Tue, Jul 31, 2012 at 4:18 PM, Michael Segel michael_se...@hotmail.com wrote: Which release? On Jul 31, 2012, at 5:13 PM, Mohit Anchlia mohitanch...@gmail.com wrote: I am seeing null row key and I am wondering how I got the nulls in there. Is it possible when using

Re: Null row key

2012-07-31 Thread Mohit Anchlia
Not sure how, but I am getting one null row per 9 writes when I do a GET and call result.getRow(). Is it even possible to write null rows? On Tue, Jul 31, 2012 at 4:49 PM, Mohit Anchlia mohitanch...@gmail.com wrote: HBase 90.4 On Tue, Jul 31, 2012 at 4:18 PM, Michael Segel

HBase won't run on OSX 10.8

2012-07-31 Thread Ryan Rawson
Hi all, Something has changed in how OSX and Java handle IPv6, and now you will get a log like: 2012-07-31 18:21:39,824 INFO org.apache.hadoop.hbase.master.HMaster: Server active/primary master; 0:0:0:0:0:0:0:0%0, 59736,1343784093521, sessionid=0x138dfc60416, cluster-up flag was=false

Re: HBase won't run on OSX 10.8

2012-07-31 Thread Andrew Purtell
-Djava.net.preferIPv4Stack=true? Does that still work? On Tue, Jul 31, 2012 at 6:24 PM, Ryan Rawson ryano...@gmail.com wrote: Hi all, Something has changed in how OSX and Java handle IPv6, and now you will get a log like: 2012-07-31 18:21:39,824 INFO

Re: HBase won't run on OSX 10.8

2012-07-31 Thread Ryan Rawson
I shall try that. I submitted a patch too that quashes the extra % where it is causing problems. On Tue, Jul 31, 2012 at 6:28 PM, Andrew Purtell apurt...@apache.org wrote: -Djava.net.preferIPv4Stack=true ? Does that still work? On Tue, Jul 31, 2012 at 6:24 PM, Ryan Rawson

Re: Zookeeper Client port issue

2012-07-31 Thread Mohammad Tariq
Hello Manu, Use only one HTable instance at a time, and do not forget to close it once you are done. If you need multiple HTables, use the HTablePool class instead. HTablePool acts as a pool for all tables. Regards, Mohammad Tariq On Wed, Aug 1, 2012 at 9:55 AM, Manu S manupk...@gmail.com

Re: Multiple CF and versions

2012-07-31 Thread Mohammad Tariq
Hello Mohit, It is advised not to go above two or three column families. This is because, when we have many column families, flushing and compaction can cause too many needless I/O operations, reducing performance. As far as versions are concerned, the default value is 3.