how client location a region/tablet?

2012-08-18 Thread Lin Ma
Hello guys, I am referencing the Big Table paper about how a client locates a tablet. In section 5.1 Tablet location, it is mentioned that client will cache all tablet locations, I think it means client will cache root tablet in METADATA table, and all other tablets in METADATA table (which means

Re: Can I specify the range inside of fuzzy rule in FuzzyRowFilter?

2012-08-18 Thread Michael Segel
What row keys are you skipping? Using your example... You have a start row of 200, and an end key of xFFxFFxFFxFFxFFxFF00350. Note that you could also write that end key as xFF(1..6) 01 since it looks like you're trying to match the 00 in positons 7 and 8 of your numeric string.

Re: issues copying data from one table to another

2012-08-18 Thread Michael Segel
Can you disable the table? How much free disk space do you have? Is this a production cluster? Can you upgrade to CDH3u5? Are you running a capacity scheduler or fair scheduler? Just out of curiosity, what would happen if you could disable the table, alter the table's max file size and then

Re: Hbase- Hadoop DFS size not decreased even after deleting a column family

2012-08-18 Thread jmozah
Hi Submitted a patch for this. Should i call for a review for this? i See lot of other testcases are missing for cases like this in handlers... i think a Jira would be good for this to track. ./Zahoor HBase Musings On 13-Aug-2012, at 11:13 AM, J Mohamed Zahoor jmo...@gmail.com wrote:

Re: Can I specify the range inside of fuzzy rule in FuzzyRowFilter?

2012-08-18 Thread Alex Baranau
@Michael, This is not a simple partial key scan. Take this example of rows: a_11_20120801 a_11_20120802 a_11_20120802 a_11_20120803 a_11_20120804 a_11_20120805 a_12_20120801 a_12_20120802 a_12_20120802 a_12_20120803

Re: HBase row level cache for random read

2012-08-18 Thread Stack
On Fri, Aug 17, 2012 at 4:42 PM, Gen Liu ge...@zynga.com wrote: I assume block cache store compressed data, Generally its not, not unless you use block encoding. one block can hold 6 rows, but in random read, maybe 1 row is ever accessed, 5/6 of the cache space is wasted. Is there a better

Re: HBase replication

2012-08-18 Thread Stack
On Fri, Aug 17, 2012 at 5:36 PM, Mohit Anchlia mohitanch...@gmail.com wrote: Are clients local to slave DC able to read data from HBase slave when replicating data from one DC to remote DC? Yes. If not then is there a way to design such a thing where clients are able to actively read/write

Re: HBase replication

2012-08-18 Thread Mohit Anchlia
On Sat, Aug 18, 2012 at 12:35 PM, Stack st...@duboce.net wrote: On Fri, Aug 17, 2012 at 5:36 PM, Mohit Anchlia mohitanch...@gmail.com wrote: Are clients local to slave DC able to read data from HBase slave when replicating data from one DC to remote DC? Yes. Is it also possible to setup

Re: Can I specify the range inside of fuzzy rule in FuzzyRowFilter?

2012-08-18 Thread anil gupta
Hi Alex, Apart from the query which i mentioned in last email. Till now, i have implemented the following queries using filters and coprocessors: 1. Getting all the records for a customer. 2. Perform min,max,avg,sum aggregation for a customer using coprocessors. I am storing some of the data as

Re: Hbase master startup error: cannot assign requested address

2012-08-18 Thread Harsh J
Seems more of an address bind issue. What does your $ host `hostname -f` resolve to, on the master? Can you check if the IP reported back correlates with the IP assigned to your machine (Check via ifconfig)? Perhaps you have a bad IP entry for the master in /etc/hosts and may need to fix that. On

Re: how client location a region/tablet?

2012-08-18 Thread Stack
On Sat, Aug 18, 2012 at 2:13 AM, Lin Ma lin...@gmail.com wrote: Hello guys, I am referencing the Big Table paper about how a client locates a tablet. In section 5.1 Tablet location, it is mentioned that client will cache all tablet locations, I think it means client will cache root tablet in