Can you give an example of what you're trying to do?

BTW, what we mean when we say that filters don't work across region
servers (it's really across regions, so it's also a problem on a
single machine) is that if your filter holds some sort of state, that
state isn't carried from one region to the next. I don't think any of
the filters HBase ships with have that sort of issue, so they can all
be used to scan a full table if that's what you fancy.
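
To make the state point concrete, here is a rough toy model in plain
Java. It does not use the HBase client API at all: the class
RegionFilterSketch, the FirstNRows filter, and the scan() and
sampleRegions() helpers are all made up for illustration, and each
"region" is just a sorted map. The only assumption it encodes is the
one described above, that every region works with its own fresh copy
of the filter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;
import java.util.function.Predicate;
import java.util.function.Supplier;

public class RegionFilterSketch {

    // Hypothetical stateful filter: accepts only the first `limit` rows it sees.
    static final class FirstNRows implements Predicate<String> {
        private final int limit;
        private int seen = 0; // state that lives only inside this instance

        FirstNRows(int limit) { this.limit = limit; }

        @Override
        public boolean test(String rowKey) { return seen++ < limit; }
    }

    // Toy scan over [startRow, stopRow): visits the regions in key order and
    // gives each region a fresh filter instance, so no state crosses regions.
    static List<String> scan(List<TreeMap<String, String>> regions,
                             String startRow, String stopRow,
                             Supplier<Predicate<String>> filterFactory) {
        List<String> out = new ArrayList<>();
        for (TreeMap<String, String> region : regions) {
            Predicate<String> filter = filterFactory.get();
            for (String rowKey : region.subMap(startRow, true, stopRow, false).keySet()) {
                if (filter.test(rowKey)) {
                    out.add(rowKey);
                }
            }
        }
        return out;
    }

    // Two made-up regions holding rows a1..a3 and b1..b3.
    static List<TreeMap<String, String>> sampleRegions() {
        TreeMap<String, String> first = new TreeMap<>();
        TreeMap<String, String> second = new TreeMap<>();
        for (int i = 1; i <= 3; i++) {
            first.put("a" + i, "value");
            second.put("b" + i, "value");
        }
        List<TreeMap<String, String>> regions = new ArrayList<>();
        regions.add(first);
        regions.add(second);
        return regions;
    }

    public static void main(String[] args) {
        List<TreeMap<String, String>> regions = sampleRegions();

        // Stateless prefix-style filter: the region boundary makes no difference.
        System.out.println(scan(regions, "a", "c", () -> key -> key.startsWith("b")));
        // prints [b1, b2, b3]

        // Stateful filter: the counter restarts in the second region.
        System.out.println(scan(regions, "a", "c", () -> new FirstNRows(2)));
        // prints [a1, a2, b1, b2]
    }
}
```

In this sketch the stateless filter gives the same answer however the
rows are split, while the counting filter lets through two rows per
region instead of two rows overall.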

J-D

On Thu, Apr 28, 2011 at 1:19 PM, Ajay Govindarajan
<[email protected]> wrote:
> Sorry, what I meant was Scans using Filters. There are use cases for which we
> will not know the row keys, so we have to resort to filters such as
> SingleColumnValueFilter or PrefixFilter.
> Since filters don't work across region servers, are there any alternative
> APIs or workarounds? Or is there a fundamental schema design issue here?
>
> thanks
> -ajay
>
> ________________________________
> From: Bennett Andrews <[email protected]>
> To: [email protected]; Ajay Govindarajan <[email protected]>
> Sent: Thursday, April 28, 2011 12:54 PM
> Subject: Re: HBase querying across region servers
>
> Scans will work across region servers transparently.  All you need to do is
> specify a start row and end row.  Use this when you are reading sequential
> rows, as it will be faster.
>
> -bennett
>
>
>
> On Thu, Apr 28, 2011 at 2:30 PM, Ajay Govindarajan
> <[email protected]> wrote:
>
>> We have a bunch of synchronous requests that will read and write data to
>> HBase. I have written some code that uses the HBase client library to use
>> Puts for writes, Gets for reads with rowkeys and Scans for reads with
>> filters. Currently we have only one region server (since it's a dev
>> environment) so the queries work fine. Eventually we will have multiple
>> region servers in our production environment. From the documentation it
>> seems that Gets and Puts will work across multiple region servers while
>> Scans don't.
>>
>> So how do I solve this problem to get scans to work across multiple region
>> servers? Should I avoid using scans and replace them with Gets using filters?
>> Is that a big performance overhead?
>> Or is there a framework to perform scan-like queries across multiple region
>> servers?
>>
>> Any help will be appreciated.
>>
>> thanks
>> -ajay
>>
