Thank you St.Ack,

With StartRow I need to pass the full row key, but since my key is made of
date+ticker I can't do that
(I know the desired date, but don't always know the ticker). Is there a way
to do it?

The same thing also apply to the other part of the question - how can I
filter based on the suffix of the key (the ticker)
if not using regex?

Thanks again

On Wed, Jul 6, 2011 at 4:14 PM, Stack <[email protected]> wrote:

> On Tue, Jul 5, 2011 at 1:02 PM, Alt Control <[email protected]>
> wrote:
> > Question is - how can I do that efficiently? I don't know if HBase allow
> me
> > to set multiple filters in a single Scane object,
> > but I can do that with regex (for example (GOOG|IBM|DELL|.......|n|)),
> but
> > is this the right way?
> >
>
> You can pass lists of filters.  See
>
> http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/filter/FilterList.html
>
> For scanning during a certain time, make your Scan start (and
> optionally end) within the time you are interested in by passing the
> appropriate start and stop keys:  See setStartRow and setStopRow in
> http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/Scan.html.
>
> FYI, avoid regex'es if you can.  They are costly.  HBase is all about
> bytes so to do the check, need to go from bytes to String, then do
> regex, and do this for each compare of all values.  It adds up.
>
> St.Ack
>

Reply via email to