Re: Possible to scan row keys only?

Ted Yu Wed, 22 Dec 2010 16:39:05 -0800

This is what you need:
https://issues.apache.org/jira/browse/HBASE-3211


On Wed, Dec 22, 2010 at 4:04 PM, Leo Alekseyev <[email protected]> wrote:

> Looking at HBASE-1537, it seems that it only limits the number of
> columns (or column families?) that the scanner returns.  This is not
> useful to us; we have only one column with one column family.  One
> solution might potentially be to add a dummy column family and then
> use HBASE-1537 with a limit of 1.  We'd do it if it's the only option,
> but we'd rather not online and offline our tables (we are talking
> about many terabytes in a production cluster).
>
> Is it somehow possible to use KeyValue.getRow() or KeyValue.getKey()
> in a scanner without returning KeyValue's?  Basically, any code that
> tries to return or make copies of entire KeyValue's is incredibly
> inefficient, since (a) we only need the key and (b) sizeof(key) is
> many orders of magnitude smaller than sizeof(value).
>
> On Wed, Dec 22, 2010 at 1:53 PM, Ted Yu <[email protected]> wrote:
> > How about HBASE-1537 ?
> >
> > On Wed, Dec 22, 2010 at 1:38 PM, Leo Alekseyev <[email protected]>
> wrote:
> >
> >> I need to retrieve row keys from several big tables.  Is it possible
> >> to do so by just reading the key and truncating the value?  I see that
> >> HBASE-1481 implements FirstKeyOnlyFilter, but it doesn't help in our
> >> case, since we have only one KeyValue per row, and it stores binary
> >> data.
> >>
> >> Is there an easy way to accomplish fast key-only retrieval?
> >>
> >> --Leo
> >>
> >
>

Re: Possible to scan row keys only?

Reply via email to