Re: Result set overhead?

James Taylor Wed, 17 Dec 2014 18:34:02 -0800

Hi Kristoffer,
Yes, you're correct - for a non aggregate, non join query, the
underlying result set is backed by the bloated HBase Result and
KeyValue/Cell. See PHOENIX-1489 - maybe we can continue the discussion
there? Your comments here would be valuable over there too.
Thanks,
James


On Wed, Dec 17, 2014 at 2:43 PM, Kristoffer Sjögren <sto...@gmail.com> wrote:
> Correct me if i'm wrong here but it seems that columns not part of the
> primary key are stored as a column qualifiers. And each key value in HBase
> store both the qualifier name and the value. So both qualifier name and
> value are transferred from region server to client, for all column values
> and rows.
>
> This is especially bad for some of our tables which have 2-3 column keys and
> 20-25 column values. And the problem gets worse when column value names
> sometimes are 20-30 characters long.
>
> Any suggestions on how to reduce this overhead?
>
> Cheers,
> -Kristoffer
>
>
>
>
> On Wed, Dec 17, 2014 at 4:51 PM, Kristoffer Sjögren <sto...@gmail.com>
> wrote:
>>
>> Hi
>>
>> I have done some tracing and it seems like _each_ 'select' result contain
>> (redundant?) column names? This cause a lot of overhead when having
>> descriptive column names. Especially when values in these columns are very
>> small.
>>
>> Is this correct? Is it possible to make result sets less chatty?
>>
>> Cheers,
>> -Kristoffer
>>
>

Re: Result set overhead?

Reply via email to