Re: experiencing high latency for few reads in HBase

Saurabh Yahoo Thu, 29 Aug 2013 13:59:42 -0700

Thanks Adrian.

Based on hbase book, it is listed as experimental item. ( 
http://hbase.apache.org/book/upgrade0.92.html), even it had been implemented 
back in 2011.


Is anyone running this in production? Any feedback..

Thanks,
Saurabh. 

On Aug 29, 2013, at 4:07 PM, Adrien Mogenet <[email protected]> wrote:

> Another point that could help to stay under the `1s SLA': enable direct
> byte buffers for LruBlockCache. Have a look at HBASE-4027.
> 
> 
> On Thu, Aug 29, 2013 at 9:27 PM, Kiru Pakkirisamy <[email protected]
>> wrote:
> 
>> Yes, in that case, it matters. I was talking about a case where you are
>> mostly serving from cache.
>> 
>> Regards,
>> - kiru
>> 
>> 
>> Kiru Pakkirisamy | webcloudtech.wordpress.com
>> 
>> 
>> ________________________________
>> From: Saurabh Yahoo <[email protected]>
>> To: "[email protected]" <[email protected]>
>> Cc: "[email protected]" <[email protected]>
>> Sent: Thursday, August 29, 2013 12:09 PM
>> Subject: Re: experiencing high latency for few reads in HBase
>> 
>> 
>> Thanks Kiru.
>> 
>> We have 10TB of data on disk. It would not fit in memory. Also for the
>> first time, hbase need to read from the disk. And it has to go through the
>> network to read the blocks which are stored at other data node.
>> 
>> So in my opinion, locality matters.
>> 
>> Thanks,
>> Saurabh.
>> 
>> On Aug 29, 2013, at 2:33 PM, Kiru Pakkirisamy <[email protected]>
>> wrote:
>> 
>>> But locality index should not matter right if you are in IN_MEMORY most
>> and you are running the test after  a few runs to make sure they are
>> already in IN_MEMORY  (ie blockCacheHit is high or blockCacheMiss is low)
>> (?)
>>> 
>>> Regards,
>>> - kiru
>>> 
>>> 
>>> Kiru Pakkirisamy | webcloudtech.wordpress.com
>>> 
>>> 
>>> ________________________________
>>> From: Vladimir Rodionov <[email protected]>
>>> To: "[email protected]" <[email protected]>
>>> Sent: Thursday, August 29, 2013 11:11 AM
>>> Subject: RE: experiencing high latency for few reads in HBase
>>> 
>>> 
>>> Usually, either cluster restart or major compaction helps improving
>> locality index.
>>> There is an issue in region assignment after table disable/enable in
>> 0.94.x (x <11) which
>>> breaks HDFS locality. Fixed in 0.94.11
>>> 
>>> You can write your own routine to manually "localize" particular table
>> using public HBase Client API.
>>> 
>>> But this won't help you to stay withing 1 sec anyway.
>>> 
>>> Best regards,
>>> Vladimir Rodionov
>>> Principal Platform Engineer
>>> Carrier IQ, www.carrieriq.com
>>> e-mail: [email protected]
>>> 
>>> ________________________________________
>>> From: Saurabh Yahoo [[email protected]]
>>> Sent: Thursday, August 29, 2013 10:52 AM
>>> To: [email protected]
>>> Cc: [email protected]
>>> Subject: Re: experiencing high latency for few reads in HBase
>>> 
>>> Thanks Vlad.
>>> 
>>> Quick question. I notice hdfsBlocksLocalityIndex is around 50 in all
>> region servers.
>>> 
>>> Does that could be a problem? If it is, how to solve that? We already
>> ran the major compaction after ingesting the data.
>>> 
>>> Thanks,
>>> Saurabh.
>>> 
>>> On Aug 29, 2013, at 12:17 PM, Vladimir Rodionov <[email protected]>
>> wrote:
>>> 
>>>> Yes. HBase won't guarantee strict sub-second latency.
>>>> 
>>>> Best regards,
>>>> Vladimir Rodionov
>>>> Principal Platform Engineer
>>>> Carrier IQ, www.carrieriq.com
>>>> e-mail: [email protected]
>>>> 
>>>> ________________________________________
>>>> From: Saurabh Yahoo [[email protected]]
>>>> Sent: Thursday, August 29, 2013 2:49 AM
>>>> To: [email protected]
>>>> Cc: [email protected]
>>>> Subject: Re: experiencing high latency for few reads in HBase
>>>> 
>>>> Hi Vlad,
>>>> 
>>>> We do have strict latency requirement as it is financial data requiring
>> direct access from clients.
>>>> 
>>>> Are you saying that it is not possible to achieve sub second latency
>> using hbase (because it is based on java.) ?
>>>> 
>>>> 
>>>> 
>>>> 
>>>> 
>>>> 
>>>> 
>>>> On Aug 28, 2013, at 8:10 PM, Vladimir Rodionov <[email protected]>
>> wrote:
>>>> 
>>>>> Increasing Java heap size will make latency worse, actually.
>>>>> You can't guarantee 1 sec max latency if run Java app (unless your
>> heap size is much less than 1GB).
>>>>> I have never heard about strict maximum latency limit. Usually , its
>> 99% , 99.9 or 99.99% query percentiles.
>>>>> 
>>>>> You can greatly reduce your 99.xxx% percentile latency by storing you
>> data in 2 replicas to two different region servers.
>>>>> Issue two read operations to those two region servers in parallel and
>> get the first response. Probability theory states that  probability
>>>>> of two independent events (slow requests) is  the product of event's
>> probabilities themselves.
>>>>> 
>>>>> 
>>>>> Best regards,
>>>>> Vladimir Rodionov
>>>>> Principal Platform Engineer
>>>>> Carrier IQ, www.carrieriq.com
>>>>> e-mail: [email protected]
>>>>> 
>>>>> ________________________________________
>>>>> From: Saurabh Yahoo [[email protected]]
>>>>> Sent: Wednesday, August 28, 2013 4:18 PM
>>>>> To: [email protected]
>>>>> Subject: Re: experiencing high latency for few reads in HBase
>>>>> 
>>>>> Thanks Kiru,
>>>>> 
>>>>> Scan is not an option for our use cases.  Our read is pretty random.
>>>>> 
>>>>> Any other suggestion to bring down the latency.
>>>>> 
>>>>> Thanks,
>>>>> Saurabh.
>>>>> 
>>>>> 
>>>>> On Aug 28, 2013, at 7:01 PM, Kiru Pakkirisamy <
>> [email protected]> wrote:
>>>>> 
>>>>>> Saurabh, we are able to 600K rowxcolumns in 400 msec. We have put
>> what was a 40million row table as 400K rows and columns. We Get about 100
>> of the rows from this 400K , do quite a bit of calculations in the
>> coprocessor (almost a group-order by) and return in this time.
>>>>>> Maybe should consider replacing the MultiGets with Scan with Filter.
>> I like the FuzzyRowFilter even though you might need to match with exact
>> key. It works only with fixed length key.
>>>>>> (I do have an issue right now, it is not scaling to multiple clients.)
>>>>>> 
>>>>>> Regards,
>>>>>> - kiru
>>>>>> 
>>>>>> 
>>>>>> Kiru Pakkirisamy | webcloudtech.wordpress.com
>>>>>> 
>>>>>> 
>>>>>> ________________________________
>>>>>> From: Saurabh Yahoo <[email protected]>
>>>>>> To: "[email protected]" <[email protected]>
>>>>>> Cc: "[email protected]" <[email protected]>
>>>>>> Sent: Wednesday, August 28, 2013 3:20 PM
>>>>>> Subject: Re: experiencing high latency for few reads in HBase
>>>>>> 
>>>>>> 
>>>>>> Thanks Kitu. We need less than 1 sec latency.
>>>>>> 
>>>>>> We are using both muliGet and get.
>>>>>> 
>>>>>> We have three concurrent clients running 10 threads each. ( that
>> makes total 30 concurrent clients).
>>>>>> 
>>>>>> Thanks,
>>>>>> Saurabh.
>>>>>> 
>>>>>> On Aug 28, 2013, at 4:30 PM, Kiru Pakkirisamy <
>> [email protected]> wrote:
>>>>>> 
>>>>>>> Right 4 sec is good.
>>>>>>> @Saurabh - so your read is - getting 20 out of 25 millions rows ?.
>> Is this a Get or a Scan ?
>>>>>>> BTW, in this stress test how many concurrent clients do you have ?
>>>>>>> 
>>>>>>> Regards,
>>>>>>> - kiru
>>>>>>> 
>>>>>>> 
>>>>>>> ________________________________
>>>>>>> From: Vladimir Rodionov <[email protected]>
>>>>>>> To: "[email protected]" <[email protected]>
>>>>>>> Sent: Wednesday, August 28, 2013 12:15 PM
>>>>>>> Subject: RE: experiencing high latency for few reads in HBase
>>>>>>> 
>>>>>>> 
>>>>>>> 1. 4 sec max latency is not that bad taking into account 12GB heap.
>> It can be much larger. What is your SLA?
>>>>>>> 2. Block evictions is the result of a poor cache hit rate and the
>> root cause of a periodic stop-the-world GC pauses (max latencies
>>>>>>>    latencies you have been observing in the test)
>>>>>>> 3. Block cache consists of 3 parts (25% young generation, 50% -
>> tenured, 25% - permanent). Permanent part is for CF with
>>>>>>> IN_MEMORY = true (you can specify this when you create CF).  Block
>> first stored in 'young gen' space, then gets promoted to 'tenured gen' space
>>>>>>> (or gets evicted). May be your 'perm gen' space is underutilized?
>> This is exact 25% of 4GB (1GB). Although HBase LruBlockCache should use all
>> the space allocated for block cache -
>>>>>>> there is no guarantee (as usual). If you don have in_memory column
>> families you may decrease
>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> Best regards,
>>>>>>> Vladimir Rodionov
>>>>>>> Principal Platform Engineer
>>>>>>> Carrier IQ, www.carrieriq.com
>>>>>>> e-mail: [email protected]
>>>>>>> 
>>>>>>> ________________________________________
>>>>>>> From: Saurabh Yahoo [[email protected]]
>>>>>>> Sent: Wednesday, August 28, 2013 5:10 AM
>>>>>>> To: [email protected]
>>>>>>> Subject: experiencing high latency for few reads in HBase
>>>>>>> 
>>>>>>> Hi,
>>>>>>> 
>>>>>>> We are running a stress test in our 5 node cluster and we are
>> getting the expected mean latency of 10ms. But we are seeing around 20
>> reads out of 25 million reads having latency more than 4 seconds. Can
>> anyone provide the insight what we can do to meet below second SLA for each
>> and every read?
>>>>>>> 
>>>>>>> We observe the following things -
>>>>>>> 
>>>>>>> 1. Reads are evenly distributed among 5 nodes.  CPUs remain under 5%
>> utilized.
>>>>>>> 
>>>>>>> 2. We have 4gb block cache (30% block cache out of 12gb) setup. 3gb
>> block cache got filled up but around 1gb remained free. There are a large
>> number of cache eviction.
>>>>>>> 
>>>>>>> Questions to experts -
>>>>>>> 
>>>>>>> 1. If there are still 1gb of free block cache available, why is
>> hbase evicting the block from cache?
>>>>>>> 
>>>>>>> 4. We are seeing memory went up to 10gb three times before dropping
>> sharply to 5gb.
>>>>>>> 
>>>>>>> Any help is highly appreciable,
>>>>>>> 
>>>>>>> Thanks,
>>>>>>> Saurabh.
>>>>>>> 
>>>>>>> Confidentiality Notice:  The information contained in this message,
>> including any attachments hereto, may be confidential and is intended to be
>> read only by the individual or entity to whom this message is addressed. If
>> the reader of this message is not the intended recipient or an agent or
>> designee of the intended recipient, please note that any review, use,
>> disclosure or distribution of this message or its attachments, in any form,
>> is strictly prohibited.  If you have received this message in error, please
>> immediately notify the sender and/or [email protected] and
>> delete or destroy any copy of this message and its attachments.
> 
> 
> 
> -- 
> Adrien Mogenet
> http://www.borntosegfault.com

Re: experiencing high latency for few reads in HBase

Reply via email to