The same question is also posted on Server Fault, in case anyone prefers a
better text and image layout:
https://serverfault.com/q/973434/428655

Thanks
Hakan

On 2019/07/04 18:39:17, Hakan Baba <a...@gmail.com> wrote:
> The ipc numActiveHandler metric is documented here
> <https://hbase.apache.org/1.2/book.html> as:
>
> The number of RPC handlers actively servicing requests
>
> I am looking for a more detailed explanation of the significance of that
> metric. I am trying to debug a scenario where numActiveHandler is stuck at
> 32. I think 32 is a pre-configured max.
> [image: numActiveHandler stuck at max timing] <https://i.stack.imgur.com/Lnqo7.png>
>
> During that time, the same regionserver is stuck at 100% CPU consumption.
> For one of the regions on that regionserver, the rate of processed read
> requests looks like it is reduced by some pressure, a bottleneck
> somewhere. The read request latencies also increase about 5X.
>
> What could lead to this behavior? My intuition is that there have been too
> many connections to that region server during that time and the bottleneck
> is before a read request could be processed. Any suggestions on where to
> look next?
>
> Update
>
> The numActiveHandler metric was added here
> <https://jira.apache.org/jira/plugins/servlet/mobile#issue/HBASE-10212>.
> The description in that ticket says:
>
> We found [numActiveHandler] is a good metric to measure how busy of a
> server. If this number is too high (compared to the total number of
> handlers), the server has risks in getting call queue full.
>
> Update2
>
> During the same period, another metric,
> hbase.regionserver.ipc.numCallsInGeneralQueue, behaves abnormally too.
> Attaching a plot showing them together.
>
> [image: enter image description here] <https://i.stack.imgur.com/dDuBd.png>
>
> Thanks
>
> Hakan
>
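
The ticket description quoted above suggests reading numActiveHandler relative
to the total number of handlers. A minimal sketch of that comparison follows,
assuming the metric has already been sampled into a list of integers and that
the total is 32 (the value the plot plateaus at, which is only a guess at the
configured max). The function names, the 0.9 threshold, and the three-sample
window are hypothetical choices for illustration, not part of HBase.

```python
from typing import Iterable


def handler_utilization(num_active_handler: int, total_handlers: int) -> float:
    """Fraction of RPC handlers that are busy for one metric sample."""
    if total_handlers <= 0:
        raise ValueError("total_handlers must be positive")
    return num_active_handler / total_handlers


def is_saturated(samples: Iterable[int], total_handlers: int,
                 threshold: float = 0.9, min_consecutive: int = 3) -> bool:
    """Return True if utilization stays at or above `threshold` for at least
    `min_consecutive` samples in a row (hypothetical alerting rule)."""
    run = 0
    for sample in samples:
        if handler_utilization(sample, total_handlers) >= threshold:
            run += 1
            if run >= min_consecutive:
                return True
        else:
            run = 0  # a dip below the threshold resets the streak
    return False


# Assumed total of 32 handlers, taken from the plateau in the plot.
samples = [5, 12, 32, 32, 32, 32]
print(is_saturated(samples, total_handlers=32))
```

A sustained ratio near 1.0 together with a growing numCallsInGeneralQueue would
fit the "call queue full" risk described in the ticket, though it does not by
itself say why the handlers are busy.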
