[ 
https://issues.apache.org/jira/browse/HBASE-2629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Kyle Purtell closed HBASE-2629.
--------------------------------------

> Piggyback basic "alarm" framework on RS heartbeats
> --------------------------------------------------
>
>                 Key: HBASE-2629
>                 URL: https://issues.apache.org/jira/browse/HBASE-2629
>             Project: HBase
>          Issue Type: New Feature
>          Components: master, regionserver
>            Reporter: Todd Lipcon
>            Priority: Major
>
> There are a number of system conditions that can cause HBase to perform badly 
> or have stability issues. For example, significant swapping activity or 
> overloaded ZK will result in all kinds of problems.
> It would be nice to put a very lightweight "alarm" framework in place, so 
> that when the RS notices something is amiss, it can raise an alarm flag for 
> some period of time. These could be exposed by JMX to external monitoring 
> tools, and also displayed on the master web UI.
> Some example alarms:
> - "ZK read took >1000ms"
> - "Long garbage collection pause detected"
> - "Writes blocked on region for longer than 5 seconds"
> etc etc



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to