Robert Levas created AMBARI-20309:
-------------------------------------

             Summary: HBase Master CPU Utilization Alert is in unknown state due to kinit error
                 Key: AMBARI-20309
                 URL: https://issues.apache.org/jira/browse/AMBARI-20309
             Project: Ambari
          Issue Type: Bug
          Components: ambari-server
    Affects Versions: 2.5.0
            Reporter: Robert Levas
            Assignee: Robert Levas
             Fix For: 2.5.0


HBase Master CPU Utilization Alert is in unknown state due to kinit error:

{noformat}
Execution of '/usr/bin/kinit -c /var/lib/ambari-agent/tmp/curl_krb_cache/metric_alert_ambari-qa_cc_56787c2122a8214ca9775f3433361f8b -kt HTTP/[email protected] /etc/security/keytabs/spnego.service.keytab > /dev/null' returned 1. kinit: Client not found in Kerberos database while getting initial credentials
{noformat}

This issue is also seen in /var/log/krb5kdc.log:

{noformat}
Mar 03 16:43:06 c6401.ambari.apache.org krb5kdc[4749](info): AS_REQ (4 etypes {18 17 16 23}) 192.168.64.101: CLIENT_NOT_FOUND: /etc/security/keytabs/[email protected] for krbtgt/[email protected], Client not found in Kerberos database
{noformat}

*Cause*
It appears that the HBASE alerts.json file 
({{common-services/HBASE/0.96.0.2.0/alerts.json}}) has swapped values for the 
{{kerberos_keytab}} and {{kerberos_principal}} properties.

{code}
      {
        "name": "hbase_master_cpu",
        "label": "HBase Master CPU Utilization",
        "description": "This host-level alert is triggered if CPU utilization of the HBase Master exceeds certain warning and critical thresholds. It checks the HBase Master JMX Servlet for the SystemCPULoad property. The threshold values are in percent.",
        "interval": 5,
        "scope": "ANY",
        "enabled": true,
        "source": {
          "type": "METRIC",
          "uri": {
            "http": "{{hbase-site/hbase.master.info.port}}",
            "default_port": 60010,
            "connection_timeout": 5.0,
            "kerberos_keytab": "{{hbase-site/hbase.security.authentication.spnego.kerberos.principal}}",
            "kerberos_principal": "{{hbase-site/hbase.security.authentication.spnego.kerberos.keytab}}"
          },
          "reporting": {
            "ok": {
              "text": "{1} CPU, load {0:.1%}"
            },
            "warning": {
              "text": "{1} CPU, load {0:.1%}",
              "value": 200
            },
            "critical": {
              "text": "{1} CPU, load {0:.1%}",
              "value": 250
            },
            "units" : "%",
            "type": "PERCENT"
          },
          "jmx": {
            "property_list": [
              "java.lang:type=OperatingSystem/SystemCpuLoad",
              "java.lang:type=OperatingSystem/AvailableProcessors"
            ],
            "value": "{0} * 100"
          }
        }
      }
{code}

Note the swapped values:
{code}
            "kerberos_keytab": "{{hbase-site/hbase.security.authentication.spnego.kerberos.principal}}",
            "kerberos_principal": "{{hbase-site/hbase.security.authentication.spnego.kerberos.keytab}}"
{code}
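
The swap is consistent with the kinit failure shown earlier: the principal ends up in the keytab position after {{-kt}}, and the keytab path is passed as the client principal, which the KDC cannot find. The following is a minimal sketch of that effect, not the actual Ambari agent code. The helper {{build_kinit_args}}, the credential cache path, and the principal value are hypothetical placeholders (the real principal is redacted in this report); only the argument order is taken from the error message.

```python
# Placeholder values for illustration only; the real principal is redacted
# in the report and the cache path here is invented.
PRINCIPAL = "HTTP/host.example.com@EXAMPLE.COM"
KEYTAB = "/etc/security/keytabs/spnego.service.keytab"
CCACHE = "/tmp/example_ccache"


def build_kinit_args(ccache, keytab, principal):
    """Hypothetical helper mirroring the argument order seen in the error."""
    return ["/usr/bin/kinit", "-c", ccache, "-kt", keytab, principal]


# Values as the swapped alerts.json properties would resolve them.
swapped = {"kerberos_keytab": PRINCIPAL, "kerberos_principal": KEYTAB}
bad_args = build_kinit_args(
    CCACHE, swapped["kerberos_keytab"], swapped["kerberos_principal"]
)

# Values as the corrected properties would resolve them.
fixed = {"kerberos_keytab": KEYTAB, "kerberos_principal": PRINCIPAL}
good_args = build_kinit_args(
    CCACHE, fixed["kerberos_keytab"], fixed["kerberos_principal"]
)

if __name__ == "__main__":
    # The first line has the principal after -kt and the keytab path last.
    print(" ".join(bad_args))
    print(" ".join(good_args))
```

With the swapped values, the last argument (the client principal) is the keytab path, which matches the {{CLIENT_NOT_FOUND}} entry in krb5kdc.log.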

*Solution*
Swap the values of the {{kerberos_keytab}} and {{kerberos_principal}} properties in 
{{common-services/HBASE/0.96.0.2.0/alerts.json}} so that each references the matching configuration property:

{code}
            "kerberos_principal": "{{hbase-site/hbase.security.authentication.spnego.kerberos.principal}}",
            "kerberos_keytab": "{{hbase-site/hbase.security.authentication.spnego.kerberos.keytab}}"
{code}
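
To look for the same mistake in other alert definitions, a check along the following lines might help. This is a rough sketch, not part of Ambari: the function {{find_swapped_kerberos_props}} is hypothetical, and the test it applies (a keytab value that mentions "principal" while the principal value mentions "keytab") is an assumed heuristic based on the property names in this issue. It walks any parsed JSON structure, so it makes no assumption about the overall layout of alerts.json.

```python
import json


def find_swapped_kerberos_props(node, path="$"):
    """Yield paths of objects whose keytab/principal values look swapped.

    Heuristic (an assumption): kerberos_keytab mentions "principal" while
    kerberos_principal mentions "keytab".
    """
    if isinstance(node, dict):
        keytab = node.get("kerberos_keytab")
        principal = node.get("kerberos_principal")
        if isinstance(keytab, str) and isinstance(principal, str):
            if "principal" in keytab and "keytab" in principal:
                yield path
        for key, value in node.items():
            yield from find_swapped_kerberos_props(value, "{0}.{1}".format(path, key))
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_swapped_kerberos_props(value, "{0}[{1}]".format(path, index))


# Reduced sample built from the definition quoted in this issue.
SAMPLE_BAD = json.loads("""
[{"name": "hbase_master_cpu", "source": {"uri": {
  "kerberos_keytab": "{{hbase-site/hbase.security.authentication.spnego.kerberos.principal}}",
  "kerberos_principal": "{{hbase-site/hbase.security.authentication.spnego.kerberos.keytab}}"
}}}]
""")

SAMPLE_FIXED = json.loads("""
[{"name": "hbase_master_cpu", "source": {"uri": {
  "kerberos_principal": "{{hbase-site/hbase.security.authentication.spnego.kerberos.principal}}",
  "kerberos_keytab": "{{hbase-site/hbase.security.authentication.spnego.kerberos.keytab}}"
}}}]
""")

if __name__ == "__main__":
    print(list(find_swapped_kerberos_props(SAMPLE_BAD)))
    print(list(find_swapped_kerberos_props(SAMPLE_FIXED)))
```

To check a real file, the result of {{json.load}} on the opened alerts.json can be passed to the same function.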




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
