Dmytro Sen created AMBARI-13793:
-----------------------------------
Summary: Resource Manager (HA) fails to start after enabling Kerberos
Key: AMBARI-13793
URL: https://issues.apache.org/jira/browse/AMBARI-13793
Project: Ambari
Issue Type: Bug
Components: stacks
Affects Versions: 2.1.2
Reporter: Dmytro Sen
Assignee: Dmytro Sen
Priority: Critical
Fix For: 2.1.3
Resource Manager (HA) fails to start after enabling Kerberos on a cluster
whose Ambari was upgraded from 1.6.1 to 2.1.2.1.
*Steps to reproduce*
# Install Ambari 1.6.1
# Create 3-node cluster with ZK, HDFS, YARN, TEZ (HDP 2.1)
# Upgrade to Ambari 2.1.2.1
# Add host (I am not sure if this is relevant)
# Enable NN HA
# Enable RM HA
# Enable Kerberos (via Enable Kerberos Wizard)
While starting services after enabling Kerberos, the Resource Managers spend a
long time trying to start and then fail with the following error:
{noformat:title=StdErr}
Python script has been killed due to timeout after waiting 1200 secs
{noformat}
{noformat:title=StdOut}
2015-10-30 19:56:19,377 - Group['hadoop'] {}
2015-10-30 19:56:19,378 - Group['users'] {}
2015-10-30 19:56:19,385 - User['mapred'] {'gid': 'hadoop', 'groups': ['hadoop']}
2015-10-30 19:56:19,386 - User['ambari-qa'] {'gid': 'hadoop', 'groups':
['users']}
2015-10-30 19:56:19,386 - User['zookeeper'] {'gid': 'hadoop', 'groups':
['hadoop']}
2015-10-30 19:56:19,392 - User['tez'] {'gid': 'hadoop', 'groups': ['users']}
2015-10-30 19:56:19,392 - User['hdfs'] {'gid': 'hadoop', 'groups': ['hadoop']}
2015-10-30 19:56:19,393 - User['yarn'] {'gid': 'hadoop', 'groups': ['hadoop']}
2015-10-30 19:56:19,398 - File['/var/lib/ambari-agent/tmp/changeUid.sh']
{'content': StaticFile('changeToSecureUid.sh'), 'mode': 0555}
2015-10-30 19:56:19,399 - Execute['/var/lib/ambari-agent/tmp/changeUid.sh
ambari-qa
/tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa']
{'not_if': '(test $(id -u ambari-qa) -gt 1000) || (false)'}
2015-10-30 19:56:19,426 - Skipping
Execute['/var/lib/ambari-agent/tmp/changeUid.sh ambari-qa
/tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa']
due to not_if
2015-10-30 19:56:19,426 - Group['hdfs'] {'ignore_failures': False}
2015-10-30 19:56:19,427 - User['hdfs'] {'ignore_failures': False, 'groups':
['hadoop', 'hdfs']}
2015-10-30 19:56:19,427 - Directory['/etc/hadoop'] {'mode': 0755}
2015-10-30 19:56:19,428 - Directory['/etc/hadoop/conf.empty'] {'owner': 'root',
'group': 'hadoop', 'recursive': True}
2015-10-30 19:56:19,428 - Link['/etc/hadoop/conf'] {'not_if': 'ls
/etc/hadoop/conf', 'to': '/etc/hadoop/conf.empty'}
2015-10-30 19:56:19,444 - Skipping Link['/etc/hadoop/conf'] due to not_if
2015-10-30 19:56:19,506 - File['/etc/hadoop/conf/hadoop-env.sh'] {'content':
InlineTemplate(...), 'owner': 'root', 'group': 'hadoop'}
2015-10-30 19:56:19,507 -
Directory['/var/lib/ambari-agent/tmp/hadoop_java_io_tmpdir'] {'owner': 'hdfs',
'group': 'hadoop', 'mode': 0777}
2015-10-30 19:56:19,562 - Execute[('setenforce', '0')] {'not_if': '(! which
getenforce ) || (which getenforce && getenforce | grep -q Disabled)', 'sudo':
True, 'only_if': 'test -f /selinux/enforce'}
2015-10-30 19:56:19,626 - Directory['/var/log/hadoop'] {'owner': 'root',
'mode': 0775, 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:19,627 - Directory['/var/run/hadoop'] {'owner': 'root',
'group': 'root', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:19,627 - Directory['/tmp/hadoop-hdfs'] {'owner': 'hdfs',
'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:19,641 - File['/etc/hadoop/conf/commons-logging.properties']
{'content': Template('commons-logging.properties.j2'), 'owner': 'root'}
2015-10-30 19:56:19,649 - File['/etc/hadoop/conf/health_check'] {'content':
Template('health_check.j2'), 'owner': 'root'}
2015-10-30 19:56:19,650 - File['/etc/hadoop/conf/log4j.properties'] {'content':
..., 'owner': 'hdfs', 'group': 'hadoop', 'mode': 0644}
2015-10-30 19:56:19,696 - File['/etc/hadoop/conf/hadoop-metrics2.properties']
{'content': Template('hadoop-metrics2.properties.j2'), 'owner': 'hdfs'}
2015-10-30 19:56:19,697 - File['/etc/hadoop/conf/task-log4j.properties']
{'content': StaticFile('task-log4j.properties'), 'mode': 0755}
2015-10-30 19:56:19,698 - File['/etc/hadoop/conf/configuration.xsl'] {'owner':
'hdfs', 'group': 'hadoop'}
2015-10-30 19:56:19,720 - File['/etc/hadoop/conf/topology_mappings.data']
{'owner': 'hdfs', 'content': Template('topology_mappings.data.j2'), 'only_if':
'test -d /etc/hadoop/conf', 'group': 'hadoop'}
2015-10-30 19:56:19,731 - File['/etc/hadoop/conf/topology_script.py']
{'content': StaticFile('topology_script.py'), 'only_if': 'test -d
/etc/hadoop/conf', 'mode': 0755}
2015-10-30 19:56:20,154 - Skipping get_hdp_version since hdp-select is not yet
available
2015-10-30 19:56:20,179 - Directory['/var/run/hadoop-yarn'] {'owner': 'yarn',
'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,192 - Directory['/var/run/hadoop-yarn/yarn'] {'owner':
'yarn', 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,193 - Directory['/var/log/hadoop-yarn/yarn'] {'owner':
'yarn', 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,193 - Directory['/var/run/hadoop-mapreduce'] {'owner':
'mapred', 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,193 - Directory['/var/run/hadoop-mapreduce/mapred']
{'owner': 'mapred', 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,201 - Directory['/var/log/hadoop-mapreduce'] {'owner':
'mapred', 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,201 - Directory['/var/log/hadoop-mapreduce/mapred']
{'owner': 'mapred', 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,202 - Directory['/var/log/hadoop-yarn'] {'owner': 'yarn',
'ignore_failures': True, 'recursive': True, 'cd_access': 'a'}
2015-10-30 19:56:20,202 - XmlConfig['core-site.xml'] {'group': 'hadoop',
'conf_dir': '/etc/hadoop/conf', 'mode': 0644, 'configuration_attributes': {},
'owner': 'hdfs', 'configurations': ...}
2015-10-30 19:56:20,263 - Generating config: /etc/hadoop/conf/core-site.xml
2015-10-30 19:56:20,263 - File['/etc/hadoop/conf/core-site.xml'] {'owner':
'hdfs', 'content': InlineTemplate(...), 'group': 'hadoop', 'mode': 0644,
'encoding': 'UTF-8'}
2015-10-30 19:56:20,335 - XmlConfig['hdfs-site.xml'] {'group': 'hadoop',
'conf_dir': '/etc/hadoop/conf', 'mode': 0644, 'configuration_attributes': {},
'owner': 'hdfs', 'configurations': ...}
2015-10-30 19:56:20,376 - Generating config: /etc/hadoop/conf/hdfs-site.xml
2015-10-30 19:56:20,377 - File['/etc/hadoop/conf/hdfs-site.xml'] {'owner':
'hdfs', 'content': InlineTemplate(...), 'group': 'hadoop', 'mode': 0644,
'encoding': 'UTF-8'}
2015-10-30 19:56:20,623 - XmlConfig['mapred-site.xml'] {'group': 'hadoop',
'conf_dir': '/etc/hadoop/conf', 'mode': 0644, 'configuration_attributes': {},
'owner': 'yarn', 'configurations': ...}
2015-10-30 19:56:20,666 - Generating config: /etc/hadoop/conf/mapred-site.xml
2015-10-30 19:56:20,667 - File['/etc/hadoop/conf/mapred-site.xml'] {'owner':
'yarn', 'content': InlineTemplate(...), 'group': 'hadoop', 'mode': 0644,
'encoding': 'UTF-8'}
2015-10-30 19:56:20,805 - Changing owner for /etc/hadoop/conf/mapred-site.xml
from 1004 to yarn
2015-10-30 19:56:20,814 - XmlConfig['yarn-site.xml'] {'group': 'hadoop',
'conf_dir': '/etc/hadoop/conf', 'mode': 0644, 'configuration_attributes': {},
'owner': 'yarn', 'configurations': ...}
2015-10-30 19:56:20,848 - Generating config: /etc/hadoop/conf/yarn-site.xml
2015-10-30 19:56:20,856 - File['/etc/hadoop/conf/yarn-site.xml'] {'owner':
'yarn', 'content': InlineTemplate(...), 'group': 'hadoop', 'mode': 0644,
'encoding': 'UTF-8'}
2015-10-30 19:56:21,138 - XmlConfig['capacity-scheduler.xml'] {'group':
'hadoop', 'conf_dir': '/etc/hadoop/conf', 'mode': 0644,
'configuration_attributes': {}, 'owner': 'yarn', 'configurations': ...}
2015-10-30 19:56:21,178 - Generating config:
/etc/hadoop/conf/capacity-scheduler.xml
2015-10-30 19:56:21,184 - File['/etc/hadoop/conf/capacity-scheduler.xml']
{'owner': 'yarn', 'content': InlineTemplate(...), 'group': 'hadoop', 'mode':
0644, 'encoding': 'UTF-8'}
2015-10-30 19:56:21,249 - Changing owner for
/etc/hadoop/conf/capacity-scheduler.xml from 1003 to yarn
2015-10-30 19:56:21,250 -
File['/var/log/hadoop-yarn/yarn/hadoop-mapreduce.jobsummary.log'] {'owner':
'yarn', 'group': 'hadoop'}
2015-10-30 19:56:21,250 - Execute['yarn resourcemanager -format-state-store']
{'user': 'yarn'}
{noformat}
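The StdOut above stops at the {{yarn resourcemanager -format-state-store}} step. This is consistent with that command being the one still running when the 1200-second timeout hit, but the log alone does not prove it. Because the agent output is hard-wrapped, picking out the last logged step by eye is error-prone. The following is a minimal sketch, not part of Ambari, that rejoins wrapped lines and reports the final entry. The names {{parse_entries}}, {{last_step}} and {{SAMPLE}} are hypothetical, and it assumes every entry begins with a {{YYYY-MM-DD HH:MM:SS,mmm -}} timestamp as in the output above.

```python
import re
from datetime import datetime

# Assumption: each agent log entry starts with "YYYY-MM-DD HH:MM:SS,mmm -",
# and any line without that prefix is a wrapped continuation of the entry above.
ENTRY_START = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),(\d{3}) - ?(.*)$")


def parse_entries(text):
    """Rejoin hard-wrapped lines into a list of (timestamp, message) tuples."""
    entries = []
    for line in text.splitlines():
        match = ENTRY_START.match(line)
        if match:
            ts = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            ts = ts.replace(microsecond=int(match.group(2)) * 1000)
            entries.append([ts, match.group(3)])
        elif entries and line.strip():
            # Continuation line: append it to the message of the current entry.
            entries[-1][1] = (entries[-1][1] + " " + line.strip()).strip()
    return [(ts, msg) for ts, msg in entries]


def last_step(text):
    """Return the final (timestamp, message) entry, or None if there is none."""
    entries = parse_entries(text)
    return entries[-1] if entries else None


# The last two entries of the StdOut quoted in this report.
SAMPLE = """\
2015-10-30 19:56:21,250 -
File['/var/log/hadoop-yarn/yarn/hadoop-mapreduce.jobsummary.log'] {'owner':
'yarn', 'group': 'hadoop'}
2015-10-30 19:56:21,250 - Execute['yarn resourcemanager -format-state-store']
{'user': 'yarn'}
"""

if __name__ == "__main__":
    ts, msg = last_step(SAMPLE)
    print(ts.isoformat(), msg)
```

Running it on the full StdOut saved from the failed task should show whether the state-store format step is indeed the last one logged before the timeout.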