-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/62571/
-----------------------------------------------------------
(Updated Sept. 26, 2017, 6:58 p.m.)
Review request for Ambari, Jonathan Hurley and Nate Cole.
Bugs: AMBARI-22060
https://issues.apache.org/jira/browse/AMBARI-22060
Repository: ambari
Description
-------
During the downgrade process, run into the following error whilst it's
restating Ranger Admin:
{code}
Traceback (most recent call last):
File
"/var/lib/ambari-agent/cache/common-services/RANGER/0.4.0/package/scripts/ranger_admin.py",
line 216, in
RangerAdmin().execute()
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py",
line 329, in execute
method(env)
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py",
line 850, in restart
self.start(env, upgrade_type=upgrade_type)
File
"/var/lib/ambari-agent/cache/common-services/RANGER/0.4.0/package/scripts/ranger_admin.py",
line 93, in start
setup_ranger_audit_solr()
File
"/var/lib/ambari-agent/cache/common-services/RANGER/0.4.0/package/scripts/setup_ranger_xml.py",
line 705, in setup_ranger_audit_solr
new_service_principals = [params.ranger_admin_jaas_principal])
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/functions/solr_cloud_util.py",
line 329, in add_solr_roles
new_service_users.append(__remove_host_from_principal(new_service_user,
kerberos_realm))
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/functions/solr_cloud_util.py",
line 266, in __remove_host_from_principal
if not realm:
File
"/usr/lib/python2.6/site-packages/resource_management/libraries/script/config_dictionary.py",
line 73, in __getattr__
raise Fail("Configuration parameter '" + self.name + "' was not found in
configurations dictionary!")
resource_management.core.exceptions.Fail: Configuration parameter
'kerberos-env' was not found in configurations dictionary!
{code}
The reason was that server did not have many configs selected, and did not send
them to agent during downgrade. There are few issues here:
- During upgrade from 2.4 to 2.5, finalize did not update current cluster
version. As a result config helpers have gone mad
- As a result of previous issue, some Configure tasks failed to execute
- During downgrade from 2.6 , looks like cluster entity DB state was not
consistent after config selection, so sometimes configs were not selected is
some cases. I managed to reproduce that only once,
it's a race condition that is very hard to catch/trace in debugger.
Diffs (updated)
-----
ambari-server/src/main/java/org/apache/ambari/server/serveraction/upgrades/FinalizeUpgradeAction.java
bbff9fd813
ambari-server/src/main/java/org/apache/ambari/server/state/cluster/ClusterImpl.java
9fd7663b96
Diff: https://reviews.apache.org/r/62571/diff/2/
Changes: https://reviews.apache.org/r/62571/diff/1-2/
Testing
-------
mvn clean test
live test on cluster. Not 100% sure that the last issue is fixed, but probably
yes
Thanks,
Dmitro Lisnichenko