-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/62571/#review186249
-----------------------------------------------------------


Ship it!





- Nate Cole


On Sept. 26, 2017, 9:18 a.m., Dmitro Lisnichenko wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/62571/
> -----------------------------------------------------------
> 
> (Updated Sept. 26, 2017, 9:18 a.m.)
> 
> 
> Review request for Ambari, Jonathan Hurley and Nate Cole.
> 
> 
> Bugs: AMBARI-22060
>     https://issues.apache.org/jira/browse/AMBARI-22060
> 
> 
> Repository: ambari
> 
> 
> Description
> -------
> 
> During the downgrade process, we ran into the following error while 
> restarting Ranger Admin:
> 
> {code}
> Traceback (most recent call last):
>   File "/var/lib/ambari-agent/cache/common-services/RANGER/0.4.0/package/scripts/ranger_admin.py", line 216, in <module>
>     RangerAdmin().execute()
>   File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 329, in execute
>     method(env)
>   File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 850, in restart
>     self.start(env, upgrade_type=upgrade_type)
>   File "/var/lib/ambari-agent/cache/common-services/RANGER/0.4.0/package/scripts/ranger_admin.py", line 93, in start
>     setup_ranger_audit_solr()
>   File "/var/lib/ambari-agent/cache/common-services/RANGER/0.4.0/package/scripts/setup_ranger_xml.py", line 705, in setup_ranger_audit_solr
>     new_service_principals = [params.ranger_admin_jaas_principal])
>   File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/solr_cloud_util.py", line 329, in add_solr_roles
>     new_service_users.append(__remove_host_from_principal(new_service_user, kerberos_realm))
>   File "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/solr_cloud_util.py", line 266, in __remove_host_from_principal
>     if not realm:
>   File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/config_dictionary.py", line 73, in __getattr__
>     raise Fail("Configuration parameter '" + self.name + "' was not found in configurations dictionary!")
> resource_management.core.exceptions.Fail: Configuration parameter 'kerberos-env' was not found in configurations dictionary!
> 
> {code}
> 
> The root cause was that the server did not have many configs selected, so it 
> did not send them to the agent during downgrade. There are a few issues here:
> - During the upgrade from 2.4 to 2.5, finalize did not update the current 
> cluster version. As a result, the config helpers misbehaved.
> - As a consequence of the previous issue, some Configure tasks failed to 
> execute.
> - During the downgrade from 2.6, it looks like the cluster entity DB state was 
> not consistent after config selection, so in some cases configs were not 
> selected. I managed to reproduce that only once; it is a race condition that 
> is very hard to catch or trace in a debugger.
> 
> 
> Diffs
> -----
> 
>   ambari-server/src/main/java/org/apache/ambari/server/serveraction/upgrades/FinalizeUpgradeAction.java bbff9fd813 
>   ambari-server/src/main/java/org/apache/ambari/server/state/cluster/ClusterImpl.java 9fd7663b96 
> 
> 
> Diff: https://reviews.apache.org/r/62571/diff/1/
> 
> 
> Testing
> -------
> 
> mvn clean test
> Live test on a cluster. I am not 100% sure that the last issue (the race 
> condition) is fixed, but it probably is.
> 
> 
> Thanks,
> 
> Dmitro Lisnichenko
> 
>
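
For readers following the quoted traceback, here is a minimal sketch of why the failure only shows up at `if not realm:` rather than where the config is first looked up. It is not Ambari's code: `MissingConfig` and `ConfigDict` are hypothetical names, and `Fail` is a local stand-in for `resource_management.core.exceptions.Fail`. The assumption is that a config type the server never sent is represented by a placeholder that raises only when it is actually used. Per the traceback, the Python 2.6 agent reaches that error through `__getattr__`; Python 3 does not route implicit special-method lookups through `__getattr__`, so the sketch defines `__bool__` explicitly instead.

```python
class Fail(Exception):
    """Local stand-in for resource_management.core.exceptions.Fail."""


class MissingConfig:
    """Hypothetical placeholder for a config type the server never sent."""

    def __init__(self, name):
        self.name = name

    def __getitem__(self, key):
        # Nested lookups stay lazy: indexing returns the same placeholder.
        return self

    def __bool__(self):
        # Any truth test, such as `if not realm:`, surfaces the error.
        raise Fail("Configuration parameter '" + self.name
                   + "' was not found in configurations dictionary!")


class ConfigDict(dict):
    """Hypothetical dict that defers the failure until a value is used."""

    def __missing__(self, key):
        # Called by dict lookup when the key is absent.
        return MissingConfig(key)


# Only ranger-env was "selected"; kerberos-env is absent, as in the bug report.
configurations = ConfigDict({"ranger-env": {"ranger_user": "ranger"}})

# The lookup itself succeeds and yields a placeholder.
realm = configurations["kerberos-env"]["realm"]

try:
    if not realm:
        pass
except Fail as err:
    print(err)
```

Under these assumptions, a config type missing from the command sent to the agent stays invisible until a script first tests or reads the value, which would explain why the error appears deep inside the Solr helper and not where the params are loaded.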
