[ https://issues.apache.org/jira/browse/AMBARI-18930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alejandro Fernandez updated AMBARI-18930:
-----------------------------------------
    Attachment: AMBARI-19830.patch

> Ambari thinks a component is installed, even if the install is partially done
> -----------------------------------------------------------------------------
>
>                 Key: AMBARI-18930
>                 URL: https://issues.apache.org/jira/browse/AMBARI-18930
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-server
>    Affects Versions: 2.4.2
>            Reporter: Zsombor Gegesy
>         Attachments: AMBARI-19830.patch
>
>
> We have lots of deployment problems that end up in a state where Ambari
> thinks a component is installed, but when it tries to start the component,
> startup fails because the Kerberos keytabs are missing.
> The investigation found the following scenario:
> * blueprint install
> * To install the datanode, the following steps are needed:
> ** install the datanode package on host A
> ** generate keytab(s) on the server
> ** set up the datanode keytab on host A
> * In the "Install components on host ...." step, the following steps are
> scheduled:
> ** package install
> ** (other package install)
> ** keytab setup
> * If there is an error in the 'other package install', then the Kerberos key
> generation and the host setup are skipped.
> * But Ambari thinks this component is installed, because the package
> install step succeeded.
> * However, when/if someone tries to start the component, it fails because of
> the missing keytabs.
> * The only way to fix this is to manually "Delete" and "Reinstall" the
> component, which doesn't make the best impression on the user.
> Proposed ideas to fix:
> * Re-order the install steps: first generate the keytabs on the server,
> distribute them to the host, and then install the package. This would ensure
> that when the package is installed on a host, its keytabs are there as well.
> * Mark the partially installed component as 'install failed' in the UI, so
> the user can retry the install when they see it.
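
The scenario in the report can be sketched as a small model. This is only an illustration, not Ambari's code: the `Step` class, the two functions, and the state strings are hypothetical names chosen for the example. It contrasts a check that looks only at the package install step with one that requires every step the component depends on to have succeeded, which is roughly what the second proposed fix would need.

```python
from dataclasses import dataclass
from enum import Enum


class Result(Enum):
    SUCCEEDED = "succeeded"
    FAILED = "failed"
    SKIPPED = "skipped"


@dataclass(frozen=True)
class Step:
    """One scheduled install step (hypothetical model, not an Ambari class)."""
    name: str
    result: Result
    needed_by: frozenset  # components that cannot run without this step
    is_package_install: bool = False


def naive_state(component, steps):
    """Reported behaviour: only the component's own package install counts."""
    ok = any(
        s.is_package_install
        and component in s.needed_by
        and s.result is Result.SUCCEEDED
        for s in steps
    )
    return "INSTALLED" if ok else "INSTALL_FAILED"


def strict_state(component, steps):
    """Sketch of the proposal: every step the component needs must succeed."""
    required = [s for s in steps if component in s.needed_by]
    if required and all(s.result is Result.SUCCEEDED for s in required):
        return "INSTALLED"
    return "INSTALL_FAILED"


# The sequence from the report: the other package fails, so the keytab
# setup for the datanode is skipped.
steps = [
    Step("datanode package install", Result.SUCCEEDED,
         frozenset({"DATANODE"}), is_package_install=True),
    Step("other package install", Result.FAILED,
         frozenset({"OTHER"}), is_package_install=True),
    Step("keytab setup", Result.SKIPPED,
         frozenset({"DATANODE", "OTHER"})),
]

print(naive_state("DATANODE", steps))
print(strict_state("DATANODE", steps))
```

With the naive check the datanode is reported as installed even though its keytab step never ran; the strict check reports it as failed, so a retry could be offered instead of a manual "Delete" and "Reinstall".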
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)