Hi David,

Thank you very much for reporting these issues and submitting patches. AMBARI-10319 has been committed to branch-2.0.0, and AMBARI-10320 has been committed to trunk as well as branch-2.0.0. AMBARI-10321 has been marked invalid, as the patch is not needed for a successful build and deploy of ambari-metrics-assembly.deb.

I don't see AMBARI-10317 as a release blocker (though it is a critical bug). The workaround is to set PID directories to something other than /var/run/* if you are on Ubuntu.
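To illustrate why the workaround helps, here is a minimal sketch of the failure mode, assuming /var/run is tmpfs-backed on Ubuntu so that its subdirectories disappear on reboot (which matches the report that /var/run/knox was missing after the hosts came back up). The helper name ensure_dirs is hypothetical and is not part of Ambari or of the AMBARI-10317 patch; it only shows that a PID directory has to be recreated before anything like chown touches it.

```python
import os
import tempfile


def ensure_dirs(paths, mode=0o755):
    """Create each directory in `paths` if it is missing.

    Hypothetical helper for illustration only (not Ambari code).
    Returns the list of directories that had to be created.
    """
    created = []
    for path in paths:
        if not os.path.isdir(path):
            os.makedirs(path, mode)
            created.append(path)
    return created


if __name__ == "__main__":
    # A fresh temp directory stands in for a /var/run that was wiped by a reboot.
    fake_run = tempfile.mkdtemp()
    pid_dir = os.path.join(fake_run, "knox")

    # First call: the PID directory is missing, so it gets created.
    print(ensure_dirs([pid_dir]))
    # Second call: the directory already exists, so nothing is created.
    print(ensure_dirs([pid_dir]))
```

Pointing the PID directory at a location that survives reboots, as suggested above, avoids the need for this kind of check entirely.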
Thanks,
Yusaku

On 4/2/15 1:20 PM, "McWhorter, David" <[email protected]> wrote:

>Hi all, I've created reviews for 3 issues I found testing Ambari
>2.0.0-rc2 on Ubuntu 12.04:
>
>https://reviews.apache.org/r/32780/ addressing
>https://issues.apache.org/jira/browse/AMBARI-10321
>https://reviews.apache.org/r/32779/ addressing
>https://issues.apache.org/jira/browse/AMBARI-10320
>https://reviews.apache.org/r/32777/ addressing
>https://issues.apache.org/jira/browse/AMBARI-10319
>
>Please let me know if I need to do anything different procedurally to
>submit these fixes; I've tried to follow the instructions at
>https://cwiki.apache.org/confluence/display/AMBARI/How+to+Contribute.
>
>I also found https://issues.apache.org/jira/browse/AMBARI-10317 which I
>believe Alejandro Fernandez is looking into fixing.
>
>All of these issues kept me from either building or deploying components
>of Ambari 2.0.0-rc2 on Ubuntu 12.04, so it may be advisable to include
>fixes in the 2.0.0 release.
>
>Thanks,
>David
>
>--
>David McWhorter
>Senior Developer, Foundations
>Informatics and Technology Services
>Office: 434.260.5232 | Mobile: 434.227.2551
>[email protected] | Premier, Inc. (NASDAQ: PINC)
>
>
>From: <McWhorter>, David McWhorter <[email protected]>
>Date: Wednesday, April 1, 2015 at 12:40 PM
>To: "[email protected]" <[email protected]>
>Subject: error on Knox restart - ambari 2.0.0-rc2
>
>Hello, we are testing deploying an HDP 2.2 cluster using Ambari
>2.0.0-rc2 running on Ubuntu 12.04. I've been able to set up a cluster
>running HDFS, MapReduce2, YARN, Zookeeper, Knox, Ranger, and Ambari
>Metrics. When I shut down the whole cluster using Actions -> Stop All in
>Ambari, reboot the hosts, and then try to restart the cluster, I see the
>error below when restarting the Knox gateway. Can anyone tell me if this
>is a known error, or if something is obviously misconfigured? If not, I'm
>happy to create an issue and/or take a look at what is going on if
>someone can point me at where to look. As a note, the directory
>/var/run/knox is indeed missing on the master host.
>
>Thank you,
>David
>
>Knox Gateway startup log:
>
>
>2015-04-01 16:17:12,075 - Error while executing command 'start':
>Traceback (most recent call last):
>  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 214, in execute
>    method(env)
>  File "/usr/lib/python2.6/site-packages/ambari_commons/os_family_impl.py", line 89, in thunk
>    return fn(*args, **kwargs)
>  File "/var/lib/ambari-agent/cache/common-services/KNOX/0.5.0.2.2/package/scripts/knox_gateway.py", line 80, in start
>    self.configure(env)
>  File "/var/lib/ambari-agent/cache/common-services/KNOX/0.5.0.2.2/package/scripts/knox_gateway.py", line 64, in configure
>    knox()
>  File "/usr/lib/python2.6/site-packages/ambari_commons/os_family_impl.py", line 89, in thunk
>    return fn(*args, **kwargs)
>  File "/var/lib/ambari-agent/cache/common-services/KNOX/0.5.0.2.2/package/scripts/knox.py", line 99, in knox
>    sudo = True,
>  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 148, in __init__
>    self.env.run()
>  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 152, in run
>    self.run_action(resource, action)
>  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 118, in run_action
>    provider_action()
>  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 274, in action_run
>    raise ex
>Fail: Execution of 'chown -R knox:knox /var/lib/knox/data /var/log/knox /var/log/knox /var/run/knox /etc/knox/conf' returned 1. chown: cannot access `/var/run/knox': No such file or directory
>
>stdout: /var/lib/ambari-agent/data/output-107.txt
>
>2015-04-01 16:17:06,744 - u"Group['hadoop']" {'ignore_failures': False}
>2015-04-01 16:17:06,744 - Modifying group hadoop
>2015-04-01 16:17:06,797 - u"Group['users']" {'ignore_failures': False}
>2015-04-01 16:17:06,797 - Modifying group users
>2015-04-01 16:17:06,839 - u"Group['knox']" {'ignore_failures': False}
>2015-04-01 16:17:06,839 - Modifying group knox
>2015-04-01 16:17:06,886 - u"Group['ranger']" {'ignore_failures': False}
>2015-04-01 16:17:06,886 - Modifying group ranger
>2015-04-01 16:17:06,930 - u"User['mapred']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:06,930 - Modifying user mapred
>2015-04-01 16:17:06,976 - u"User['root']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:06,977 - Modifying user root
>2015-04-01 16:17:07,019 - u"User['ambari-qa']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'users']}
>2015-04-01 16:17:07,020 - Modifying user ambari-qa
>2015-04-01 16:17:07,066 - u"User['zookeeper']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,066 - Modifying user zookeeper
>2015-04-01 16:17:07,109 - u"User['rangerlogger']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,110 - Modifying user rangerlogger
>2015-04-01 16:17:07,152 - u"User['hdfs']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,152 - Modifying user hdfs
>2015-04-01 16:17:07,195 - u"User['knox']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,195 - Modifying user knox
>2015-04-01 16:17:07,238 - u"User['ranger']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,238 - Modifying user ranger
>2015-04-01 16:17:07,282 - u"User['yarn']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,283 - Modifying user yarn
>2015-04-01 16:17:07,326 - u"User['ams']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,327 - Modifying user ams
>2015-04-01 16:17:07,370 - u"User['rangeradmin']" {'gid': 'hadoop', 'ignore_failures': False, 'groups': [u'hadoop']}
>2015-04-01 16:17:07,370 - Modifying user rangeradmin
>2015-04-01 16:17:07,413 - u"File['/var/lib/ambari-agent/data/tmp/changeUid.sh']" {'content': StaticFile('changeToSecureUid.sh'), 'mode': 0555}
>2015-04-01 16:17:07,686 - u"Execute['/var/lib/ambari-agent/data/tmp/changeUid.sh ambari-qa /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa']" {'not_if': '(test $(id -u ambari-qa) -gt 1000) || (false)'}
>2015-04-01 16:17:07,728 - Skipping u"Execute['/var/lib/ambari-agent/data/tmp/changeUid.sh ambari-qa /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa']" due to not_if
>2015-04-01 16:17:07,728 - u"Group['hdfs']" {'ignore_failures': False}
>2015-04-01 16:17:07,728 - Modifying group hdfs
>2015-04-01 16:17:07,774 - u"User['hdfs']" {'ignore_failures': False, 'groups': [u'hadoop', 'hadoop', 'hdfs', u'hdfs']}
>2015-04-01 16:17:07,775 - Modifying user hdfs
>2015-04-01 16:17:07,818 - u"Directory['/etc/hadoop']" {'mode': 0755}
>2015-04-01 16:17:07,974 - u"Directory['/etc/hadoop/conf.empty']" {'owner': 'root', 'group': 'hadoop', 'recursive': True}
>2015-04-01 16:17:08,110 - u"Link['/etc/hadoop/conf']" {'not_if': 'ls /etc/hadoop/conf', 'to': '/etc/hadoop/conf.empty'}
>2015-04-01 16:17:08,153 - Skipping u"Link['/etc/hadoop/conf']" due to not_if
>2015-04-01 16:17:08,160 - u"File['/etc/hadoop/conf/hadoop-env.sh']" {'content': InlineTemplate(...), 'owner': 'hdfs', 'group': 'hadoop'}
>2015-04-01 16:17:08,396 - u"Execute['('setenforce', '0')']" {'sudo': True, 'only_if': 'test -f /selinux/enforce'}
>2015-04-01 16:17:08,448 - Skipping u"Execute['('setenforce', '0')']" due to only_if
>2015-04-01 16:17:08,448 - u"Directory['/var/log/hadoop']" {'owner': 'root', 'mode': 0775, 'group': 'hadoop', 'recursive': True, 'cd_access': 'a'}
>2015-04-01 16:17:08,843 - u"Directory['/var/run/hadoop']" {'owner': 'root', 'group': 'root', 'recursive': True, 'cd_access': 'a'}
>2015-04-01 16:17:08,886 - Creating directory u"Directory['/var/run/hadoop']"
>2015-04-01 16:17:09,066 - Changing group for /var/run/hadoop from 1000 to root
>2015-04-01 16:17:09,364 - u"Directory['/tmp/hadoop-hdfs']" {'owner': 'hdfs', 'recursive': True, 'cd_access': 'a'}
>2015-04-01 16:17:09,407 - Creating directory u"Directory['/tmp/hadoop-hdfs']"
>2015-04-01 16:17:09,587 - Changing owner for /tmp/hadoop-hdfs from 0 to hdfs
>2015-04-01 16:17:09,820 - u"File['/etc/hadoop/conf/commons-logging.properties']" {'content': Template('commons-logging.properties.j2'), 'owner': 'hdfs'}
>2015-04-01 16:17:10,049 - u"File['/etc/hadoop/conf/health_check']" {'content': Template('health_check-v2.j2'), 'owner': 'hdfs'}
>2015-04-01 16:17:10,272 - u"File['/etc/hadoop/conf/log4j.properties']" {'content': '...', 'owner': 'hdfs', 'group': 'hadoop', 'mode': 0644}
>2015-04-01 16:17:10,506 - u"File['/etc/hadoop/conf/hadoop-metrics2.properties']" {'content': Template('hadoop-metrics2.properties.j2'), 'owner': 'hdfs'}
>2015-04-01 16:17:10,732 - u"File['/etc/hadoop/conf/task-log4j.properties']" {'content': StaticFile('task-log4j.properties'), 'mode': 0755}
>2015-04-01 16:17:11,085 - u"Directory['/etc/knox/conf']" {'owner': 'knox', 'group': 'knox', 'recursive': True}
>2015-04-01 16:17:11,231 - u"XmlConfig['gateway-site.xml']" {'owner': 'knox', 'group': 'knox', 'conf_dir': '/etc/knox/conf', 'configuration_attributes': {}, 'configurations': ...}
>2015-04-01 16:17:11,239 - Generating config: /etc/knox/conf/gateway-site.xml
>2015-04-01 16:17:11,239 - u"File['/etc/knox/conf/gateway-site.xml']" {'owner': 'knox', 'content': InlineTemplate(...), 'group': 'knox', 'mode': None, 'encoding': 'UTF-8'}
>2015-04-01 16:17:11,422 - Writing u"File['/etc/knox/conf/gateway-site.xml']" because contents don't match
>2015-04-01 16:17:11,561 - u"File['/etc/knox/conf/gateway-log4j.properties']" {'content': '...', 'owner': 'knox', 'group': 'knox', 'mode': 0644}
>2015-04-01 16:17:11,790 - u"File['/etc/knox/conf/topologies/default.xml']" {'content': InlineTemplate(...), 'owner': 'knox', 'group': 'knox'}
>2015-04-01 16:17:12,014 - u"Execute['('chown', '-R', u'knox:knox', '/var/lib/knox/data', '/var/log/knox', '/var/log/knox', u'/var/run/knox', '/etc/knox/conf')']" {'sudo': True}
>2015-04-01 16:17:12,075 - Error while executing command 'start':
>Traceback (most recent call last):
>  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 214, in execute
>    method(env)
>  File "/usr/lib/python2.6/site-packages/ambari_commons/os_family_impl.py", line 89, in thunk
>    return fn(*args, **kwargs)
>  File "/var/lib/ambari-agent/cache/common-services/KNOX/0.5.0.2.2/package/scripts/knox_gateway.py", line 80, in start
>    self.configure(env)
>  File "/var/lib/ambari-agent/cache/common-services/KNOX/0.5.0.2.2/package/scripts/knox_gateway.py", line 64, in configure
>    knox()
>  File "/usr/lib/python2.6/site-packages/ambari_commons/os_family_impl.py", line 89, in thunk
>    return fn(*args, **kwargs)
>  File "/var/lib/ambari-agent/cache/common-services/KNOX/0.5.0.2.2/package/scripts/knox.py", line 99, in knox
>    sudo = True,
>  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 148, in __init__
>    self.env.run()
>  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 152, in run
>    self.run_action(resource, action)
>  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 118, in run_action
>    provider_action()
>  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 274, in action_run
>    raise ex
>Fail: Execution of 'chown -R knox:knox /var/lib/knox/data /var/log/knox /var/log/knox /var/run/knox /etc/knox/conf' returned 1. chown: cannot access `/var/run/knox': No such file or directory
>2015-04-01 16:17:12,119 - Command: /usr/bin/hdp-select status knox-server > /tmp/tmp7GgVe1
>Output: knox-server - 2.2.0.0-2041
>
>
>--
>David McWhorter
>Senior Developer, Foundations
>Informatics and Technology Services
>Office: 434.260.5232 | Mobile: 434.227.2551
>[email protected] | Premier, Inc. (NASDAQ: PINC)
