[
https://issues.apache.org/jira/browse/AMBARI-11908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14585163#comment-14585163
]
Michael Harp commented on AMBARI-11908:
---------------------------------------
No obvious reason, we do not have firewall or selinux installed. Also our 2
Master blueprint has NN and RM collocated and does not exhibit this issue. My
guess is a timing issue related to the move to parallel deployments?
https://github.com/hortonworks/td-ape-tigereye/blob/master/hadoopBuilder/src/config/ambari/2-master-HDP-2.3.json
> ResourceManager fails initial start when not colocated with Namenode
> --------------------------------------------------------------------
>
> Key: AMBARI-11908
> URL: https://issues.apache.org/jira/browse/AMBARI-11908
> Project: Ambari
> Issue Type: Bug
> Environment: sles11sp3
> hdp-2.3.0.0-2346
> ambari-2.1.0-1064
> Reporter: Michael Harp
> Fix For: 2.1.0
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> Deploying with blueprint and Namenode-HA and Yarn-HA enabled resoucemanager
> fails initial start. Subsequent starts succeed.
> {code}
> 2015-06-13 03:41:24,922 - Getting jmx metrics from NN failed. URL:
> http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus
> Traceback (most recent call last):
> File
> "/usr/lib/python2.6/site-packages/resource_management/libraries/functions/jmx.py",
> line 37, in get_value_from_jmx
> _, data = shell.checked_call(cmd, user=run_user, quiet=False)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 70, in inner
> result = function(command, **kwargs)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 92, in checked_call
> tries=tries, try_sleep=try_sleep)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 140, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 282, in _call
> raise Fail(err_msg)
> Fail: Execution of 'curl -s
> 'http://helios1.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus''
> returned 7.
> 2015-06-13 03:41:25,000 - Getting jmx metrics from NN failed. URL:
> http://helios2.labs.teradata.com:50070/jmx?qry=Hadoop:service=NameNode,name=NameNodeStatus
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)