[
https://issues.apache.org/jira/browse/METRON-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Allen reassigned METRON-894:
---------------------------------
Assignee: Nick Allen
> Ambari "Restart Metron Parsers" Fails If Any Parser Not Running
> ---------------------------------------------------------------
>
> Key: METRON-894
> URL: https://issues.apache.org/jira/browse/METRON-894
> Project: Metron
> Issue Type: Bug
> Affects Versions: 0.3.1
> Reporter: Nick Allen
> Assignee: Nick Allen
> Priority: Minor
>
> The "Restart Metron Parsers" action failed in Ambari. It failed because the
> "stop" portion of the "restart" failed because the YAF topology was not
> running. This should not be treated as an error condition.
> I was able to work around this by simply using a "start" operation instead of
> a "restart".
> {code}
> stderr: /var/lib/ambari-agent/data/errors-966.txt
> Traceback (most recent call last):
> File
> "/var/lib/ambari-agent/cache/common-services/METRON/0.4.0/package/scripts/parser_master.py",
> line 93, in <module>
> ParserMaster().execute()
> File
> "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py",
> line 280, in execute
> method(env)
> File
> "/var/lib/ambari-agent/cache/common-services/METRON/0.4.0/package/scripts/parser_master.py",
> line 81, in restart
> commands.restart_parser_topologies(env)
> File
> "/var/lib/ambari-agent/cache/common-services/METRON/0.4.0/package/scripts/parser_commands.py",
> line 146, in restart_parser_topologies
> self.stop_parser_topologies()
> File
> "/var/lib/ambari-agent/cache/common-services/METRON/0.4.0/package/scripts/parser_commands.py",
> line 141, in stop_parser_topologies
> Execute(stop_cmd, user=self.__params.metron_user)
> File "/usr/lib/python2.6/site-packages/resource_management/core/base.py",
> line 155, in __init__
> self.env.run()
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
> line 160, in run
> self.run_action(resource, action)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/environment.py",
> line 124, in run_action
> provider_action()
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py",
> line 273, in action_run
> tries=self.resource.tries, try_sleep=self.resource.try_sleep)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 70, in inner
> result = function(command, **kwargs)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 92, in checked_call
> tries=tries, try_sleep=try_sleep)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 140, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py",
> line 293, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> resource_management.core.exceptions.ExecutionFailed: Execution of 'storm kill
> yaf' returned 1. Running: /usr/jdk64/jdk1.8.0_77/bin/java -client
> -Ddaemon.name= -Dstorm.options= -Dstorm.home=/usr/hdp/2.5.3.0-37/storm
> -Dstorm.log.dir=/var/log/storm
> -Djava.library.path=/usr/local/lib:/opt/local/lib:/usr/lib:/usr/hdp/current/storm-client/lib
> -Dstorm.conf.file= -cp
> /usr/hdp/2.5.3.0-37/storm/lib/clojure-1.7.0.jar:/usr/hdp/2.5.3.0-37/storm/lib/disruptor-3.3.2.jar:/usr/hdp/2.5.3.0-37/storm/lib/log4j-slf4j-impl-2.1.jar:/usr/hdp/2.5.3.0-37/storm/lib/storm-rename-hack-1.0.1.2.5.3.0-37.jar:/usr/hdp/2.5.3.0-37/storm/lib/log4j-api-2.1.jar:/usr/hdp/2.5.3.0-37/storm/lib/ring-cors-0.1.5.jar:/usr/hdp/2.5.3.0-37/storm/lib/log4j-core-2.1.jar:/usr/hdp/2.5.3.0-37/storm/lib/asm-5.0.3.jar:/usr/hdp/2.5.3.0-37/storm/lib/log4j-over-slf4j-1.6.6.jar:/usr/hdp/2.5.3.0-37/storm/lib/slf4j-api-1.7.7.jar:/usr/hdp/2.5.3.0-37/storm/lib/servlet-api-2.5.jar:/usr/hdp/2.5.3.0-37/storm/lib/zookeeper.jar:/usr/hdp/2.5.3.0-37/storm/lib/minlog-1.3.0.jar:/usr/hdp/2.5.3.0-37/storm/lib/kryo-3.0.3.jar:/usr/hdp/2.5.3.0-37/storm/lib/storm-core-1.0.1.2.5.3.0-37.jar:/usr/hdp/2.5.3.0-37/storm/lib/reflectasm-1.10.1.jar:/usr/hdp/2.5.3.0-37/storm/lib/objenesis-2.1.jar:/usr/hdp/2.5.3.0-37/storm/lib/ambari-metrics-storm-sink.jar:/usr/hdp/2.5.3.0-37/storm/extlib-daemon/ranger-storm-plugin-shim-0.6.0.2.5.3.0-37.jar:/usr/hdp/2.5.3.0-37/storm/extlib-daemon/ojdbc6.jar:/usr/hdp/2.5.3.0-37/storm/extlib-daemon/ranger-plugin-classloader-0.6.0.2.5.3.0-37.jar:/usr/hdp/current/storm-supervisor/conf:/usr/hdp/2.5.3.0-37/storm/bin
> org.apache.storm.command.kill_topology yaf
> Exception in thread "main" NotAliveException(msg:yaf is not alive)
> at
> org.apache.storm.generated.Nimbus$killTopologyWithOpts_result$killTopologyWithOpts_resultStandardScheme.read(Nimbus.java:10748)
> at
> org.apache.storm.generated.Nimbus$killTopologyWithOpts_result$killTopologyWithOpts_resultStandardScheme.read(Nimbus.java:10734)
> at
> org.apache.storm.generated.Nimbus$killTopologyWithOpts_result.read(Nimbus.java:10676)
> at
> org.apache.storm.thrift.TServiceClient.receiveBase(TServiceClient.java:86)
> at
> org.apache.storm.generated.Nimbus$Client.recv_killTopologyWithOpts(Nimbus.java:383)
> at
> org.apache.storm.generated.Nimbus$Client.killTopologyWithOpts(Nimbus.java:369)
> at
> org.apache.storm.command.kill_topology$_main.doInvoke(kill_topology.clj:27)
> at clojure.lang.RestFn.applyTo(RestFn.java:137)
> at org.apache.storm.command.kill_topology.main(Unknown Source)
> stdout: /var/lib/ambari-agent/data/output-966.txt
> 2017-04-26 18:21:46,880 - The hadoop conf dir
> /usr/hdp/current/hadoop-client/conf exists, will call conf-select on it for
> version 2.5.3.0-37
> 2017-04-26 18:21:46,882 - Checking if need to create versioned conf dir
> /etc/hadoop/2.5.3.0-37/0
> 2017-04-26 18:21:46,884 - call[('ambari-python-wrap',
> u'/usr/bin/conf-select', 'create-conf-dir', '--package', 'hadoop',
> '--stack-version', '2.5.3.0-37', '--conf-version', '0')] {'logoutput': False,
> 'sudo': True, 'quiet': False, 'stderr': -1}
> 2017-04-26 18:21:46,921 - call returned (1, '/etc/hadoop/2.5.3.0-37/0 exist
> already', '')
> 2017-04-26 18:21:46,922 - checked_call[('ambari-python-wrap',
> u'/usr/bin/conf-select', 'set-conf-dir', '--package', 'hadoop',
> '--stack-version', '2.5.3.0-37', '--conf-version', '0')] {'logoutput': False,
> 'sudo': True, 'quiet': False}
> 2017-04-26 18:21:46,960 - checked_call returned (0, '')
> 2017-04-26 18:21:46,962 - Ensuring that hadoop has the correct symlink
> structure
> 2017-04-26 18:21:46,962 - Using hadoop conf dir:
> /usr/hdp/current/hadoop-client/conf
> 2017-04-26 18:21:47,150 - The hadoop conf dir
> /usr/hdp/current/hadoop-client/conf exists, will call conf-select on it for
> version 2.5.3.0-37
> 2017-04-26 18:21:47,152 - Checking if need to create versioned conf dir
> /etc/hadoop/2.5.3.0-37/0
> 2017-04-26 18:21:47,155 - call[('ambari-python-wrap',
> u'/usr/bin/conf-select', 'create-conf-dir', '--package', 'hadoop',
> '--stack-version', '2.5.3.0-37', '--conf-version', '0')] {'logoutput': False,
> 'sudo': True, 'quiet': False, 'stderr': -1}
> 2017-04-26 18:21:47,193 - call returned (1, '/etc/hadoop/2.5.3.0-37/0 exist
> already', '')
> 2017-04-26 18:21:47,194 - checked_call[('ambari-python-wrap',
> u'/usr/bin/conf-select', 'set-conf-dir', '--package', 'hadoop',
> '--stack-version', '2.5.3.0-37', '--conf-version', '0')] {'logoutput': False,
> 'sudo': True, 'quiet': False}
> 2017-04-26 18:21:47,232 - checked_call returned (0, '')
> 2017-04-26 18:21:47,233 - Ensuring that hadoop has the correct symlink
> structure
> 2017-04-26 18:21:47,233 - Using hadoop conf dir:
> /usr/hdp/current/hadoop-client/conf
> 2017-04-26 18:21:47,235 - Group['metron'] {}
> 2017-04-26 18:21:47,238 - Group['livy'] {}
> 2017-04-26 18:21:47,238 - Group['elasticsearch'] {}
> 2017-04-26 18:21:47,238 - Group['spark'] {}
> 2017-04-26 18:21:47,239 - Group['zeppelin'] {}
> 2017-04-26 18:21:47,239 - Group['hadoop'] {}
> 2017-04-26 18:21:47,239 - Group['kibana'] {}
> 2017-04-26 18:21:47,240 - Group['users'] {}
> 2017-04-26 18:21:47,240 - User['hive'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,242 - User['storm'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,243 - User['zookeeper'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,244 - User['ams'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,245 - User['tez'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'users']}
> 2017-04-26 18:21:47,246 - User['zeppelin'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,247 - User['metron'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,248 - User['livy'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,248 - User['elasticsearch'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,249 - User['spark'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,250 - User['ambari-qa'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'users']}
> 2017-04-26 18:21:47,251 - User['kafka'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,252 - User['hdfs'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,253 - User['yarn'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,254 - User['kibana'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,255 - User['mapred'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,256 - User['hbase'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,257 - User['hcat'] {'gid': 'hadoop',
> 'fetch_nonlocal_groups': True, 'groups': [u'hadoop']}
> 2017-04-26 18:21:47,258 - File['/var/lib/ambari-agent/tmp/changeUid.sh']
> {'content': StaticFile('changeToSecureUid.sh'), 'mode': 0555}
> 2017-04-26 18:21:47,261 - Execute['/var/lib/ambari-agent/tmp/changeUid.sh
> ambari-qa
> /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa']
> {'not_if': '(test $(id -u ambari-qa) -gt 1000) || (false)'}
> 2017-04-26 18:21:47,269 - Skipping
> Execute['/var/lib/ambari-agent/tmp/changeUid.sh ambari-qa
> /tmp/hadoop-ambari-qa,/tmp/hsperfdata_ambari-qa,/home/ambari-qa,/tmp/ambari-qa,/tmp/sqoop-ambari-qa']
> due to not_if
> 2017-04-26 18:21:47,270 - Directory['/tmp/hbase-hbase'] {'owner': 'hbase',
> 'create_parents': True, 'mode': 0775, 'cd_access': 'a'}
> 2017-04-26 18:21:47,272 - File['/var/lib/ambari-agent/tmp/changeUid.sh']
> {'content': StaticFile('changeToSecureUid.sh'), 'mode': 0555}
> 2017-04-26 18:21:47,274 - Execute['/var/lib/ambari-agent/tmp/changeUid.sh
> hbase /home/hbase,/tmp/hbase,/usr/bin/hbase,/var/log/hbase,/tmp/hbase-hbase']
> {'not_if': '(test $(id -u hbase) -gt 1000) || (false)'}
> 2017-04-26 18:21:47,281 - Skipping
> Execute['/var/lib/ambari-agent/tmp/changeUid.sh hbase
> /home/hbase,/tmp/hbase,/usr/bin/hbase,/var/log/hbase,/tmp/hbase-hbase'] due
> to not_if
> 2017-04-26 18:21:47,282 - Group['hdfs'] {}
> 2017-04-26 18:21:47,283 - User['hdfs'] {'fetch_nonlocal_groups': True,
> 'groups': [u'hadoop', u'hdfs']}
> 2017-04-26 18:21:47,284 - FS Type:
> 2017-04-26 18:21:47,284 - Directory['/etc/hadoop'] {'mode': 0755}
> 2017-04-26 18:21:47,308 -
> File['/usr/hdp/current/hadoop-client/conf/hadoop-env.sh'] {'content':
> InlineTemplate(...), 'owner': 'hdfs', 'group': 'hadoop'}
> 2017-04-26 18:21:47,310 -
> Directory['/var/lib/ambari-agent/tmp/hadoop_java_io_tmpdir'] {'owner':
> 'hdfs', 'group': 'hadoop', 'mode': 01777}
> 2017-04-26 18:21:47,330 - Execute[('setenforce', '0')] {'not_if': '(! which
> getenforce ) || (which getenforce && getenforce | grep -q Disabled)', 'sudo':
> True, 'only_if': 'test -f /selinux/enforce'}
> 2017-04-26 18:21:47,341 - Skipping Execute[('setenforce', '0')] due to not_if
> 2017-04-26 18:21:47,342 - Directory['/var/log/hadoop'] {'owner': 'root',
> 'create_parents': True, 'group': 'hadoop', 'mode': 0775, 'cd_access': 'a'}
> 2017-04-26 18:21:47,346 - Directory['/var/run/hadoop'] {'owner': 'root',
> 'create_parents': True, 'group': 'root', 'cd_access': 'a'}
> 2017-04-26 18:21:47,346 - Directory['/tmp/hadoop-hdfs'] {'owner': 'hdfs',
> 'create_parents': True, 'cd_access': 'a'}
> 2017-04-26 18:21:47,354 -
> File['/usr/hdp/current/hadoop-client/conf/commons-logging.properties']
> {'content': Template('commons-logging.properties.j2'), 'owner': 'hdfs'}
> 2017-04-26 18:21:47,357 -
> File['/usr/hdp/current/hadoop-client/conf/health_check'] {'content':
> Template('health_check.j2'), 'owner': 'hdfs'}
> 2017-04-26 18:21:47,358 -
> File['/usr/hdp/current/hadoop-client/conf/log4j.properties'] {'content': ...,
> 'owner': 'hdfs', 'group': 'hadoop', 'mode': 0644}
> 2017-04-26 18:21:47,377 -
> File['/usr/hdp/current/hadoop-client/conf/hadoop-metrics2.properties']
> {'content': Template('hadoop-metrics2.properties.j2'), 'owner': 'hdfs',
> 'group': 'hadoop'}
> 2017-04-26 18:21:47,378 -
> File['/usr/hdp/current/hadoop-client/conf/task-log4j.properties'] {'content':
> StaticFile('task-log4j.properties'), 'mode': 0755}
> 2017-04-26 18:21:47,379 -
> File['/usr/hdp/current/hadoop-client/conf/configuration.xsl'] {'owner':
> 'hdfs', 'group': 'hadoop'}
> 2017-04-26 18:21:47,386 - File['/etc/hadoop/conf/topology_mappings.data']
> {'owner': 'hdfs', 'content': Template('topology_mappings.data.j2'),
> 'only_if': 'test -d /etc/hadoop/conf', 'group': 'hadoop'}
> 2017-04-26 18:21:47,391 - File['/etc/hadoop/conf/topology_script.py']
> {'content': StaticFile('topology_script.py'), 'only_if': 'test -d
> /etc/hadoop/conf', 'mode': 0755}
> 2017-04-26 18:21:47,682 - The hadoop conf dir
> /usr/hdp/current/hadoop-client/conf exists, will call conf-select on it for
> version 2.5.3.0-37
> 2017-04-26 18:21:47,684 - Checking if need to create versioned conf dir
> /etc/hadoop/2.5.3.0-37/0
> 2017-04-26 18:21:47,687 - call[('ambari-python-wrap',
> u'/usr/bin/conf-select', 'create-conf-dir', '--package', 'hadoop',
> '--stack-version', '2.5.3.0-37', '--conf-version', '0')] {'logoutput': False,
> 'sudo': True, 'quiet': False, 'stderr': -1}
> 2017-04-26 18:21:47,726 - call returned (1, '/etc/hadoop/2.5.3.0-37/0 exist
> already', '')
> 2017-04-26 18:21:47,727 - checked_call[('ambari-python-wrap',
> u'/usr/bin/conf-select', 'set-conf-dir', '--package', 'hadoop',
> '--stack-version', '2.5.3.0-37', '--conf-version', '0')] {'logoutput': False,
> 'sudo': True, 'quiet': False}
> 2017-04-26 18:21:47,766 - checked_call returned (0, '')
> 2017-04-26 18:21:47,767 - Ensuring that hadoop has the correct symlink
> structure
> 2017-04-26 18:21:47,767 - Using hadoop conf dir:
> /usr/hdp/current/hadoop-client/conf
> 2017-04-26 18:21:47,771 - Create Metron Local Config Directory
> 2017-04-26 18:21:47,771 - Configure Metron global.json
> 2017-04-26 18:21:47,771 - Directory['/usr/metron/0.4.0/config/zookeeper']
> {'owner': 'metron', 'group': 'metron', 'mode': 0755}
> 2017-04-26 18:21:47,781 -
> File['/usr/metron/0.4.0/config/zookeeper/global.json'] {'content':
> InlineTemplate(...), 'owner': 'metron'}
> 2017-04-26 18:21:47,786 -
> File['/usr/metron/0.4.0/config/zookeeper/../elasticsearch.properties']
> {'content': InlineTemplate(...), 'owner': 'metron'}
> 2017-04-26 18:21:47,787 - Loading config into ZooKeeper
> 2017-04-26 18:21:47,787 - Execute['/usr/metron/0.4.0/bin/zk_load_configs.sh
> --mode PUSH -i /usr/metron/0.4.0/config/zookeeper -z
> y113.l42scl.hortonworks.com:2181,y114.l42scl.hortonworks.com:2181,y115.l42scl.hortonworks.com:2181']
> {'path': [u'/usr/jdk64/jdk1.8.0_77/bin']}
> 2017-04-26 18:21:49,396 - Calling security setup
> 2017-04-26 18:21:49,397 - Restarting the parser topologies
> 2017-04-26 18:21:49,397 - Stopping parsers
> 2017-04-26 18:21:49,397 - Stopping bro
> 2017-04-26 18:21:49,397 - Execute['storm kill bro'] {'user': 'metron'}
> 2017-04-26 18:21:55,400 - Stopping snort
> 2017-04-26 18:21:55,401 - Execute['storm kill snort'] {'user': 'metron'}
> 2017-04-26 18:22:01,016 - Stopping yaf
> 2017-04-26 18:22:01,017 - Execute['storm kill yaf'] {'user': 'metron'}
> Command failed after 1 tries
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)