[
https://issues.apache.org/jira/browse/AMBARI-21566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Onischuk updated AMBARI-21566:
-------------------------------------
Status: Patch Available (was: Open)
> RU: Hive service check was failed during RU
> -------------------------------------------
>
> Key: AMBARI-21566
> URL: https://issues.apache.org/jira/browse/AMBARI-21566
> Project: Ambari
> Issue Type: Bug
> Reporter: Andrew Onischuk
> Assignee: Andrew Onischuk
> Fix For: 2.5.2
>
> Attachments: AMBARI-21566.patch
>
>
> Cluster: 172.22.117.104:8080 - nats11-46-vdms-erm16tofnsec-s11 - 48h
> Job: <http://linux-jenkins.qe.hortonworks.com:8080/job/Run-HDP-Tests/551233/>
> Std.err
>
>
>
> Exception in thread Thread-4:
> Traceback (most recent call last):
> File "/usr/lib64/python2.6/threading.py", line 532, in __bootstrap_inner
> self.run()
> File "/usr/lib64/python2.6/threading.py", line 736, in run
> self.function(*self.args, **self.kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 383, in _on_timeout
> terminate_process(proc, terminate_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 48, in terminate_process
> kill_process_tree(proc)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 91, in kill_process_tree
> shell.checked_call(["bash", kill_tree_script, str(proc.pid),
> str(signal.SIGKILL)])
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 72, in inner
> result = function(command, **kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 102, in checked_call
> tries=tries, try_sleep=try_sleep,
> timeout_kill_strategy=timeout_kill_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 150, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 303, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> ExecutionFailed: Execution of 'bash
> /usr/lib/python2.6/site-packages/resource_management/core/files/killtree.sh
> 6363 9' returned 1. kill: top: invalid signal
> Try `kill --help' for more information.
>
> Exception in thread Thread-5:
> Traceback (most recent call last):
> File "/usr/lib64/python2.6/threading.py", line 532, in __bootstrap_inner
> self.run()
> File "/usr/lib64/python2.6/threading.py", line 736, in run
> self.function(*self.args, **self.kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 383, in _on_timeout
> terminate_process(proc, terminate_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 48, in terminate_process
> kill_process_tree(proc)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 91, in kill_process_tree
> shell.checked_call(["bash", kill_tree_script, str(proc.pid),
> str(signal.SIGKILL)])
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 72, in inner
> result = function(command, **kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 102, in checked_call
> tries=tries, try_sleep=try_sleep,
> timeout_kill_strategy=timeout_kill_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 150, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 303, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> ExecutionFailed: Execution of 'bash
> /usr/lib/python2.6/site-packages/resource_management/core/files/killtree.sh
> 8306 9' returned 1. kill: top: invalid signal
> Try `kill --help' for more information.
>
> Exception in thread Thread-6:
> Traceback (most recent call last):
> File "/usr/lib64/python2.6/threading.py", line 532, in __bootstrap_inner
> self.run()
> File "/usr/lib64/python2.6/threading.py", line 736, in run
> self.function(*self.args, **self.kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 383, in _on_timeout
> terminate_process(proc, terminate_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 48, in terminate_process
> kill_process_tree(proc)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 91, in kill_process_tree
> shell.checked_call(["bash", kill_tree_script, str(proc.pid),
> str(signal.SIGKILL)])
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 72, in inner
> result = function(command, **kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 102, in checked_call
> tries=tries, try_sleep=try_sleep,
> timeout_kill_strategy=timeout_kill_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 150, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 303, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> ExecutionFailed: Execution of 'bash
> /usr/lib/python2.6/site-packages/resource_management/core/files/killtree.sh
> 10179 9' returned 1. kill: top: invalid signal
> Try `kill --help' for more information.
>
> Exception in thread Thread-7:
> Traceback (most recent call last):
> File "/usr/lib64/python2.6/threading.py", line 532, in __bootstrap_inner
> self.run()
> File "/usr/lib64/python2.6/threading.py", line 736, in run
> self.function(*self.args, **self.kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 383, in _on_timeout
> terminate_process(proc, terminate_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 48, in terminate_process
> kill_process_tree(proc)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 91, in kill_process_tree
> shell.checked_call(["bash", kill_tree_script, str(proc.pid),
> str(signal.SIGKILL)])
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 72, in inner
> result = function(command, **kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 102, in checked_call
> tries=tries, try_sleep=try_sleep,
> timeout_kill_strategy=timeout_kill_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 150, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 303, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> ExecutionFailed: Execution of 'bash
> /usr/lib/python2.6/site-packages/resource_management/core/files/killtree.sh
> 11142 9' returned 1. kill: top: invalid signal
> Try `kill --help' for more information.
>
> Exception in thread Thread-8:
> Traceback (most recent call last):
> File "/usr/lib64/python2.6/threading.py", line 532, in __bootstrap_inner
> self.run()
> File "/usr/lib64/python2.6/threading.py", line 736, in run
> self.function(*self.args, **self.kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 383, in _on_timeout
> terminate_process(proc, terminate_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 48, in terminate_process
> kill_process_tree(proc)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 91, in kill_process_tree
> shell.checked_call(["bash", kill_tree_script, str(proc.pid),
> str(signal.SIGKILL)])
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 72, in inner
> result = function(command, **kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 102, in checked_call
> tries=tries, try_sleep=try_sleep,
> timeout_kill_strategy=timeout_kill_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 150, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 303, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> ExecutionFailed: Execution of 'bash
> /usr/lib/python2.6/site-packages/resource_management/core/files/killtree.sh
> 12560 9' returned 1. kill: top: invalid signal
> Try `kill --help' for more information.
>
> Exception in thread Thread-9:
> Traceback (most recent call last):
> File "/usr/lib64/python2.6/threading.py", line 532, in __bootstrap_inner
> self.run()
> File "/usr/lib64/python2.6/threading.py", line 736, in run
> self.function(*self.args, **self.kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 383, in _on_timeout
> terminate_process(proc, terminate_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 48, in terminate_process
> kill_process_tree(proc)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/signal_utils.py",
> line 91, in kill_process_tree
> shell.checked_call(["bash", kill_tree_script, str(proc.pid),
> str(signal.SIGKILL)])
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 72, in inner
> result = function(command, **kwargs)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 102, in checked_call
> tries=tries, try_sleep=try_sleep,
> timeout_kill_strategy=timeout_kill_strategy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 150, in _call_wrapper
> result = _call(command, **kwargs_copy)
> File
> "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line
> 303, in _call
> raise ExecutionFailed(err_msg, code, out, err)
> ExecutionFailed: Execution of 'bash
> /usr/lib/python2.6/site-packages/resource_management/core/files/killtree.sh
> 14431 9' returned 1. kill: top: invalid signal
> Try `kill --help' for more information.
>
>
> Python script has been killed due to timeout after waiting 300 secs
>
> std.out
>
>
>
> 2017-07-24 06:03:46,281 - MariaDB RedHat Support: false
> 2017-07-24 06:03:46,283 - An upgrade/downgrade for HDP-2.6.2.0-106 is in
> progress, determining which hadoop conf dir to use.
> 2017-07-24 06:03:46,320 - call['ambari-python-wrap /usr/bin/hdp-select
> status hadoop-client'] {'timeout': 20}
> 2017-07-24 06:03:46,563 - call returned (0, 'hadoop-client - 2.5.3.0-37')
> 2017-07-24 06:03:46,564 - hadoop-client is currently at version 2.5.3.0-37
> 2017-07-24 06:03:46,565 - hdp-select has not yet been called to update
> the symlink for this component, keep using version 2.5.3.0-37
> 2017-07-24 06:03:46,565 - Hadoop conf dir: /usr/hdp/2.5.3.0-37/hadoop/conf
> 2017-07-24 06:03:46,565 - Using hadoop conf dir:
> /usr/hdp/2.5.3.0-37/hadoop/conf
> 2017-07-24 06:03:46,567 - call['ambari-python-wrap /usr/bin/hdp-select
> status hadoop-client'] {'timeout': 20}
> 2017-07-24 06:03:46,895 - call returned (0, 'hadoop-client - 2.5.3.0-37')
> 2017-07-24 06:03:46,895 - hadoop-client is currently at version 2.5.3.0-37
> 2017-07-24 06:03:46,959 - call['ambari-python-wrap /usr/bin/hdp-select
> status hive-server2'] {'timeout': 20}
> 2017-07-24 06:03:47,222 - call returned (0, 'hive-server2 - 2.5.3.0-37')
> 2017-07-24 06:03:47,223 - Stack Feature Version Info: Cluster Stack=2.6,
> Cluster Current Version=2.5.3.0-37, Command Stack=HDP-2.6, Command
> Version=2.6.2.0-106, Upgrade Direction=upgrade -> 2.6.2.0-106
> 2017-07-24 06:03:47,343 - Running Hive Server checks
> 2017-07-24 06:03:47,343 - --------------------------
>
> 2017-07-24 06:03:47,344 - Server Address List :
> ['nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal'], Port : 10000, SSL
> KeyStore : None
> 2017-07-24 06:03:47,345 - Waiting for the Hive Server to start...
> 2017-07-24 06:03:47,345 - Execute['/usr/bin/kinit -kt
> /etc/security/keytabs/smokeuser.headless.keytab [email protected]; ']
> {'user': 'ambari-qa'}
> 2017-07-24 06:03:48,369 - Execute['! beeline -u
> 'jdbc:hive2://nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal:10000/;transportMode=binary;principal=hive/[email protected]'
> -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid
> URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'],
> 'timeout_kill_strategy': 2, 'timeout': 30, 'user': 'ambari-qa'}
> 2017-07-24 06:04:44,650 - Connection to
> nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal on port 10000 failed
> 2017-07-24 06:04:49,656 - Execute['/usr/bin/kinit -kt
> /etc/security/keytabs/smokeuser.headless.keytab [email protected]; ']
> {'user': 'ambari-qa'}
> 2017-07-24 06:04:50,577 - Execute['! beeline -u
> 'jdbc:hive2://nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal:10000/;transportMode=binary;principal=hive/[email protected]'
> -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid
> URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'],
> 'timeout_kill_strategy': 2, 'timeout': 30, 'user': 'ambari-qa'}
> 2017-07-24 06:05:46,295 - Connection to
> nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal on port 10000 failed
> 2017-07-24 06:05:51,301 - Execute['/usr/bin/kinit -kt
> /etc/security/keytabs/smokeuser.headless.keytab [email protected]; ']
> {'user': 'ambari-qa'}
> 2017-07-24 06:05:52,129 - Execute['! beeline -u
> 'jdbc:hive2://nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal:10000/;transportMode=binary;principal=hive/[email protected]'
> -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid
> URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'],
> 'timeout_kill_strategy': 2, 'timeout': 30, 'user': 'ambari-qa'}
> 2017-07-24 06:06:36,139 - Connection to
> nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal on port 10000 failed
> 2017-07-24 06:06:41,145 - Execute['/usr/bin/kinit -kt
> /etc/security/keytabs/smokeuser.headless.keytab [email protected]; ']
> {'user': 'ambari-qa'}
> 2017-07-24 06:06:41,942 - Execute['! beeline -u
> 'jdbc:hive2://nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal:10000/;transportMode=binary;principal=hive/[email protected]'
> -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid
> URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'],
> 'timeout_kill_strategy': 2, 'timeout': 30, 'user': 'ambari-qa'}
> 2017-07-24 06:07:13,388 - Connection to
> nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal on port 10000 failed
> 2017-07-24 06:07:18,394 - Execute['/usr/bin/kinit -kt
> /etc/security/keytabs/smokeuser.headless.keytab [email protected]; ']
> {'user': 'ambari-qa'}
> 2017-07-24 06:07:18,764 - Execute['! beeline -u
> 'jdbc:hive2://nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal:10000/;transportMode=binary;principal=hive/[email protected]'
> -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid
> URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'],
> 'timeout_kill_strategy': 2, 'timeout': 30, 'user': 'ambari-qa'}
> 2017-07-24 06:08:06,923 - Connection to
> nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal on port 10000 failed
> 2017-07-24 06:08:11,930 - Execute['/usr/bin/kinit -kt
> /etc/security/keytabs/smokeuser.headless.keytab [email protected]; ']
> {'user': 'ambari-qa'}
> 2017-07-24 06:08:12,929 - Execute['! beeline -u
> 'jdbc:hive2://nats11-46-vdms-erm16tofnsec-s11-11.openstacklocal:10000/;transportMode=binary;principal=hive/[email protected]'
> -e '' 2>&1| awk '{print}'|grep -i -e 'Connection refused' -e 'Invalid
> URL''] {'path': ['/bin/', '/usr/bin/', '/usr/lib/hive/bin/', '/usr/sbin/'],
> 'timeout_kill_strategy': 2, 'timeout': 30, 'user': 'ambari-qa'}
>
> Command failed after 1 tries
>
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)