[ https://issues.apache.org/jira/browse/AMBARI-17198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dmytro Grinenko updated AMBARI-17198: ------------------------------------- Attachment: AMBARI-17198.patch.1 > Failure in mahout package installation upon retry is not correctly reported > causing EU to fail > ---------------------------------------------------------------------------------------------- > > Key: AMBARI-17198 > URL: https://issues.apache.org/jira/browse/AMBARI-17198 > Project: Ambari > Issue Type: Bug > Components: ambari-server > Affects Versions: 2.4.0 > Reporter: Dmytro Grinenko > Priority: Critical > Fix For: 2.4.0 > > Attachments: AMBARI-17198.patch, AMBARI-17198.patch.1 > > > *Steps* > 1. With Ambari 2.2.2 build, deploy HDP 2.4.0.0 cluster > 2. Register bits for HDP-2.4.2.0-195 and start Installation of packages > 3. Observed an error in first attempt of package install on one of the host > {code} > stderr: /var/lib/ambari-agent/data/errors-560.txt > No handlers could be found for logger "root" > 2016-04-14 01:22:09,756 - Caught signal 15, will handle it gracefully. > Compute the actual version if possible before exiting. > 2016-04-14 01:22:09,785 - Package Manager failed to install packages. Error: > (4, 'Interrupted system call') > Traceback (most recent call last): > File > "/var/lib/ambari-agent/cache/custom_actions/scripts/install_packages.py", > line 386, in install_packages > retry_count=agent_stack_retry_count) > File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", > line 154, in __init__ > self.env.run() > File > "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", > line 160, in run > self.run_action(resource, action) > File > "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", > line 124, in run_action > provider_action() > File > "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py", > line 54, in action_install > self.install_package(package_name, self.resource.use_repos, > self.resource.skip_repos) > File > "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/zypper.py", > line 45, in install_package > active_base_repos = self.get_active_base_repos() > File > "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/zypper.py", > line 73, in get_active_base_repos > (code, output) = self.call_with_retries(LIST_ACTIVE_REPOS_CMD) > File > "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py", > line 80, in call_with_retries > return self._call_with_retries(cmd, is_checked=False, **kwargs) > File > "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py", > line 91, in _call_with_retries > code, out = func(cmd, **kwargs) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", > line 70, in inner > result = function(command, **kwargs) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", > line 105, in call > tries=tries, try_sleep=try_sleep) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", > line 140, in _call_wrapper > result = _call(command, **kwargs_copy) > File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", > line 240, in _call > ready, _, _ = select.select(read_set, [], [], 1) > error: (4, 'Interrupted system call') > Python script has been killed due to timeout after waiting 1800 secs -- This message was sent by Atlassian JIRA (v6.3.4#6332)