[ 
https://issues.apache.org/jira/browse/AMBARI-17198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmytro Grinenko updated AMBARI-17198:
-------------------------------------
    Attachment:     (was: AMBARI-17198.patch.1)

> Failure in mahout package installation upon retry is not correctly reported 
> causing EU to fail
> ----------------------------------------------------------------------------------------------
>
>                 Key: AMBARI-17198
>                 URL: https://issues.apache.org/jira/browse/AMBARI-17198
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-server
>    Affects Versions: 2.4.0
>            Reporter: Dmytro Grinenko
>            Priority: Critical
>             Fix For: 2.4.0
>
>         Attachments: AMBARI-17198.patch, AMBARI-17198.patch.1
>
>
> *Steps*
> 1. With Ambari 2.2.2 build, deploy HDP 2.4.0.0 cluster
> 2. Register bits for HDP-2.4.2.0-195 and start Installation of packages
> 3. Observed an error in first attempt of package install on one of the host
> {code}
> stderr:   /var/lib/ambari-agent/data/errors-560.txt
> No handlers could be found for logger "root"
> 2016-04-14 01:22:09,756 - Caught signal 15, will handle it gracefully. 
> Compute the actual version if possible before exiting.
> 2016-04-14 01:22:09,785 - Package Manager failed to install packages. Error: 
> (4, 'Interrupted system call')
> Traceback (most recent call last):
>   File 
> "/var/lib/ambari-agent/cache/custom_actions/scripts/install_packages.py", 
> line 386, in install_packages
>     retry_count=agent_stack_retry_count)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", 
> line 154, in __init__
>     self.env.run()
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", 
> line 160, in run
>     self.run_action(resource, action)
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", 
> line 124, in run_action
>     provider_action()
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py",
>  line 54, in action_install
>     self.install_package(package_name, self.resource.use_repos, 
> self.resource.skip_repos)
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/zypper.py",
>  line 45, in install_package
>     active_base_repos = self.get_active_base_repos()
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/zypper.py",
>  line 73, in get_active_base_repos
>     (code, output) = self.call_with_retries(LIST_ACTIVE_REPOS_CMD)
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py",
>  line 80, in call_with_retries
>     return self._call_with_retries(cmd, is_checked=False, **kwargs)
>   File 
> "/usr/lib/python2.6/site-packages/resource_management/core/providers/package/__init__.py",
>  line 91, in _call_with_retries
>     code, out = func(cmd, **kwargs)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", 
> line 70, in inner
>     result = function(command, **kwargs)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", 
> line 105, in call
>     tries=tries, try_sleep=try_sleep)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", 
> line 140, in _call_wrapper
>     result = _call(command, **kwargs_copy)
>   File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", 
> line 240, in _call
>     ready, _, _ = select.select(read_set, [], [], 1)
> error: (4, 'Interrupted system call')
>  Python script has been killed due to timeout after waiting 1800 secs



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to