[ https://issues.apache.org/jira/browse/VCL-727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13777722#comment-13777722 ]
Andy Kurth commented on VCL-727:
--------------------------------
Author: arkurth
Date: Thu Sep 19 15:59:15 2013
New Revision: 1524774
URL: http://svn.apache.org/r1524774
Log:
VCL-727
Added additional looping to xCAT.pm::_rpower which makes additional attempts if
a timeout error is encountered. Added ability to control number of rpower
attempts via entries in the variable table: xcat|rpower_error_limit,
xcat|timeout_error_limit
Modified:
vcl/trunk/managementnode/lib/VCL/Module/Provisioning/xCAT.pm
vcl/trunk/mysql/vcl.sql
Modified: vcl/trunk/managementnode/lib/VCL/Module/Provisioning/xCAT.pm
URL: http://svn.apache.org/viewvc/vcl/trunk/managementnode/lib/VCL/Module/Provisioning/xCAT.pm?rev=1524774&r1=1524773&r2=1524774&view=diff
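
The diff itself is not included in this message. As a rough illustration of the
behavior the log describes (separate limits for timeout errors and for other
errors), here is a minimal sketch in Python. It is not the Perl code from
xCAT.pm: the function name, the parameter names, the default limits, and the
success check are all assumptions made for the example. Only the "Error:
Timeout" text and the idea of two limits come from the log and the issue
description.

```python
import re
import time
from typing import Callable, Tuple


def run_with_retries(
    command: Callable[[], Tuple[int, str]],
    error_limit: int = 3,          # hypothetical default, cf. xcat|rpower_error_limit
    timeout_error_limit: int = 5,  # hypothetical default, cf. xcat|timeout_error_limit
    delay_seconds: float = 0.0,    # hypothetical pause between attempts
) -> bool:
    """Call `command` until it succeeds or one of the two limits is reached.

    `command` is any callable returning (exit_status, output). Timeout errors
    and all other errors are counted separately, so timeouts can be given
    more leeway than errors that retrying is unlikely to fix.
    """
    error_count = 0
    timeout_count = 0
    while error_count < error_limit and timeout_count < timeout_error_limit:
        exit_status, output = command()

        # Assumed success check: zero exit status and no "Error:" in the output.
        if exit_status == 0 and "Error:" not in output:
            return True

        # Count "Error: Timeout" separately from every other failure.
        if re.search(r"Error:\s*Timeout", output, re.IGNORECASE):
            timeout_count += 1
        else:
            error_count += 1

        if delay_seconds > 0:
            time.sleep(delay_seconds)
    return False
```

With a larger timeout_error_limit than error_limit, a command that keeps
timing out under concurrent load gets more attempts, while a command that
fails for some other reason still gives up quickly.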
> xCAT commands may timeout - make multiple attempts
> --------------------------------------------------
>
> Key: VCL-727
> URL: https://issues.apache.org/jira/browse/VCL-727
> Project: VCL
> Issue Type: Improvement
> Components: vcld (backend)
> Affects Versions: 2.3.2
> Reporter: Andy Kurth
> Assignee: Andy Kurth
> Priority: Minor
> Fix For: 2.3.3
>
>
> xCAT commands run from the management node may occasionally display "Error:
> Timeout". This usually occurs when multiple nodes are being loaded and
> several xCAT commands are issued concurrently.
> Retrying the command usually succeeds. The current xCAT code already has
> loops that detect errors and make multiple attempts, but these are sometimes
> not sufficient. It would be beneficial to differentiate timeout errors from
> other errors. The current loops are sufficient for non-timeout errors: if
> these occur multiple times in a row, it usually means there is a problem
> which won't be fixed by retrying. If a timeout error occurs, additional
> attempts should be made and more leeway given.
> This will help prevent some failed new and reload reservations.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira