[ 
https://issues.apache.org/jira/browse/VCL-727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13777722#comment-13777722
 ] 

Andy Kurth commented on VCL-727:
--------------------------------

Author: arkurth
Date: Thu Sep 19 15:59:15 2013
New Revision: 1524774

URL: http://svn.apache.org/r1524774
Log:

VCL-727
Added additional looping to xCAT.pm::_rpower which makes additional attempts if 
a timeout error is encountered. Added ability to control number of rpower 
attempts via entries in the variable table: xcat|rpower_error_limit, 
xcat|timeout_error_limit

Modified:
    vcl/trunk/managementnode/lib/VCL/Module/Provisioning/xCAT.pm
    vcl/trunk/mysql/vcl.sql

Modified: vcl/trunk/managementnode/lib/VCL/Module/Provisioning/xCAT.pm
URL: 
http://svn.apache.org/viewvc/vcl/trunk/managementnode/lib/VCL/Module/Provisioning/xCAT.pm?rev=1524774&r1=1524773&r2=1524774&view=diff
                
> xCAT commands may timeout - make multiple attempts
> --------------------------------------------------
>
>                 Key: VCL-727
>                 URL: https://issues.apache.org/jira/browse/VCL-727
>             Project: VCL
>          Issue Type: Improvement
>          Components: vcld (backend)
>    Affects Versions: 2.3.2
>            Reporter: Andy Kurth
>            Assignee: Andy Kurth
>            Priority: Minor
>             Fix For: 2.3.3
>
>
> xCAT commands run from the management node may occasionally display "Error: 
> Timeout".  This usually occurs when multiple nodes are being loaded and 
> several xCAT commands are issued concurrently.
> Making additional attempts to run the command again usually results in the 
> command succeeding.  There are loops to detect errors and make multiple 
> attempts in the current xCAT code but sometimes these are not sufficient.  It 
> would be beneficial to differentiate timeout errors from other errors.  If a 
> timeout error occurs, additional attempts should be made.  The current loops 
> are sufficient for non-timeout errors.  If these occur multiple times in a 
> row it usually means there is a problem which won't be fixed by trying over 
> and over again.  If timeout errors are encountered, more leeway should be 
> given.
> This will help prevent some failed new and reload reservations.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to