[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-9509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15629781#comment-15629781
 ] 

ASF subversion and git services commented on CLOUDSTACK-9509:
-------------------------------------------------------------

Commit 32a397aa9357c409de7561a8c68a469c3bf3c52a in cloudstack's branch 
refs/heads/master from [~rohit.ya...@shapeblue.com]
[ https://git-wip-us.apache.org/repos/asf?p=cloudstack.git;h=32a397a ]

CLOUDSTACK-9509: Host Connects Without Storage

KVM hosts on shared storage failure was accepted by mgmt server with the
host state as Up, even though there was no primary/shared storage available on
it. This patch offers a quick fix by throwing an exception in the storage 
monitor
which connects storage pool on host. The failure is trapped by agent manager
that disconnects the agent without any investigation.

Based on Lab tests, KVM agent may take upto 2 minutes to attempt NFS mount when
the storage is inaccessible (firewalled, or shutdown) before returning back with
an error. It is safe to assume that this won't add pressure on mgmt server due 
to
several reconnection attempts, and KVM agent would retry reconnection every 2
minutes.

For such KVM hosts, where failure happens due to storage issues; they will be
briefly put in Alert state but will be mostly be in Connecting state during 
which
the KVM host attempts to mount/reconfigure NFS storage pool.

Signed-off-by: Rohit Yadav <rohit.ya...@shapeblue.com>


> KVM Hosts connect with no storage
> ---------------------------------
>
>                 Key: CLOUDSTACK-9509
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-9509
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the 
> default.) 
>            Reporter: Rohit Yadav
>            Assignee: Rohit Yadav
>            Priority: Critical
>             Fix For: 4.8.1, 4.10.0.0, 4.9.1.0
>
>
> KVM hosts on shared storage failure was accepted by mgmt server with the
> host state as Up, even though there was no primary/shared storage available 
> on it. They fail the ModifyStoragePoolCommand, but the management server 
> continues on with adding SSH keys and marking them as up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to