[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2016-02-09 Thread France (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15138908#comment-15138908
 ] 

France commented on CLOUDSTACK-3367:


:-/ Three years on, the same issue persists, and it is not the only one.
This is why we have given up on CS and are slowly migrating to Proxmox VE.

> When one primary storage fails, all XenServer hosts get rebooted, killing all 
> VMs, even those not on this primary storage.
> --
>
> Key: CLOUDSTACK-3367
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3367
> Project: CloudStack
> Issue Type: Bug
> Security Level: Public (Anyone can view this level - this is the default.)
> Components: Management Server, XenServer
> Affects Versions: 4.1.0, 4.2.0, 4.5.0, 4.3.1
> Environment: CentOS 6.3, XenServer 6.0.2 + all hotfixes, CloudStack 4.1.0
> Reporter: France
> Assignee: Abhinandan Prateek
> Fix For: Future
>
>
> As the title says: if only one of the primary storages fails, all XenServer 
> hosts get rebooted one by one. Because I have many primary storages, which 
> are/were running fine with other VMs, rebooting the XenServer hypervisor is 
> overkill. Please disable this, or implement just stopping/killing the VMs 
> running on that storage and trying to re-attach that storage only.
> The problem was reported on the mailing list, along with a workaround for 
> XenServer, so I'm not the only one hit by this "bug/feature". The workaround 
> for now is as follows:
> 1. Modify /opt/xensource/bin/xenheartbeat.sh on all your hosts, commenting 
> out the two entries which have "reboot -f"
> 2. Identify the PID of the script - pidof -x xenheartbeat.sh
> 3. Restart the script - kill <pid>
> 4. Force reconnect the host from the UI; the script will then re-launch on 
> reconnect
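
Steps 1 to 3 of the quoted workaround could be scripted roughly as below. This is only a sketch under assumptions: the function names are made up for illustration, the script path is the one quoted in the issue, and it assumes the lines to disable are exactly those containing "reboot -f". Step 4 (force reconnect from the UI) stays manual.

```shell
#!/bin/sh
# Sketch only; function names are hypothetical and not part of CloudStack or XenServer.

# Step 1: comment out every not-yet-commented line containing "reboot -f".
# The path defaults to the one quoted in the issue; pass another file to try it safely.
disable_forced_reboot() {
    script="${1:-/opt/xensource/bin/xenheartbeat.sh}"
    cp -p "$script" "$script.bak" || return 1    # keep a backup of the original
    # Skip lines that are already comments; prefix matching lines with "#".
    sed '/^[[:space:]]*#/!s/.*reboot -f.*/#&/' "$script.bak" > "$script"
}

# Steps 2 and 3: find the running script and kill it, as in the quoted steps.
restart_heartbeat() {
    pids=$(pidof -x xenheartbeat.sh) || return 0  # nothing running, nothing to do
    kill $pids                                    # unquoted on purpose: may be several PIDs
}
```

Running disable_forced_reboot against a copy of the script first and diffing it with the .bak file is a cheap way to check that only the intended lines changed. Note the concern raised elsewhere in this thread: with the reboot disabled, VMs on the failed storage are left in an inconsistent state.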



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2016-02-08 Thread Frank Louwers (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136794#comment-15136794
 ] 

Frank Louwers commented on CLOUDSTACK-3367:
---

Please note this is *not* XenServer-specific: KVM has the same issue!


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2016-02-08 Thread Paul Angus (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15138445#comment-15138445
 ] 

Paul Angus commented on CLOUDSTACK-3367:


OK. But the current solution is actually worse than the original problem.


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2016-02-08 Thread Paul Angus (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15137723#comment-15137723
 ] 

Paul Angus commented on CLOUDSTACK-3367:


[~abhi_shapeblue] heartbeat.sh reboots the host if it detects that ANY of the 
storage pools has gone offline.
By then it is too late to protect or shut down the VMs on that storage, as it 
is already unavailable.

Because the current behaviour restarts the host, any VMs running on 'still 
good' storage are shut down / reset. This is the problem.
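
To make the difference concrete, here is a toy sketch of the two decision rules. It is not the real heartbeat script: the function names and the input format (one "pool-name status" pair per line on stdin, with status "ok" or "failed") are assumptions made purely for illustration.

```shell
#!/bin/sh
# Toy model only; not taken from heartbeat.sh. Input format is hypothetical:
# one "pool-name status" pair per line on stdin, status is "ok" or "failed".

# Rule as described above: a single failed pool is enough to reboot the host.
host_action_any_failure() {
    if grep -q ' failed$'; then
        echo reboot-host
    else
        echo none
    fi
}

# Alternative starting point: list only the failed pools, so that any action
# could be limited to the VMs that actually depend on them.
failed_pools() {
    awk '$2 == "failed" { print $1 }'
}
```

With three pools of which one has failed, the first rule reboots the whole host, while the second only names the one pool that needs attention.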



 


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2016-02-07 Thread Paul Angus (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136411#comment-15136411
 ] 

Paul Angus commented on CLOUDSTACK-3367:


[~aprateek] Is it feasible to add a global setting which controls this 
behaviour?
While the current behaviour can be useful in some circumstances, there are 
quite a few where it is highly undesirable.
It would be good if cloud operators could have control over it.


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2016-02-07 Thread Abhinandan Prateek (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15136564#comment-15136564
 ] 

Abhinandan Prateek commented on CLOUDSTACK-3367:


[~paulangus] The reboot behaviour is there to save VMs that have a disk on the 
down primary storage. With later releases of Xen it is possible to shut down 
only the VMs that have a disk on the bad PS. It makes sense to just implement 
this feature now.
This will require the setting to be configurable on each host, and it will not 
be as simple to control from the MS.
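
As a rough illustration of "only the VMs that have a disk on the bad PS", the selection step could look like the sketch below. The function name and the input format ("vm-name storage-name" pairs, one per line on stdin) are hypothetical; on a real host the mapping would have to come from the hypervisor's own tooling, which is not shown here.

```shell
#!/bin/sh
# Sketch of the selection step only; nothing here talks to XenServer.
# Hypothetical input: "vm-name storage-name" pairs on stdin, one per disk,
# so a VM with several disks appears on several lines.

# Print each VM that has at least one disk on the given storage, once.
vms_on_storage() {
    awk -v bad="$1" '$2 == bad && !seen[$1]++ { print $1 }'
}
```

VMs that are not printed have no disk on the failed storage and would be left running; a VM with disks on both good and bad storage is still selected, since one of its disks is gone.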


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2015-07-14 Thread Keerthiraja (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14626340#comment-14626340
 ] 

Keerthiraja commented on CLOUDSTACK-3367:
-

Will this be taken care of in the current 4.5.2 release?


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2014-12-03 Thread France (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14232986#comment-14232986
 ] 

France commented on CLOUDSTACK-3367:


We will soon be in the third year since this critical bug was reported...
...and still no one cares that a failure of one primary storage, which happens 
to be non-redundant, hard reboots the WHOLE cloud.
Or is this no longer the case with new releases, and it has been fixed?


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2014-12-03 Thread John Kinsella (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14233185#comment-14233185
 ] 

John Kinsella commented on CLOUDSTACK-3367:
---

Folks - you'll have better success by finding one or more developers who are 
familiar with the XenServer integration code and asking for their help than 
just griping in a jira ticket that isn't assigned to anybody.

Search the dev list or commits for folks who have worked on the appropriate 
code in the past, or just start a thread on dev@ with a subject to attract the 
appropriate folks.

Griping, while it might be justified, doesn't tend to gain favor in open source 
projects where people are volunteering their time.


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2014-09-24 Thread France (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14146276#comment-14146276
 ] 

France commented on CLOUDSTACK-3367:


Anyone willing to pick this up?
It has been well over a year by now. :-(


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2014-03-05 Thread France (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13920762#comment-13920762
 ] 

France commented on CLOUDSTACK-3367:


Just an idea for whoever picks this issue up (if anyone at all :( ).
Before killing the whole hypervisor host, maybe live migrate instances whose 
private storage is still functioning to another hypervisor.


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2014-03-03 Thread France (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13917844#comment-13917844
 ] 

France commented on CLOUDSTACK-3367:


LOL, we're rolling backwards on the issue. We just lost assignee. :-)


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2014-03-03 Thread Koushik Das (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13917862#comment-13917862
 ] 

Koushik Das commented on CLOUDSTACK-3367:
-

Busy with some other stuff, so I unassigned it so that it can be picked up by 
anyone else willing to work on it. I had initially planned to fix it as part of 
CLOUDSTACK-5203 
(https://cwiki.apache.org/confluence/display/CLOUDSTACK/User+VM+HA+using+native+XS+HA+capabilities), 
but that proposal was dropped due to regressions 
(http://mail-archives.apache.org/mod_mbox/cloudstack-dev/201311.mbox/%3c20cf38cb4385ce4d9d1558d52a0fc05818f...@sjcpex01cl03.citrite.net%3E).


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2013-09-13 Thread Koushik Das (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13766393#comment-13766393
 ] 

Koushik Das commented on CLOUDSTACK-3367:
-

In XS 6.2 I simulated an SR failure by disconnecting it. I then tried shutting 
down the VM: the operation succeeded and the VM's power state was set to 
'halted'. It even allowed destroying the 'halted' VM.

So it looks like in XS 6.2 selective cleanup of VMs can be done instead of 
rebooting the host. But in older versions of XS there is no way to do selective 
cleanup.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2013-08-05 Thread Jiri Kanicky (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13729501#comment-13729501
 ] 

Jiri Kanicky commented on CLOUDSTACK-3367:
--

I agree with Alex in regards to XenServer. Disabling the reboot might cause 
problems on the XenServer, because any storage failure will leave stale 
mounts on the host.


[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2013-07-26 Thread Abhinandan Prateek (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13720725#comment-13720725
 ] 

Abhinandan Prateek commented on CLOUDSTACK-3367:


The heartbeat script should only be modified after we fix the issue of stopping 
the VMs that are running on that storage. Without a reboot these VMs will 
become inconsistent. I think it should be classified as an improvement.




[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2013-07-26 Thread France (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13720748#comment-13720748
 ] 

France commented on CLOUDSTACK-3367:


I agree with your path to a fix, but I disagree that not killing VMs which have 
no issues is an improvement or a new feature.
If you kill/destroy/stop something that's working normally when you should not 
have, it's definitely a bug. A major bug. :-)

> When one primary storage fails, all XenServer hosts get rebooted, killing all 
> VMs, even those not on this primary storage.
> --
>
> Key: CLOUDSTACK-3367
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3367
> Project: CloudStack
>  Issue Type: Improvement
>  Security Level: Public(Anyone can view this level - this is the 
> default.) 
>  Components: Management Server, XenServer
>Affects Versions: 4.1.0, 4.2.0
> Environment: CentOS 6.3, XenServer 6.0.2 + all hotfixes, CloudStack 
> 4.1.0
>Reporter: France
> Fix For: Future




[jira] [Commented] (CLOUDSTACK-3367) When one primary storage fails, all XenServer hosts get rebooted, killing all VMs, even those not on this primary storage.

2013-07-26 Thread Alex Huang (JIRA)

[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13721491#comment-13721491
 ] 

Alex Huang commented on CLOUDSTACK-3367:


Our experience in testing this with the 5.6 version of XenServer is that if we 
attempt to stop the VMs with XenServer while the storage is out, XenServer 
may not shut them down cleanly due to storage problems, leading to further 
problems down the road.  That is the reason why we chose to reboot instead of 
stopping the VMs.

You also have to consider how often this happens.  If a storage server needs to 
be taken out, it should be put in maintenance mode, which shuts down the VMs.  
In that case, it won't cause the host to reboot.  
Therefore, this can only happen with an unscheduled outage of the storage 
server.

We can add a few things to make this happen less often.

- Don't put a heartbeat on the storage until a VM using that storage is on a 
host.
- Remove the heartbeat on the storage when all VMs using that storage are done.
- Try to stop the VMs within a short interval and, if we can't stop them within 
that interval, reboot.
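
The last suggestion in the list above (try to stop, fall back to reboot) could be sketched roughly as follows. This is not CloudStack or XenServer code: the function name stop_or_reboot and the strings it prints are made up for illustration, the stop command is a placeholder for whatever actually stops the affected VMs, and it assumes the GNU coreutils timeout utility is available on the host, which may not be the case on older dom0 images.

```shell
# stop_or_reboot GRACE_SECONDS STOP_COMMAND [ARGS...]   (hypothetical helper)
# Runs STOP_COMMAND under a time limit and prints the action the host should
# take next. Nothing is rebooted here; the caller decides what to do.
stop_or_reboot() {
    grace_seconds=$1
    shift
    if timeout "$grace_seconds" "$@"; then
        echo "vms-stopped"   # clean stop within the interval: no reboot needed
    else
        echo "reboot"        # stop failed or timed out: fall back to reboot
    fi
}
```

The point of the design is that the reboot becomes the fallback rather than the first response: only a failed or hung stop attempt within the interval leads to it.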

> When one primary storage fails, all XenServer hosts get rebooted, killing all 
> VMs, even those not on this primary storage.
> --
>
> Key: CLOUDSTACK-3367
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-3367
> Project: CloudStack
>  Issue Type: Bug
>  Security Level: Public(Anyone can view this level - this is the 
> default.) 
>  Components: Management Server, XenServer
>Affects Versions: 4.1.0, 4.2.0
> Environment: CentOS 6.3, XenServer 6.0.2 + all hotfixes, CloudStack 
> 4.1.0
>Reporter: France
>Priority: Critical
> Fix For: Future

