Do you have power management configured?
Was the "failed" host fenced/rebooted?
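
For context, the rule Doron describes further down in the thread (no restart of HA VMs elsewhere until the source host is known to be down) can be sketched roughly as below. This is only an illustration of the reasoning, not oVirt engine code; the `HostState` class, its fields, and the function name are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class HostState:
    # All field names are hypothetical, chosen for illustration only.
    power_management_configured: bool
    fenced: bool = False  # host was fenced/rebooted through power management
    manually_confirmed_rebooted: bool = False  # admin confirmed the reboot by hand


def safe_to_restart_ha_vms_elsewhere(host: HostState) -> bool:
    """Return True only when the failed host can no longer be running the VMs.

    Starting the same VM on a second host while the first might still be
    writing to the disks is the split-brain case to avoid.
    """
    # With power management, a successful fence proves the host is down.
    if host.power_management_configured and host.fenced:
        return True
    # Otherwise an administrator has to confirm the reboot manually.
    return host.manually_confirmed_rebooted
```

With no power management and no manual confirmation the function returns False, which matches the behaviour Koen observed: the VMs stay where they are.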


On Fri, Apr 4, 2014 at 2:21 PM, Koen Vanoppen <[email protected]> wrote:

> So... Is a fully automatic migration of the VM to another hypervisor
> possible in case the storage connection fails?
> How can we make this happen? Because for the moment, when we tested the
> situation, the VMs stayed in paused state.
> (Test situation:
>
>    - Unplug the 2 fibre cables from the hypervisor
>    - VMs go into paused state
>    - VMs stayed in paused state until the failure was resolved
>
> )
>
>
> They only returned when we restored the fiber connection to the
> Hypervisor...
>
> Kind Regards,
>
> Koen
>
>
>
>> 2014-04-03 16:53 GMT+02:00 Koen Vanoppen <[email protected]>:
>>
>> ---------- Forwarded message ----------
>>> From: "Doron Fediuck" <[email protected]>
>>> Date: Apr 3, 2014 4:51 PM
>>> Subject: Re: [Users] HA
>>> To: "Koen Vanoppen" <[email protected]>
>>> Cc: "Omer Frenkel" <[email protected]>, <[email protected]>,
>>> "Federico Simoncelli" <[email protected]>, "Allon Mureinik" <[email protected]>
>>>
>>>
>>>
>>> ----- Original Message -----
>>> > From: "Koen Vanoppen" <[email protected]>
>>> > To: "Omer Frenkel" <[email protected]>, [email protected]
>>> > Sent: Wednesday, April 2, 2014 4:17:36 PM
>>> > Subject: Re: [Users] HA
>>> >
>>> > Yes, indeed. I meant not-operational. Sorry.
>>> > So, if I understand this correctly: if we ever end up in a situation where
>>> > we lose both storage connections on our hypervisor, we will have to
>>> > manually restore the connections first?
>>> >
>>> > And thanks for the tip for speeding things up :-).
>>> >
>>> > Kind regards,
>>> >
>>> > Koen
>>> >
>>> >
>>> > 2014-04-02 15:14 GMT+02:00 Omer Frenkel < [email protected] > :
>>> >
>>> >
>>> >
>>> >
>>> >
>>> > ----- Original Message -----
>>> > > From: "Koen Vanoppen" < [email protected] >
>>> > > To: [email protected]
>>> > > Sent: Wednesday, April 2, 2014 4:07:19 PM
>>> > > Subject: [Users] HA
>>> > >
>>> > > Dear All,
>>> > >
>>> > > During our acceptance testing, we discovered something. (Document will
>>> > > follow).
>>> > > When we disable one fiber path, there is no problem: multipath finds its
>>> > > way and no pings are lost.
>>> > > BUT when we disabled both fiber paths (so one of the storage domains is
>>> > > gone on this host, but still available on the other host), the VMs go
>>> > > into paused mode... The engine chooses a new SPM (can we speed this up?),
>>> > > puts the host into non-responsive (can we speed this up? more important)
>>> > > and the VMs stay in paused mode... I would expect that they would be
>>> > > migrated (yes, HA is
>>> >
>>> > I guess you mean the host moves to not-operational (in contrast to
>>> > non-responsive)?
>>> > If so, the engine will not migrate VMs that are paused due to an I/O error,
>>> > because of the risk of data corruption.
>>> >
>>> > To speed this up, you can look at the storage domain monitoring timeout:
>>> > engine-config --get StorageDomainFailureTimeoutInMinutes
>>> >
>>> >
>>> > > enabled) to the other host and rebooted there... Any solution? We are
>>> > > still using oVirt 3.3.1, but we are planning an upgrade to 3.4 after the
>>> > > Easter holiday.
>>> > >
>>> > > Kind Regards,
>>> > >
>>> > > Koen
>>> > >
>>>
>>> Hi Koen,
>>> Resuming from paused due to I/O issues is supported (adding relevant
>>> folks).
>>> Regardless, if you did not define power management, you should manually
>>> confirm that the source host was rebooted in order for migration to proceed.
>>> Otherwise we risk a split-brain scenario.
>>>
>>> Doron
>>>
>>
>>
>
> _______________________________________________
> Users mailing list
> [email protected]
> http://lists.ovirt.org/mailman/listinfo/users
>
>
