Re: [okeanos-dev] Re: KVM versions, machine types and failed migrations

2013-01-10 Thread Daniel P. Berrange
On Wed, Jan 09, 2013 at 03:27:53PM +0200, Vangelis Koukis wrote:
 On Wed, Jan 09, 2013 at 01:10:45pm +, Daniel P. Berrange wrote:
  When doing migration, the fundamental requirement is that the guest
  OS visible machine ABI must not change. Thus there are three key
  things to take care of when launching QEMU on the migration target
  host.
  
   - The device PCI/USB addresses must be identical to the source
   - The machine type must be identical to the source
   - The CPU model must be identical to the source
  
 
 Thanks for the detailed list of requirements, we'll take it into account
 for the relevant Ganeti patch.
 
  If you don't follow those requirements, either QEMU or the guest OS
  or both will crash  burn during migration  you get to keep both
  pieces :-)
  
 
 My point is, are these requirements left up to the caller of kvm
 -incoming to satisfy? Since the migration will most probably break,
 wouldn't it be best for QEMU to detect this and complain loudly, instead
 of continuing with the migration, failing silently and destroying the
 VM?
 
 Sure there could be some yes, do it, I know it is going to break
 option, which will make QEMU proceed with the migration. However, in 99%
 of the cases this is just user error, e.g. the user has upgraded the
 version on the other end and has not specified -M explicitly. It would
 be best if QEMU was able to detect and warn the user about what is going
 to happen, because it does lead to the VM dying.

What you describe is certainly desirable, but it is quite hard to achieve
with current QEMU. Much of the work with moving to the new QEMU object
model  configuration descriptions has been motivated by a desire to
enable improvements migration handling. As you suggest, the goal is that
the source QEMU be able to send a complete  reliable hardware description
to the destination QEMU during migration.It is getting closer, but we're
not there yet.

Regards,
Daniel
-- 
|: http://berrange.com  -o-http://www.flickr.com/photos/dberrange/ :|
|: http://libvirt.org  -o- http://virt-manager.org :|
|: http://autobuild.org   -o- http://search.cpan.org/~danberr/ :|
|: http://entangle-photo.org   -o-   http://live.gnome.org/gtk-vnc :|
--
To unsubscribe from this list: send the line unsubscribe kvm in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


KVM versions, machine types and failed migrations

2013-01-09 Thread Vangelis Koukis
Hello,

I'd like to ask a few questions about the way migrations work in KVM
among different emulated machine types and different versions of the
qemu-kvm package. I am sending to both the kvm@ and qemu-devel@ lists,
please redirect me if I was wrong in doing so.

In a nutshell: while trying to live-migrate a VM on ~okeanos [1], we
see VM migrations fail silently if going from kvm 1.0 to kvm 1.1.
The source VM is frozen, info migrate on the source monitor reports
success, but the VM is dead upon arrival on the destination process.
Please see [3] for the exact package versions for qemu-kvm we have
tested with.

Migration works if the destination kvm has been started with the same
machine type as the source VM, e.g., using -M pc-1.0 specifically on
the destination, when migrating a pc-1.0 machine from kvm 1.0 to
kvm 1.1.

How does the machine type specified with -M work in the case of
migrations? Are migrations expected to fail if the machine type is
different between source and destination process? If yes, shouldn't KVM be
able to detect this and abort the migration instead of failing silently?

For every (src, dst) pair of package version / machine type,
we saw the following:

 dst:   1.0/pc-0.12  1.0/pc-1.0  1.1/pc-1.1
src:
1.0/pc-0.12 ok   ok  fails silently
1.0/pc-1.0  fails silently   ok  fails silently
1.1/pc-1.1  fails silently   fails silently  ok

Machine types pc-0.12 and pc-1.0 were run with qemu-kvm package version
1.0, machine type pc-1.1 with qemu-kvm package version 1.1.

Also, the migration 1.0/pc-0.12 to 1.0/pc-1.0 seems to work... How can
it work or even be allowed, given that the guest finds itself running on
a different hardware configuration after the migration?

Regarding different package versions of qemu-kvm, it seems migrations do
not work from source 0.12.5 to any other version *even* if -M pc-0.12 is
specified at the incoming KVM process. For versions = 1.0 everything
works provided the machine type on the destination is the same as on the
source.

Our goal is to patch Ganeti [2] so that it sets the destination machine
type to that of the source specifically, ensuring migrations work
seamlessly after a KVM upgrade. Is there a way to retrieve the machine
type of a running KVM process through a monitor command?

Thank you,
Vangelis.

[1] ~okeanos IaaS: http://okeanos.io
[2] Ganeti: https://code.google.com/p/ganeti/
[3] We tested on Debian Squeeze with package versions:

kvm 0.12: qemu-kvm_0.12.5+dfsg-5+squeeze8_amd64.deb
kvm 1.0: qemu-kvm_1.0+dfsg-8~bpo60+1_amd64.deb
kvm 1.1: qemu-kvm_1.1.2+dfsg-2~bpo60+1_amd64.deb

-- 
Vangelis Koukis
vkou...@grnet.gr
OpenPGP public key ID:
pub  1024D/1D038E97 2003-07-13 Vangelis Koukis vkou...@cslab.ece.ntua.gr
 Key fingerprint = C5CD E02E 2C78 7C10 8A00  53D8 FBFC 3799 1D03 8E97

Only those who will risk going too far
can possibly find out how far one can go.
-- T.S. Eliot


signature.asc
Description: Digital signature


Re: KVM versions, machine types and failed migrations

2013-01-09 Thread Daniel P. Berrange
On Wed, Jan 09, 2013 at 02:23:50PM +0200, Vangelis Koukis wrote:
 Hello,
 
 I'd like to ask a few questions about the way migrations work in KVM
 among different emulated machine types and different versions of the
 qemu-kvm package. I am sending to both the kvm@ and qemu-devel@ lists,
 please redirect me if I was wrong in doing so.
 
 In a nutshell: while trying to live-migrate a VM on ~okeanos [1], we
 see VM migrations fail silently if going from kvm 1.0 to kvm 1.1.
 The source VM is frozen, info migrate on the source monitor reports
 success, but the VM is dead upon arrival on the destination process.
 Please see [3] for the exact package versions for qemu-kvm we have
 tested with.
 
 Migration works if the destination kvm has been started with the same
 machine type as the source VM, e.g., using -M pc-1.0 specifically on
 the destination, when migrating a pc-1.0 machine from kvm 1.0 to
 kvm 1.1.
 
 How does the machine type specified with -M work in the case of
 migrations? Are migrations expected to fail if the machine type is
 different between source and destination process? If yes, shouldn't KVM be
 able to detect this and abort the migration instead of failing silently?

When doing migration, the fundamental requirement is that the guest
OS visible machine ABI must not change. Thus there are three key
things to take care of when launching QEMU on the migration target
host.

 - The device PCI/USB addresses must be identical to the source
 - The machine type must be identical to the source
 - The CPU model must be identical to the source

If you don't follow those requirements, either QEMU or the guest OS
or both will crash  burn during migration  you get to keep both
pieces :-)

 Regarding different package versions of qemu-kvm, it seems migrations do
 not work from source 0.12.5 to any other version *even* if -M pc-0.12 is
 specified at the incoming KVM process. For versions = 1.0 everything
 works provided the machine type on the destination is the same as on the
 source.

Some older versions of QEMU were buggy causing the machine type to
not correctly preserve ABI.

 Our goal is to patch Ganeti [2] so that it sets the destination machine
 type to that of the source specifically, ensuring migrations work
 seamlessly after a KVM upgrade. Is there a way to retrieve the machine
 type of a running KVM process through a monitor command?

IIRC there is not a monitor command for this. The general approach
to dealing with migration stability should be to launch QEMU with a
canonical hardware configuration. This means explicitly setting a machine
type, CPU model and PCI/USB devices addresses upfront. NB you should not
use 'pc' as a machine type - if you query the list of machine types from
QEMU, it will tell you what 'pc' corresponds to (pc-1.2) and then use the
versioned type so you have a known machine type.

Regards,
Daniel
-- 
|: http://berrange.com  -o-http://www.flickr.com/photos/dberrange/ :|
|: http://libvirt.org  -o- http://virt-manager.org :|
|: http://autobuild.org   -o- http://search.cpan.org/~danberr/ :|
|: http://entangle-photo.org   -o-   http://live.gnome.org/gtk-vnc :|
--
To unsubscribe from this list: send the line unsubscribe kvm in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [okeanos-dev] Re: KVM versions, machine types and failed migrations

2013-01-09 Thread Vangelis Koukis
On Wed, Jan 09, 2013 at 01:10:45pm +, Daniel P. Berrange wrote:
 When doing migration, the fundamental requirement is that the guest
 OS visible machine ABI must not change. Thus there are three key
 things to take care of when launching QEMU on the migration target
 host.
 
  - The device PCI/USB addresses must be identical to the source
  - The machine type must be identical to the source
  - The CPU model must be identical to the source
 

Thanks for the detailed list of requirements, we'll take it into account
for the relevant Ganeti patch.

 If you don't follow those requirements, either QEMU or the guest OS
 or both will crash  burn during migration  you get to keep both
 pieces :-)
 

My point is, are these requirements left up to the caller of kvm
-incoming to satisfy? Since the migration will most probably break,
wouldn't it be best for QEMU to detect this and complain loudly, instead
of continuing with the migration, failing silently and destroying the
VM?

Sure there could be some yes, do it, I know it is going to break
option, which will make QEMU proceed with the migration. However, in 99%
of the cases this is just user error, e.g. the user has upgraded the
version on the other end and has not specified -M explicitly. It would
be best if QEMU was able to detect and warn the user about what is going
to happen, because it does lead to the VM dying.

  Regarding different package versions of qemu-kvm, it seems migrations do
  not work from source 0.12.5 to any other version *even* if -M pc-0.12 is
  specified at the incoming KVM process. For versions = 1.0 everything
  works provided the machine type on the destination is the same as on the
  source.
 
 Some older versions of QEMU were buggy causing the machine type to
 not correctly preserve ABI.
 
  Our goal is to patch Ganeti [2] so that it sets the destination machine
  type to that of the source specifically, ensuring migrations work
  seamlessly after a KVM upgrade. Is there a way to retrieve the machine
  type of a running KVM process through a monitor command?
 
 IIRC there is not a monitor command for this.

 The general approach
 to dealing with migration stability should be to launch QEMU with a
 canonical hardware configuration. This means explicitly setting a machine
 type, CPU model and PCI/USB devices addresses upfront. NB you should not
 use 'pc' as a machine type - if you query the list of machine types from
 QEMU, it will tell you what 'pc' corresponds to (pc-1.2) and then use the
 versioned type so you have a known machine type.
 

This is exactly what we're trying to do: specify -M explicitly in the
kvm command line, instead of letting the default pc machine type
change arbitrarily whenever the qemu-kvm package gets upgraded.

Thanks again,
Vangelis.

-- 
Vangelis Koukis
vkou...@grnet.gr
OpenPGP public key ID:
pub  1024D/1D038E97 2003-07-13 Vangelis Koukis vkou...@cslab.ece.ntua.gr
 Key fingerprint = C5CD E02E 2C78 7C10 8A00  53D8 FBFC 3799 1D03 8E97

Only those who will risk going too far
can possibly find out how far one can go.
-- T.S. Eliot


signature.asc
Description: Digital signature