On 07/26/2011 03:13 PM, Juan Quintela wrote:
Anthony Liguori <anth...@codemonkey.ws> wrote:
On 07/26/2011 07:07 AM, Juan Quintela wrote:
- Be able to describe those different features/versions. This is not the
difficult part; it can be subsections, optional fields, whatever.
The difficult part is _knowing_ which fields need to be in
each version. That again depends on the device, not migration.
- Be able to do forward/backward compatibility (which, without
communication between both sides, is basically impossible).
Hrm, I'm not sure I agree with these conclusions.
Management tools should do their best job to create two compatible
device models.
How? The only part that can have enough information is the "new" part
(either source or destination). And we are being very careful about not
allowing any communication/setting of what is on the other side.
I'll explain below.
- Send things on the wire (really this is the easy part; we can play
with it touching only migration functions).
We also need a way to future proof ourselves.
We have been very bad at this. Automatic checking is the only way that
I can think of.
I don't know what you mean by automatic checking.
We should have unit tests to check that (at least) the obvious migrations work.
Oh, 100% agree. In fact, I've posted patches :) But I wasn't happy
with the level of completeness of those tests and want to write better
tests which is part of my motivation in visiting this topic.
We have two things here: device level & protocol level.
Device level: very late to set anything.
Protocol level: we can set things here, but notice that only a few
things can be set here.
Once we have a protocol level feature bit, we can add device level
feature bits as a new feature.
This doesn't help; migration time is far too late to configure a device. We
need to configure it at creation time. It makes no sense to try to
migrate device foo with 4 bars and at migration time try to "push" it
into only 2 bars. Having it created with 2 bars in the first place is the
only sane solution.
I misunderstood what you were suggesting. For guest visible device
features, they must be configured at creation time. I'm in full agreement.
It's 100% mechanical and makes absolutely no logic change. It works
equally well with legacy and VMstate migration handlers.
3) Add a Visitor class that operates on QEMUFile.
At this stage, we can migrate to data structures. That means we can
migrate to QEMUFile, QObjects, or JSON. We could change the protocol
at this stage to something that was still binary but had section sizes
and things of that nature.
That was the whole point of vmstate.
The problem with vmstate is that it's an all or nothing thing and the
conversion isn't programmatic.
This is the whole point. We are being declarative, and we create a
mechanism for how to visit all the nodes. What we do in each node is not
VMState business. VMState only defines the nodes, and which ones belong
to each version.
Right. Thinking more after the call, I think this may be a better way
to explain what I'm proposing.
With VMState, we provide a declarative description of each device's
state. Because it's declarative, some things end up being tough to
describe like variable sized arrays and complex data structures. You've
worked through a lot of these, but this is fundamentally what makes this
approach difficult to complete.
At the end of VMState conversion, we have a declaration of how to read
the current state of the device tree. We can write a function that
takes all of the VMState descriptions and builds something from those
descriptions.
But right now, what we actually have is a routine that takes a VMState
data description, and then calls a marshalling function. In essence,
the data description gets interpreted to an imperative serialization
mechanism.
I'm suggesting that instead of trying to eliminate the imperativeness
(which will be hard since we have a lot of hooks in various places), we
should embrace the imperativeness. Instead of marshalling to a
QEMUFile, we marshal to a Visitor, Visitor being an abstraction that can
marshal to arbitrary formats/objects.
So we never actually walk the VMState tables to do anything. The
unconverted, purely imperative routines we simply change to marshal to
a Visitor instead of a QEMUFile.
What this gives us is a way to achieve the same level of abstraction
that VMState would give us for almost no work at all. That
fundamentally lets us take the next step in "fixing" migration.
device with some features -> migration -> device with other features,
and it works. This would mean that "migration" does magic, and that is
never going to work.
Until now, this kind of worked because we only supported migration from
old -> new, or the same version. Migration from old -> new can never
have new features. But for new -> old to work, we need a way to
disable the new features. That is completely independent of migration.
At startup time, not dynamically. And we have this; that's what -M pc-X
is about.
It doesn't work.
Here's how I think we can fix this.
We have two concepts today, the machine and devices. Not all devices
can be created by an end user as some are implied by the machine (this
is qdev.no_user). Since not everything is directly created by the user,
there is no easy way to basically do a dump of the device model, then
feed that back into QEMU for recreation.
We do compatibility by using global properties for the different
machines but this is a tough proposition to get right as the granularity
is pretty poor. I can't change a property of a particular device
created by the machine without changing it universally.
With an improved qdev (which I think is QOM, but for now, just ignore
that), we would be able to do the following:
1) create a device, that creates other devices as children of itself
*without* those children being under a bus hierarchy.
2) eliminate the notion of machines altogether, and instead replace
machines with a chipset, SoC device, or whatever logical device
that basically equates to what the machine logic does today.
The pc machine code is basically the i440fx. You could take everything
that it does, call it an i440fx object, and make "machine" properties
properties of the i440fx. That makes what we think of as machine
creation identical to device creation.
3) eliminate anonymous devices, implicit bus assignment, and all of the
other features of qdev that prevent the device model from being
described in a stable fashion.
'-device-add e1000,id=foo' is ambiguous, as is '-net nic,model=virtio
-net nic,model=virtio'.
The rules about how we find the bus and location of e1000 in the device
model today are arbitrary and difficult to introspect. The result is
that what you use to create a device model becomes wildly different than
what you would use to recreate a device model. There's really no way to
programmatically discover this today either. qdev doesn't return the
property's value at construction time but rather the current value.
That's not necessarily the value you want to use to recreate the device.
That's not to say that we shouldn't have friendly interfaces that do
automatic PCI bus assignment. But that has to live a level
higher up than where it lives today in order to create stable device trees.
The rules in QOM are meant to solve these problems. They basically are:
a) All devices must have a unique name at the time of creation making
stable device addressing guaranteed.
b) All relationships between devices are expressed as connections
between plugs and sockets. There are no exceptions here. The
implication is that you never need to use code to recreate a device
model, you can always dump the device model and recreate it via QMP
commands.
c) All device properties are settable after creation time. This might
not seem like a big deal, but in order to support composition, the act
of instantiating a device such as the PIIX, which creates more devices
like a UART, requires that you set the UART's construction properties
after creation of the PIIX. Without an explicit "realize" state
where construction properties have been set, this problem is incredibly
difficult to solve.
qdev cannot satisfy these requirements as it sits today. Maybe there's
a way to incrementally evolve qdev into QOM, I haven't really thought it
through yet.
But the end goal is pretty clear. We should be able to do (qemu)
dump_device_model > foo.cfg in the source and then (qemu)
import_device_model foo.cfg in the destination right before the final
stage of migration. And this should be part of the migration protocol.
This would make migration with hotplug work, along with scores of other
things. This is another reason why a unified model makes sense, just as
you want to dump the device tree, you want to be able to enumerate the
backends to make sure that identically named backends exist on the
destination. Doing that in a single operation is a lot easier than
doing it 10 different ways.
Regards,
Anthony Liguori