Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

Anthony Liguori Sun, 27 Feb 2011 06:00:27 -0800

On 02/27/2011 03:10 AM, Avi Kivity wrote:

On 02/24/2011 07:58 PM, Anthony Liguori wrote:
If you move the cdrom to a different IDE channel, you have to update the stateful non-config file.
Whereas if you do

   $ qemu-img create -f cd-tray -b ~/foo.img ~/foo-media-tray.img
   $ qemu -cdrom ~/foo-media-tray.img

the cd-rom tray state will be tracked in the image file.
Yeah, but how do you move it?
There is no need to move the file at all. Simply point the new drive at the media tray.

No, I was asking, how do you move the cdrom to a different IDE channel? Are you using QMP? Are you changing the command line arguments?

If you do a remove/add through QMP, then the config file will reflect things just fine.
If all access to the state file is through QMP then it becomes more palatable. A bit on that later.

As I think I've mentioned before, I hadn't really thought about an opaque state file but I'm not necessarily opposed to it. I don't see an obvious advantage to making it opaque but I agree it should be accessible via QMP.

If you want to do it outside of QEMU, then you can just ignore the config file and manage all of the state yourself. But it's never going to work as well (it will be racy) and you're pushing a tremendous amount of knowledge that ultimately belongs in QEMU (what state needs to persist) to something that isn't QEMU which means it's probably not going to be done correctly.
I know you're a big fan of the omnipotent management tool but my experience has been that we need to help the management tooling folks more by expecting less from them.
I thought that's what I'm doing by separating the state out. It's easy for management to assemble configuration from their database and convert it into a centralized representation (like a qemu command line). It's a lot harder to disassemble a central state representation and move it back to the database.
Using QMP is better than directly accessing the state file since qemu does the disassembly for you (provided the command references the device using its normal path, not some random key). The file just becomes a way to survive a crash, and all management needs to know about is to make it available and back it up. But it means that everything must be done via QMP, including assembly of the machine, otherwise the state file can become stale.
Separating the state out to the device is even easier, since management is already expected to take care of disk images. All that's needed is to create the media tray image once, then you can forget about it completely.

Except that instead of having one state file, we might have a dozen additional "device state" files.

Again the question is who is the authoritative source of the configuration. Is it the management tool or is it qemu?
QEMU. No question about it. At any point in time, we are the authoritative source of what the guest's configuration is. There's no doubt about it. A management tool can try to keep up with us, but ultimately we are the only ones that know for sure.
We have all of this information internally. Just persisting it is not a major architectural change. It's something we should have been doing (arguably) from the very beginning.
That's a huge divergence from how management tools are written.

This is one of the reasons why management tooling around QEMU needs quite a bit of improving.

There is simply no way a management tool can do a good job of being an authoritative source of configuration. The races we're discussing are a good example of why.

But beyond those races, QEMU is the only entity that knows with certainty what bits of information are important to persist in order to preserve a guest across shutdown/restart. The fact that we've punted this problem for so long has only ensured that management tools are either intrinsically broken or only support the most minimal subset of functionality we actually support.

Currently they contain the required guest configuration, a representation of what's the current live configuration, and they issue monitor commands to move the live configuration towards the required configuration (or just generate a qemu command line). What you're describing is completely different, I'm not even sure what it is.
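
The reconcile loop described in the paragraph above, where a management tool holds the required configuration plus its own view of the live configuration and issues monitor commands to close the gap, might look roughly like the sketch below. This is only an illustration under stated assumptions, not how any particular tool works: `plan_commands`, the dictionary layout, and the device id `cd0` are hypothetical. `device_add` and `device_del` are existing QMP commands, but the arguments shown are illustrative, and whether a given device can be hot-unplugged at all is outside the sketch.

```python
import json


def plan_commands(required, live):
    """Return QMP-style commands that move `live` towards `required`.

    Both arguments are hypothetical dicts mapping a device id to a dict
    of device properties (an assumed layout, not a QEMU data structure).
    """
    commands = []
    # Remove devices that are no longer wanted or whose properties changed.
    for dev_id in sorted(live):
        if required.get(dev_id) != live[dev_id]:
            commands.append({"execute": "device_del",
                             "arguments": {"id": dev_id}})
    # Add devices that are missing or were just removed for re-creation.
    for dev_id in sorted(required):
        if live.get(dev_id) != required[dev_id]:
            arguments = {"id": dev_id}
            arguments.update(required[dev_id])
            commands.append({"execute": "device_add",
                             "arguments": arguments})
    return commands


# Illustrative case: move the cdrom to a different IDE channel.
required = {"cd0": {"driver": "ide-cd", "bus": "ide.1"}}
live = {"cd0": {"driver": "ide-cd", "bus": "ide.0"}}
for command in plan_commands(required, live):
    print(json.dumps(command))
```

Note that the plan is computed from the tool's copy of the live configuration. If the guest changes something between the moment that copy was taken and the moment the commands are issued, the plan is already stale, which is the kind of race being discussed in this thread.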

Management tools shouldn't have to think about how the monitor commands they issue impact the invocation options of QEMU.

The management tool already has to keep track of (the optional parts of) the guest device tree. It cannot start reading the stateful non-config file at random points in time. So all that is left is the guest controlled portions of the device tree, which are pretty rare, and random events like live-copy migration. I think that introducing a new authoritative source of information will create a lot of problems.
QEMU has always been the authoritative source. Nothing new has been introduced. We never persisted the machine's configuration which meant management tools had to try to aggressively keep up with us which is intrinsically error prone. Fixing this will only improve existing management tools.
If you look at management tools, they believe they are the authoritative source of configuration information (not guest state, which is more or less ignored).


It's because we've given them no other option.

Right, but we should make it easy, not hard.
Yeah, I fail to see how this makes it hard. We conveniently are saying, hey, this is all the state that needs to be persisted. We'll persist it for you if you want, otherwise, we'll expose it in a central location.
The state-in-a-file is just a blob. Don't expect the tool to parse it and reassociate the various bits to its own representation. Exposing it via QMP commands is a lot better though.

I don't really see this as being a major issue. There's no such thing as a "blob". If someone wants to manipulate the state, they will. We need to keep compatibility to support migrating from version-to-version.

I agree that we want to provide QMP interfaces to work with the state file. But I don't think we should be hostile to manual manipulation.
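
To make the idea concrete, here is a rough sketch of a state file that is updated only through a small command interface, survives a crash because every update is written atomically, and stays readable enough for manual manipulation. Everything in it is an assumption for illustration: the `StateFile` class, the JSON layout, the `version` field, and the file name are hypothetical and not an existing QEMU format or interface.

```python
import json
import os
import tempfile


class StateFile:
    """Hypothetical persistent store for guest configuration state."""

    def __init__(self, path):
        self.path = path
        # An explicit version field is assumed here so that a later
        # release could migrate older files.
        self.state = {"version": 1, "devices": {}}
        if os.path.exists(path):
            with open(path) as f:
                self.state = json.load(f)

    def _save(self):
        # Write to a temporary file in the same directory, then rename
        # over the old file, so a crash never leaves a half-written file.
        directory = os.path.dirname(os.path.abspath(self.path))
        fd, tmp_path = tempfile.mkstemp(dir=directory)
        with os.fdopen(fd, "w") as f:
            json.dump(self.state, f, indent=2, sort_keys=True)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp_path, self.path)

    def set_device(self, dev_id, properties):
        """Record a device and persist the change immediately."""
        self.state["devices"][dev_id] = dict(properties)
        self._save()

    def remove_device(self, dev_id):
        """Forget a device and persist the change immediately."""
        self.state["devices"].pop(dev_id, None)
        self._save()


# Illustrative use: record a cdrom, then reload the file as if after a crash.
path = os.path.join(tempfile.mkdtemp(), "guest-state.json")
store = StateFile(path)
store.set_device("cd0", {"driver": "ide-cd", "bus": "ide.1"})
print(StateFile(path).state)
```

Because every change goes through `set_device` or `remove_device`, the file cannot drift from the in-memory state, while the plain JSON layout means someone who wants to edit it by hand still can.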


Regards,

Anthony Liguori
