Anthony Liguori wrote:
Jeremy Fitzhardinge wrote:
Anthony Liguori wrote:
That seems unnecessarily complex.
Well, the simplest thing is to let the host TCP stack do TCP. Could
you go into more detail about why you'd want to avoid that?
The KVM model is that a guest is a process. Any I/O operations
originate from the process (QEMU). The advantage of this is that you
get very good security because you can use things like SELinux and
simply treat the QEMU process as you would the guest. In fact, in
general, I think we want to assume that QEMU is guest code from a
security perspective.
By passing up the network traffic to the host kernel, we now face a
problem when we try to get the data back. We could set up a tun device
to send traffic to the kernel but then the rest of the system can see
that traffic too. If that traffic is sensitive, it's potentially unsafe.
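
For reference, a tap/tun backend boils down to something like this
minimal sketch (simplified, not QEMU's actual tap code):

/* Minimal sketch: attach to a tap interface from userspace.
 * Illustrative only; QEMU's real tap backend does more
 * (vnet header negotiation, nonblocking I/O, etc.). */
#include <fcntl.h>
#include <string.h>
#include <sys/ioctl.h>
#include <linux/if.h>
#include <linux/if_tun.h>
#include <unistd.h>

static int tap_open(const char *name)
{
    struct ifreq ifr;
    int fd = open("/dev/net/tun", O_RDWR);
    if (fd < 0)
        return -1;

    memset(&ifr, 0, sizeof(ifr));
    ifr.ifr_flags = IFF_TAP | IFF_NO_PI;   /* ethernet frames, no packet info */
    strncpy(ifr.ifr_name, name, IFNAMSIZ - 1);

    if (ioctl(fd, TUNSETIFF, &ifr) < 0) {
        close(fd);
        return -1;
    }
    /* Frames written here appear on the host interface and vice versa,
     * which is exactly why the rest of the host can see the traffic. */
    return fd;
}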
Well, one could come up with a mechanism to bind an interface to be only
visible to a particular context/container/something.
You can use iptables to restrict who can receive traffic and possibly
use SELinux packet tagging or whatever. This gets extremely complex
though.
Well, if you can just tag everything based on interface, it's relatively
simple.
It's far easier to avoid the host kernel entirely and implement the
backends in QEMU. Then any actions the backend takes will be on
behalf of the guest. You never have to worry about transport data
leakage.
Well, a stream-like protocol layered over a reliable packet transport
would get you there without the complexity of TCP. Or just do a
usermode TCP; it's not that complex if you really think it simplifies the
other aspects.
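
For concreteness, a reliable, in-order packet transport makes the stream
layering nearly trivial; something along these lines, with send_pkt()
standing in for whatever primitive the transport actually provides:

/* Sketch: a byte stream layered over a reliable, in-order packet
 * transport.  send_pkt() is a hypothetical stand-in for the transport's
 * own send primitive (e.g. pushing a buffer onto a virtio ring); it is
 * assumed to deliver packets reliably and in order, which is what lets
 * the layering stay this small. */
#include <stddef.h>
#include <stdint.h>

#define PKT_MAX 4096

int send_pkt(const void *buf, size_t len);   /* assumed transport primitive */

/* Write a byte stream by chunking it into transport packets. */
static int stream_write(const uint8_t *data, size_t len)
{
    while (len > 0) {
        size_t chunk = len > PKT_MAX ? PKT_MAX : len;
        if (send_pkt(data, chunk) < 0)
            return -1;
        data += chunk;
        len  -= chunk;
    }
    return 0;
}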
This is why I've been pushing for the backends to be implemented in
QEMU. Then QEMU can marshal the backend-specific state and transfer
it during live migration. For something like copy/paste, this is
obvious (the clipboard state). A general command interface is
probably stateless, so it's a no-op.
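
As a rough illustration (the names and layout here are made up, not
QEMU's actual savevm interface), marshalling the clipboard backend could
be as simple as emitting one length-prefixed blob:

/* Sketch of marshalling a clipboard backend's state for migration.
 * Hypothetical structure and function names; the point is only that
 * the state is a single, well-defined blob the backend can emit on
 * the source and re-load on the destination. */
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

struct clipboard_state {
    uint32_t len;
    uint8_t *data;          /* current clipboard contents */
};

/* Emit a length-prefixed copy of the clipboard contents. */
static uint8_t *clipboard_marshal(const struct clipboard_state *s,
                                  size_t *out_len)
{
    uint8_t *buf = malloc(4 + s->len);
    if (!buf)
        return NULL;
    memcpy(buf, &s->len, 4);
    memcpy(buf + 4, s->data, s->len);
    *out_len = 4 + s->len;
    return buf;
}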
Copy/paste seems like a particularly bogus example. Surely this
isn't a sensible way to implement it?
I think it's the most sensible way to implement it. Would you suggest
something different?
Well, off the top of my head I'm assuming the requirements are:
* the goal is to unify the user's actual desktop session with a
virtual session within a vm
* a given user may have multiple VMs running on their desktop
* a VM may be serving multiple user sessions
* the VMs are not necessarily hosted by the user's desktop machine
* the VMs can migrate at any moment
To me that looks like a daemon running within the context of each of the
user's virtual sessions monitoring clipboard events, talking over a TCP
connection to a corresponding daemon in their desktop session, which is
responsible for reconciling cuts and pastes in all the various sessions.
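
Roughly, the guest-session side could be as small as this sketch (the
address, port number and one-line wire format are placeholders):

/* Sketch of the guest-session daemon pushing a clipboard update to the
 * desktop-session daemon over TCP.  Port and framing are hypothetical;
 * real code would keep the connection open and frame messages properly. */
#include <arpa/inet.h>
#include <netinet/in.h>
#include <string.h>
#include <sys/socket.h>
#include <unistd.h>

static int push_clipboard(const char *desktop_addr, const char *text)
{
    struct sockaddr_in sa;
    int fd = socket(AF_INET, SOCK_STREAM, 0);
    if (fd < 0)
        return -1;

    memset(&sa, 0, sizeof(sa));
    sa.sin_family = AF_INET;
    sa.sin_port   = htons(5599);                 /* hypothetical port */
    inet_pton(AF_INET, desktop_addr, &sa.sin_addr);

    if (connect(fd, (struct sockaddr *)&sa, sizeof(sa)) < 0) {
        close(fd);
        return -1;
    }
    (void)write(fd, text, strlen(text));
    close(fd);
    return 0;
}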
I guess you'd say that each VM would multiplex all its cut/paste events
via its AF_VMCHANNEL/cut+paste channel to its qemu, which would then
demultiplex them off to the user's real desktops. And that since the VM
itself may have no networking, it needs to be a special magic connection.
And my counter-argument to this nicely placed straw man is that the
VM<->qemu connection can still be TCP, even if it's a private network
with no outside access.
I'm not a fan of having external backends to QEMU for the very
reasons you outline above. You cannot marshal the state of a
channel we know nothing about. We're really just talking about
extending virtio in a guest down to userspace so that we can
implement paravirtual device drivers in guest userspace. This may
be an X graphics driver, a mouse driver, copy/paste, remote
shutdown, etc.
A socket seems like a natural choice. If that's wrong, then we
can explore other options (like a char device, virtual fs, etc.).
I think a socket is a pretty poor choice. It's too low level, and it
only really makes sense for streaming data, not for data storage
(name/value pairs). It means that everyone ends up making up their
own serializations. A filesystem view with notifications seems to be
a better match for the use-cases you mention (aside from cut/paste),
with a single well-defined way to serialize onto any given channel.
Each "file" may well have an application-specific content, but in
general that's going to be something pretty simple.
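
For example, a single agreed-upon record format for name/value updates
could be as trivial as this (the layout is purely illustrative):

/* Sketch of one well-defined serialization for name/value updates,
 * instead of each channel inventing its own wire format.
 * Record layout: <u32 name_len><name><u32 value_len><value>. */
#include <stdint.h>
#include <string.h>

static size_t nv_encode(uint8_t *buf, size_t buflen,
                        const char *name, const char *value)
{
    uint32_t nlen = strlen(name), vlen = strlen(value);
    size_t need = 8 + nlen + vlen;

    if (need > buflen)
        return 0;
    memcpy(buf, &nlen, 4);
    memcpy(buf + 4, name, nlen);
    memcpy(buf + 4 + nlen, &vlen, 4);
    memcpy(buf + 8 + nlen, value, vlen);
    return need;
}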
I had suggested a virtual file system at first and was thoroughly
ridiculed for it :-) There is a 9p virtio transport already so we
could even just use that.
You mean 9p directly over a virtio ringbuffer rather than via the
network stack? You could do that, but I'd still argue that using the
network stack is a better approach.
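
(For what it's worth, on the guest side 9p over virtio is just a mount;
something like the sketch below, where "hostshare" is a hypothetical
mount tag the host side would have to export:)

/* Sketch: mounting a 9p filesystem over the virtio transport from
 * inside the guest.  Error handling kept minimal. */
#include <stdio.h>
#include <sys/mount.h>

int main(void)
{
    if (mount("hostshare", "/mnt/host", "9p", 0, "trans=virtio") < 0) {
        perror("mount 9p");
        return 1;
    }
    return 0;
}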
The main issue with a virtual file system is that it does not map well
to other guests. It's actually easier to implement a socket interface
for Windows than it is to implement a new file system.
There's no need to put the "filesystem" into the kernel unless something
else in the kernel needs to access it. A usermode implementation
talking over some stream interface would be fine.
But we could find ways around this with libraries. If we used 9p as a
transport, we could just provide a char device in Windows that
received it in userspace.
Or just use a tcp connection, and do it all with no kernel mods.
(Is 9p a good choice? You need to be able to subscribe to events
happening to files, and you'd need some kind of atomicity guarantee. I
dunno, maybe 9p already has this or can be cleanly adapted.)
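
(To illustrate what I mean by subscribing: on the Linux side it's
inotify-style behaviour, as in the sketch below, which watches a made-up
directory; whether 9p could forward that kind of notification is exactly
the open question.)

/* Sketch: the kind of "subscribe to file events" behaviour the channel
 * would need, shown with Linux inotify on a hypothetical local path. */
#include <stdio.h>
#include <sys/inotify.h>
#include <unistd.h>

int main(void)
{
    char buf[4096];
    int fd = inotify_init();
    inotify_add_watch(fd, "/var/run/vmchannel", IN_MODIFY | IN_CREATE);

    for (;;) {
        ssize_t len = read(fd, buf, sizeof(buf));   /* blocks until an event */
        if (len <= 0)
            break;
        struct inotify_event *ev = (struct inotify_event *)buf;
        printf("event mask 0x%x on %s\n",
               (unsigned)ev->mask, ev->len ? ev->name : "");
    }
    return 0;
}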
J