internal architecture of the unified rumprun repository

Antti Kantee Sun, 05 Apr 2015 16:39:06 -0700

Hi,

In the recent weeks I've been pulling the bits of -baremetal and -xentogether and apart and sideways, and it's starting to be clear what theresulting internal (*) architecture of the rumprun unikernel (**) shouldlook like. This is a short('ish) description of what I've finishedpushing today.

*) I'm really keen on defining things in terms of internal and external,since people can now start using our product, and for that it'simportant, in Lampson's words, to "keep a place to stand", with theexternal interfaces of course being the place where users can stand


**) which, important to stress, is not the same thing as a rump kernel

First, why unify at all? For one, it keeps me happier since I don'thave to change the same code multiple times. Second, it will keepgratuitous differences from creeping in, which has observably beenhappening (and equally observable bugs resulting from those gratuitousdifferences!). Third, unified code will work the same way, which isgood for consistency between platforms. Note, I'm not saying that allplatforms will work exactly the same, but at least samer (though, that'sprobably not an actual word).

The problem that I generated from the first rumprun stack based onMiniOS, and I stress that it was in no way of a result of flaw inMiniOS, was the lack of any separation between "userspace" and"kernelspace". That was fine in MiniOS, since it provides aself-contained package, but not so much the case for rumprun, wherethere is a clear *conceptual* separation between userspace andkernelspace; the conceptual separation of course does not mean we can'tlink everything into a single address space like we are doing withrumprun. We want to avoid moebius-strip computing where the kernel runson top of libc and vice versa, because that makes it very hard to reasonabout dependencies of components. While it seemed like a good idea atthe time, we desperately want to fix the mishmash now.

So, getting to the architecture description, the rule is that upperlayers can depend on lower ones but not vice versa:

1) platform, which provides low-level bootstrap and platform-dependentroutines such as the clock.2) core, which provides platform-independent routines such as thescheduler and also MD low-level routines such as thread contextswitching, parts of rumpuser and, eventually, atomic operations. Note:core should define the interfaces to be implemented by platform whichare used by layers >=#2 (mostly #3 and #5).3) rumpuser, which is below libc but above platform/core. Notably, therump kernel depends on this layer (and transitively the ones below)

4) rump kernel

5) base, which provides common userlevel routines, e.g. when ones fromthe regular libc are not applicable. conceptually, libc is also in thislayer, but of course we don't implement our own libc

The actual application(s) would be layer 6. It's nice that we didn'tneed 7 layers, because 7-layer designs suck, as famously demonstrated bysome committee designing networking stacks.

If you think the names suck, invent just better ones, and we can changethem without screwing users because they're exposed only internally(!).

Ok, so layers 1 and 2 don't really follow the dependency rule, since "1"can use features from "2". We could create layer "0" to address thisand put e.g. the atomic ops there, but I'm not sure it's worth the fusscurrently. If it some days turns out to be worth the fuss, hey,internal interfaces, we can just do it without screwing users ...

So, we get clean, conceptual separation between the "kernel" and"usermode", and therefore it should be mostly trivial to run the rumpkernel on a given platform without userspace (build goonotwithstanding). That userspace-less mode of operation may beinteresting for example to small embedded system vendors who mostly wantkernel functionality and don't mind writing some hundreds of lines of"application" directly against rump kernel syscalls for the benefit ofbeing able to not ship "userspace" at all. For example, networkedsensor devices come to mind.

The other separation we gain is independence between the rumprununikernel and the rump kernel. Yes, that makes sense ;)

Purely theoretically speaking *winkwink*, if some other OS besidesNetBSD were to be structured to run on top of the rump kernel hypercallinterface (i.e. turned into an anykernel), and a suitable libc were toalso be available, that alternative OS could be offered as amore-or-less drop-in replacement of the core of the rumprun unikernel.

All of the above layers now exist, and while there's still a bit ofrototilling left, things are starting to look peachy. For example, youcan have a look at how delightfully empty rumprun/platform/xen already is.


  - antti

internal architecture of the unified rumprun repository

Reply via email to