Re: next meeting: Monday May 19th 10:00 - 12:00 CEST

Hannes Mehnert Tue, 29 Apr 2025 03:45:39 -0700

Hello again,

our next meeting is on May 19th (in 2.5 weeks) -- live from the retreat(let's see how that goes). We'll be at https://meet.jit.si/MirageOS --see our shared pad at https://pad.data.coop/To6IOSeNSOK9kFVlgo7XWw?both#for notes and agenda (add your talking points there) :)


Here are the notes from our meeting last Monday.

Have an awesome day,

Hannes

## Meeting April 28th 10:00 - 12:00 CEST
- Participants: Fabrice, Pierre, Reynir, Hannes, Sam, Romain

### Defunctorization work

Hannes worked on defunctorizing Mirage. It seems to work well and nocomplaints.

Hannes: should we defunctorize the network stack next and the block devices?

Pierre: is very happy with the current state of defunctorization. Forthe network stack, at least for Xen it will be tricky since we need twonetwork interfaces -- for backend and frontend.Romain: Did an experiment about MirageOS and miou-solo5, and has a TCPstack without functors -- still an experimental project

Reynir: If I understand, it goes a bit further with no functor at all
Romain: https://git.robur.coop/dinosaure/experiment-miou-solo5

Romain: and a little article here:https://blog.robur.coop/articles/utcp_and_effects.html


### Unikraft
Sam: the work on unikraft is close to be published

Sam: performance for network is pretty good, for the block device thesituation is less clear

Fabrice: block solo5 outperforms the unikraft

Hannes: from their website (unikraft), they claim much higherperformance than solo5

Fabrice: network is fine, it is a little faster
Sam: one of the next steps is to have a real benchmark with network
Romain: do you know the performance issues with block devices and unikraft?

Fabrice: it uses virtio device, quite different from the network one.performs badly if you do small sector (one sector at one time)operations -- much better if you operate on multiple sectorsFabrice: you can as well have multiple operations in flight withunikraft - which helps to shadow the single sector bad performanceRomain: did some performance benchmarks in terms of mirage-tcpip andutcp, and there's a large gap in the benchmarks (using iperf3, we'renear 1Gb/s with mirage-tcpip -- and with utcp 900 Mb/s)

Fabrice: our network test served one file

Romain: we have a really old unikernel which is compatible with iperf2,but the experiment above has iperf3 supportRomain: mirage-tcpip is 1Gbits/s and utcp is 900 Gbits/s on ourexperimentation. The main question is about scheduling now (where mioudiffers from what lwt does)Sam: for getting it released, we populate the different repositoriesunder the mirage organization, and open PRs for the repositories wherewe have unikraft patches


### utcp https://github.com/robur-coop/utcp

Hannes: TCP/IP stack based on a formal model (HOL4, SML; manuallytranslated SML to OCaml - Recently we found a mistranslation caused by amissing set of parenthesis).Hannes: mirage-tcpip: it works very well, but as discussed on themailing list it has obscure semantics in certain cases. It is deeply inthe LWT monad and has memory leaks.Hannes: Utcp has a pure, functional core with unit tests. Recentlyworked on performance with Romain. µtcp still lacks congestion controland newer features of TCP such as selective acknowledgement (whichmirage-tcpip also doesn't implement). µtcp started off several yearsago. We (in Robur) run it in production machines, and we have mostlyworked on correctness and resource usage, and we still have correctnessissues (failed assertions) and resource usage issues. Performance wiseµtcp tries to stick to congression control and window sizes whilemirage-tcpip doesn't try to adhere to a specific congression controlalgorithm or bound the memory usage. The gained interest of µtcp is alsodue to it not being tied to lwt and thus allows for other schedulers.Hannes: utcp is meant to replace only mirage-tcpip's src/tcp (ocamlfindtcpip.tcp)Romain: for the miou TCP/IP stack, we worked on a new IP stack (which isdifferent from mirage-tcpip's one)


### Mirage CI

Hannes: OCaml 5.4 support in ocaml-solo5 (ocaml-unikraft)? -- twodifferent repositories but shared patches, also 5.4 has most patchesupstream \o/ -- https://github.com/ocaml/ocaml/pull/13810 (maybe we canask Antonin, Gabriel, Florian at the retreat whether that can make it to5.4)

Hannes: OCaml 5.3 is not yet tested in the Mirage CI

Hannes: there's a PR from Tim about fixing the OCaml 5.2.1 supporthttps://github.com/ocurrent/mirage-ci/pull/51


### Remove bigarray from Cstruct

Pierre: experimented with branches from Hannes that use cstruct wherethe buffer is Bytes.t

Pierre: updated io-page to not rely on cstruct, but use bigarray directly
Pierre: this currently works with QubesOS, a hello world runs nicely

Pierre: ran into issues when running the network stack, mirage-tcpip istightly coupled with cstruct and relies on the fact that cstruct isbased on bigarray (esp. C stubs)

Pierre: may use utcp to check whether that'll be good enough / work

Pierre: need a careful review of the io-page API, since all itsdependencies need updatesHannes: the only C code is the checksum code, no? in utcp we have pureOCaml checksum code, and we could use that in mirage-tcpip (we use sometrick about bigarray to outperform the computation)


### Ownership of buffers on the IP level

Romain: another experiment with the IP stack, with mirage-tcpip you havecstruct everywhere -- difficult to change to something else. withcstruct you want to not copy when you have a fragmented packet -- withbigarray and cstruct you can have a subview (without copying). the ideais to have a bigarray directly when you have a defragmented packet, andif it is fragmented you get a copy of the bigarraysRomain: with mirage-tcpip you have the question about ownership andfragmented/defragmented: do you have the ownership or not?Romain: My intuition is at the IP level, we should have a variantbetween a bigarray (defragmented) and a string (fragmented) -- if youhave a bigarray you should care about the ownership

Hannes: in practise, 99.9999% of IP packets are not fragmented

### Checksum code - performance investigations

Hannes: maybe we should measure the utcp checksum code (OCaml code) andmirage-tcpip checksum code (C code)Romain: it is tricky due to memory layout, and also you've to take carethat OCaml 5 C-FFI is different (and introduces a memory barrier), socheck what your environment is before doing the benchmark (CPU cache)Hannes: maybe the checksum code could then be in a separate, independentpackage used by both utcp and mirage-tcpipHannes: question about the performance focus: arm? x86? 64 bits only?also 32 bits?


### tcpip handling of RST
Pierre: curious whether there was more communication about uTCP
Hannes: there wasn't

Pierre: I'll try reach out to them, one of the advisors is in myreasearch group

Re: next meeting: Monday May 19th 10:00 - 12:00 CEST

Reply via email to