Hi, (There's a TL;DR at the end)
While working on preparing and reworking the hyperlaunch series for upstreaming it's slowly becoming apparent the degree of duplication with dom0less. Oleksii's latest effort to move all that code into common[*] (see ad03faa942b9("xen/common: dom0less: make some...) makes this even clearer. There are 1:1 relationships for every key data strucutre, and by deviating we're overcomplicating the ability to share code. [*] https://lore.kernel.org/xen-devel/cover.1746468003.git.oleksii.kuroc...@gmail.com/ dom0less Hyperlaunch ------------------------------ kernel_info boot_domain bootmodule boot_module bootmodule_kind bootmod_type membanks memmap bootinfo boot_info The difficulty in code sharing is not just unfortunate from a vague sense of elegance or neatness. Having different code paths parsing the same DT bindings means it's a matter of time before x86 and all other Xen ports have different bindings, which would would only worsen the situation wrt code sharing and user confusion. I've been trying to get _somewhere_ in using parts of dom0less on x86 and develop a credible PoC that highlights the use of dom0less parsing code paths. The results are interesting. While I didn't get to a full integration in the hyperlaunch series as I hoped due to time constraints I did get far enough to: 1. Replace boot_module, boot_domain and bootmod_type with their dom0less counterparts (pending some cleanup). 2. Isolate binding parsing code in common/device-tree so it's dettached from the not-so-common dom0less domain building logic in dom0less-build.c 3. Do early module kind detection from x86 using code in (2). 4. Do late binding parsing also using code in (2) after unflattening the DTB. And it works well enough under QEMU to populate boot_info to a first approximation. It's missing hyperlaunch-specific bindings (like "domid" or "mode"), but that's just a matter of adding them to the already existing per-arch binding parser ("mode", maybe, would be a candidate for this) or retrofit them in dom0less ("domid" is a very clear candidate for this) and integrating in the larger series to be able to actually boot domains. The PoC does not go all the way due to time constraints, as I said, but I think it goes far enough to convince me it's both feasible and beneficial to go in this direction rather than that of a full reimplementation on x86, specially seeing how Oleksii already aims to have full code reuse on riscv. A brave new world would reuse all data (including bootinfo) and code (bootfdt.c/bootinfo.c), but I couldn't get that far, and I don't think I'll be able to in a timely manner. IOW: I compiled-in device-tree.c, but NOT bootfdt.c or bootinfo.c, or any of the others. I merely extracted from those files the binding parsing bits of interest. It'd be nice to use them, but the domain construction logic is just too different at present. As for the code I tested with, I need to do some serious cleanup before it's ready for sharing, and even moreso for review, but before I go through that I'd like to reach consensus on the following points before investing any more of my time working on the side. TL;DR: I think we should aim to share binding code wherever possible, using common datastructures (kernel_info and bootmodule) as dumping ground for the results of the binding parsing functions. I seek agreement on the following 3 points for the end goal of DTB multidomain boots on x86 before I start slicing my hacks into reasonable chunks. 1. We want common data structures, with arch-specific fields to hold information from xen,domain DT nodes 2. We want to have a single collection of DTB parsers in the code. 3. We want to operate on the unflattened DTB for the majority of parsing. (plus a minimal version on the FDT in order to bootstrap, also common) (2) and (3) are tightly related. There's many reasons for wanting to use the unflattened blobs as much as possible. They range from quirks in specific "dtc" versions, to complexities parsing phandles, to corner cases involving duplicate properties (i.e: due to .dtsi files), etc. Unflattening an FDT brings a lots of "maybe-ok-after-sanitising" FDTs back into canonically correct DTs. I'll share the PoC code as soon as as it's in a presentable state. Hopefully by the end of the week. But I'm sending this ahead of time to start collecting thoughts right away. So. Thoughts? Cheers, Alejandro