On 5/21/25 10:35, Alejandro Vallejo wrote:
Hi,

(There's a TL;DR at the end)

While working on preparing and reworking the hyperlaunch series for
upstreaming it's slowly becoming apparent the degree of duplication with
dom0less.

Yes, this was by design so that when we got to the point of convergence it would be apparent where the commonalities are and thus be collapsed.

Oleksii's latest effort to move all that code into common[*] (see
ad03faa942b9("xen/common: dom0less: make some...) makes this even clearer.
There are 1:1 relationships for every key data strucutre, and by deviating
we're overcomplicating the ability to share code.

   [*] 
https://lore.kernel.org/xen-devel/cover.1746468003.git.oleksii.kuroc...@gmail.com/

     dom0less           Hyperlaunch
     ------------------------------
     kernel_info        boot_domain
     bootmodule         boot_module
     bootmodule_kind   bootmod_type
     membanks                memmap
     bootinfo             boot_info

If you go back, you will see less of these differences. A lot of the variations have been the results of reviewer's request. And that goes to the fact that dom0less was developed with Arm mentality in terminology, eg. memory banks verse memory map.

The difficulty in code sharing is not just unfortunate from a vague sense of
elegance or neatness. Having different code paths parsing the same DT bindings
means it's a matter of time before x86 and all other Xen ports have different
bindings, which would would only worsen the situation wrt code sharing and user
confusion.

Only if we allowed it, but with that said there are subtleties between the arch that will require variation. Such as mode, which is primarily an x86 specific.

I've been trying to get _somewhere_ in using parts of dom0less on x86
and develop a credible PoC that highlights the use of dom0less parsing
code paths. The results are interesting.

While I didn't get to a full integration in the hyperlaunch series as I hoped
due to time constraints I did get far enough to:

   1. Replace boot_module, boot_domain and bootmod_type with their dom0less
      counterparts (pending some cleanup).
   2. Isolate binding parsing code in common/device-tree so it's dettached from
      the not-so-common dom0less domain building logic in dom0less-build.c
   3. Do early module kind detection from x86 using code in (2).
   4. Do late binding parsing also using code in (2) after unflattening the DTB.

And it works well enough under QEMU to populate boot_info to a first
approximation. It's missing hyperlaunch-specific bindings (like "domid"
or "mode"), but that's just a matter of adding them to the already
existing per-arch binding parser ("mode", maybe, would be a candidate
for this) or retrofit them in dom0less ("domid" is a very clear
candidate for this) and integrating in the larger series to be able to
actually boot domains.

The PoC does not go all the way due to time constraints, as I said, but
I think it goes far enough to convince me it's both feasible and
beneficial to go in this direction rather than that of a full
reimplementation on x86, specially seeing how Oleksii already aims to
have full code reuse on riscv.

A brave new world would reuse all data (including bootinfo) and code
(bootfdt.c/bootinfo.c), but I couldn't get that far, and I don't think
I'll be able to in a timely manner. IOW: I compiled-in device-tree.c,
but NOT bootfdt.c or bootinfo.c, or any of the others. I merely
extracted from those files the binding parsing bits of interest.

It'd be nice to use them, but the domain construction logic is just too
different at present. As for the code I tested with,  I need to do some serious
cleanup before it's ready for sharing, and even moreso for review, but before I
go through that I'd like to reach consensus on the following points before
investing any more of my time working on the side.

TL;DR:

I think we should aim to share binding code wherever possible, using common
datastructures (kernel_info and bootmodule) as dumping ground for the results
of the binding parsing functions. I seek agreement on the following 3 points
for the end goal of DTB multidomain boots on x86 before I start slicing
my hacks into reasonable chunks.

   1. We want common data structures, with arch-specific fields to hold
      information from xen,domain DT nodes
   2. We want to have a single collection of DTB parsers in the code.
   3. We want to operate on the unflattened DTB for the majority of parsing.
     (plus a minimal version on the FDT in order to bootstrap, also common)

As for 3, I can repost my original analysis that went to the working group on why using this is not the best idea. Doing 3 would require doing at least two, if not three passes on the DTB for x86 with zero benefit/need since, unlike Arm, we never have to read from it after boot. The way the x86 parser is today, we are walking the DTB only once.

What I would suggest for 2 is that, perhaps, it might be an opportune time to review the existing DTB parsing code. Providing a common interface to parse standard/spec DTB structures regardless if it is coming from an FTB or via the FTB index, aka "unflattened" DTB. Doing so would enable a complete reuse within the DTB parsers and remove then need for 3.


(2) and (3) are tightly related. There's many reasons for wanting to use the
unflattened blobs as much as possible. They range from quirks in specific "dtc"
versions, to complexities parsing phandles, to corner cases involving duplicate
properties (i.e: due to .dtsi files), etc. Unflattening an FDT brings a
lots of "maybe-ok-after-sanitising" FDTs back into canonically correct DTs.

I'll share the PoC code as soon as as it's in a presentable state.
Hopefully by the end of the week. But I'm sending this ahead of time to
start collecting thoughts right away.

So. Thoughts?


v/r,
dps


Reply via email to