Hyperlaunch Device Tree Discussion

Daniel P. Smith Wed, 21 May 2025 08:39:48 -0700

Greetings,

Per my response to Allejandro's message, here is the response sent thethe DTB working group formed last year to discuss DTB parsing for x86.



Original Message:

I have copied everyone that attended the hyperlaunch working group a fewweeks back to ensure everyone has a chance to review and comment.

As a start and to provide a common understanding, first is a quickoverview of Flattened Device Tree and Xen's "Unflattened Device Tree".The intent is to assist everyone in having an equal footing whenconsidering the impacts that Device Tree parsing brings.

A Flattened Device Tree (FDT) is a nested linear tree structure thatuses a combination of tags, layout definition, and headers to allownavigation through the tree. Because the layout is nested, if given theoffset for a node in the FDT, it is possible to start at that node andjump directly into the tree to access child nodes and properties.Provided below is a visual representation of what any parent node,including the root node, may look like:


+------------------------------+
| NODE TAG (parent node)       |
+------------------------------+
| Null-term String (node name) |
+------------------------------+
| PROPERTY TAG                 |
+------------------------------+
| struct property {            |
|   u32 len                    |
|   u32 name_offset            |
| }                            |
+------------------------------+
| Property Data                |
+------------------------------+
| NODE TAG (child node)        |
+------------------------------+
| Null-term String (node name) |
+------------------------------+
| PROPERTY TAG                 |
+------------------------------+
| struct property {            |
|   u32 len                    |
|   u32 name_offset            |
| }                            |
+------------------------------+
| Property Data                |
+------------------------------+
| END NODE TAG (child node)    |
+------------------------------+
| END NODE TAG (parent node)   |
+------------------------------+

Before moving forward, let us clarify some terminology to ensure acommon understanding when discussing a tree.

- node path: represents a series of hierarchical child nodes startingat the root node- adjacent node: the logically next node that is at the same level inthe tree- child node: a node that is a one level lower leaf to another node,the parent node- tree walk: incrementally walking the nodes, to locate a specificnode or to iterate over the whole tree

The libfdt library provides handlers for finding the offset of a node,as well as handlers to jump to a node offset and iterate only on thechild nodes. While the libfdt is fairly optimized, the reality is thatto find a node, the library must do a tree walk starting with the firstnode written in the FDT. If a node is not a path match at the currentdepth, it must cross a null terminated string, all the node's propertyentries and all children nodes to reach the next adjacent node. Once apath match for the depth is found, then the search may descend into thenext depth and repeat the process until a match at that level is found.

This brings us to Xen's "Unflattened Device Tree" (UDT), for which I amquoting as I find myself thinking of it in another way, which IMHO is amore descriptive name, which is that it is an FDT lookup index. It justhappens that the implementation for the lookup index structure is a treestructure. UDT uses a structure to represent a node and one to representa property. The node structure is a traditional tree structure withadjacent and child node pointers. The contents of both structures arepointers to the respective memory locations within the FDT. As with theFDT, in order to locate a node in the index, a tree walk of the indexmust be done. The difference comes when a node is not a path match, toreach the adjacent node, it only needs to access the node pointed to bythe adjacent node pointer of the current node. UDT provides an API forwalking the node tree, walking the property list for a node, and methodsfor type-interpreted extraction of property values. NB: thetype-interpreted extraction API is codified around taking a UDT propertystructure, but the interpreted extraction logic isn't UDT specific as itis still reading the property value from the FDT.

The benefit UDT brings is when repeated node lookups and/or repeatedphandle dereferencing are done. For both FDT and UDT, a tree walk mustbe done. The walk will start with a node, either the root node or onefor which a reference has already been found, walking each adjacent nodeand descending into a node's children when a path match occurs. Forphandle dereferencing, the benefit is greater due to the fact that whenindexing the FDT, phandles get dereferenced, thus allowing directreference in the index. For comparison, a phandle dereference usinglibfdt does a walk of the tree to find the node referenced by the phandle.

The UDT, as implemented, is not without cost. The current implementationtakes two complete walks of the entire FDT using libfdt. The first passis to obtain the amount of memory that is required to allocate enoughinstances of the UDT node and property structures to represent the fulltree. The second pass is when the FDT nodes and properties are indexedinto the UDT.

With the expense of using FDT and UDT established, it is important toput that expense into context. Consider hyperlaunch on x86 where thearch itself has no DT requirements. In all likelihood, an FDTconstructed for this arch would only contain the nodes necessary forhyperlaunch. If hyperlaunch were constructed x86 only, loading theconfiguration could be done in a single full walk of the FDT, even whenconsidering phandle usage. The reason this is true for the phandlescase, is that as nodes known to be a phandle target are encountered,their offset into the FDT could be stored with dereferencing resolvedpost walk.

For DT based archs, currently accepted costs are two FDT node lookupsalong with the two full walks to construct the UDT. These first two nodelookups being the memory allocation table and the Xen command line. Asnoted above, an FDT node lookup requires a walk of the linear tree untilthe node is located. AIUI at this point is that the number of nodes thatmust be crossed is indeterminate. I did not see anything in the devicetree specification that the FDT must be packed in the same order as thestring representation. NB: I have not reviewed the DT compiler to see ifit optimizes for early access nodes to be packed at the beginning of thelinear tree to reduce the number of nodes that must be crossed.

While the aforementioned strategy for x86 would be optimal for x86, itis not necessarily the best for DT based archs. Hyperlaunch started, andcurrently is, focused on the x86 arch, but long term it was alwaysunderstood that its more expansive design would be desirable by allarchs. Like anything that moves into common, a slightly less efficientapproach for one platform is accepted for the benefit of a commonimplementation that reduces the amount of code while increasing thenumber of reviewers.

After listening to everyone's concerns, re-reviewing all of Arm's devicetree logic, and considering everything in totality, the conclusion isthat there is a core root cause from with which all the concerns raisedflow. First a summary of the main concerns raised,

- The issue of memory allocator(s) available at the time when thefirst FDT walk/parsing occurs.- Overhead of doing a more than one FDT walk to obtain the hyperlaunchconfiguration when phandles are in use.- Supporting FDT would require the introduction of aduplicate/competing set of property parsers.

This root cause is due to a design decision difference made for thehyperlaunch domain builder versus the dom0less domain builder and Arm'sapproach to device tree parsing. For dom0less, the approach is to walkthe UDT index tree at the domain construction time, which appears tostem from Arm's need and practice of repeatedly accessing device treeentries. Whereas x86 has no need for the device tree and took theapproach to do a single walk to extract its configuration into a codeusable structure.

With that understanding, it is believed that these two approaches arenot diametrically opposed and in fact can be blended together to resultin a generally optimized approach. The approach will be to conduct twofull walks of the FDT, an early boot pass before memory allocation isavailable and a second pass after a memory allocator is set up. Bothpasses serve to populate the proposed boot info structure, specificallythe scope of these passes are as follows:


Early FDT Walk: (static values)
 - calculate the space required for the device tree index
 - count the number of domains defined
 - Xen command line
 - XSM policy
 - arch specific static values, via an arch_early_fdt()

Late FDT Walk: (dynamic values)
 - construct device tree index, aka "unflattened device tree"

- populate boot modules entries (NB: boot modules are expected to bestatic array)

 - store device tree node index for top level index and hyperlaunch node
 - populate boot domain entries, basis will be device tree index node
 - arch specific dynamic values, via an arch_late_fdt()

By taking this approach which is constructed around a set of archneutral structures will enable another goal of the hyperlaunch series,which is to move to having a common domain creation/construction logic.Currently, there is a significant amount of duplication in each arch'sbranch for boot time construction of domains. It will also allowremoving arch specific code from the initialization of commoninfrastructure such as XSM.


V/r,
Daniel P. Smith

Hyperlaunch Device Tree Discussion

Reply via email to