On 23/07/2025 3:21 pm, Jan Beulich wrote:
> On 23.07.2025 16:13, Andrew Cooper wrote:
>> On 23/07/2025 2:56 pm, Yann Sionneau wrote:
>>> xen.efi PE does not boot when loaded from shim or some patched
>>> downstream grub2.
>>>
>>> What happens is the bootloader would honour the MEM_DISCARDABLE
>>> flag of the .reloc section meaning it would not load its content
>>> into memory.
>>>
>>> But Xen is parsing the .reloc section content twice at boot:
>>> * https://elixir.bootlin.com/xen/v4.20.1/source/xen/common/efi/boot.c#L1362
>>> * 
>>> https://elixir.bootlin.com/xen/v4.20.1/source/xen/arch/x86/efi/efi-boot.h#L237
>>>
>>> Therefore it would crash with the following message:
>>> "Unsupported relocation type" as reported there:
>>>
>>> * 
>>> https://github.com/QubesOS/qubes-issues/issues/8206#issuecomment-2619048838
>>> * 
>>> https://lore.kernel.org/xen-devel/7e039262-1f54-46e1-8f70-ac3f03607...@suse.com/T/#me122b9e6c27cd98db917da2c9f67e74a2c6ad7a5
>>>
>>> This commit adds a small C host tool named keeprelocs
>>> that is called after xen.efi is produced by the build system
>>> in order to remove this bit from its .reloc section header.
>>>
>>> Signed-off-by: Yann Sionneau <yann.sionn...@vates.tech>
>>> ---
>>>  xen/Makefile           |   5 +-
>>>  xen/arch/x86/Makefile  |   1 +
>>>  xen/tools/Makefile     |   3 ++
>>>  xen/tools/keeprelocs.c | 119 +++++++++++++++++++++++++++++++++++++++++
>>>  4 files changed, 127 insertions(+), 1 deletion(-)
>>>  create mode 100644 xen/tools/keeprelocs.c
>> I'm sick and tired of the hoops we have to jump through for broken
>> tooling.  This is now rewriting the PE+ metadata because apparently the
>> linker can't do it correctly.
> The linker is doing it correctly. It is us (and very likely just us) who
> have special needs here.
>
>> Either fix the linker (or the way we drive it/etc), or we're doing away
>> with PE+ emulation entirely and writing the MZ/PE headers by hand like
>> literally every other kernel does.
> "Fixing" the linker was suggested, but with my binutils maintainer hat on
> I'm inclined to not accept a Xen-only option into the linker.

Either Xen is doing something wrong, and should be doing it differently,
or Xen is doing something right and the tooling is
wrong/incomplete/whatever.

As a related question, is anyone other than Xen using PE+ emulation in
anger?  Every other kernel/OS level tool I'm aware of writes the MZ/PE
header by hand, and frankly, the list of bugs we've found in PE+
emulation would strongly suggest that noone else is using it.

~Andrew

Reply via email to