Hello
I don't have access to a MI300A but I worked with AMD several month ago
to solve a very similar issue. It was caused by a buggy APCI HMAT in the
BIOS.
Try setting HWLOC_USE_NUMA_DISTANCES=0 in the environment to disable the
hwloc code that uses this HMAT info. If the warning goes away
Is there a timeline for hwloc to support the MI300A? Currently, hwloc isn’t
happy when it encounters one:
* hwloc 2.9.0 received invalid information from the operating system.
*
* Failed with: intersection without