Re: [hwloc-devel] comments on API changes

Fawzi Mohamed Fri, 2 Apr 2010 09:03:10 -0400


On 2-apr-10, at 14:49, Brice Goglin wrote:

Fawzi Mohamed wrote:

I would take advantage more info about the possible numa node
connectivity (to know where to steal tasks), but I don't have access
to machines that would really take advantage of that, and probably
even then using the HW structure as topology would not bad.


NUMA connectivity is a big problem. Most x86 machines show nothing
interesting in cat /sys/devices/system/node/node*/distance. On AMD

hypertransport, there are some routing information in lspci, but Ithink

you have to be root to see it, and I am not even sure it's enough to
discover the actual HT graph.

We have a "measurement-based backend" in the TODO-list. It could bethe

only way to find out the NUMA connectivity.


nice

That said, it's not clear that this will be a big problem. AMD
Magny-Cours machines and Nehalem-EX machines with 2-4 sockets have a
fully-interconnected NUMA graph. No problem if we don't have NUMA
topology information there.. For larger machines, SGI is alreadydoing a
good job at reporting NUMA distances in sysfs. What remains is other
large machines (several vendors announced 8-socket Nehalem EXmachines).
We'll see.

indeed as I said the max I have access are 2 socket AMD or Nehalem-EX,so it doesn't matter.

For now I just have:
PU core soket/numa_node machine system

and for the for the foreseeable future that is probably good enough(all nodes in the same machine are at the same distance).

I don't aim at using this system between different machines...

Fawzi

Re: [hwloc-devel] comments on API changes

Reply via email to