Jeff Squyres wrote:
On Jan 18, 2006, at 9:02 PM, Paul Greidanus wrote:
I've got a new cluster I'm deploying (with trunk ~4105), and I had a
question that came up:
My nodes are 4 and 8 core (8 and 32Gb RAM), and we were wondering if
LAM was compiled to take advantage of the interconnect
(hypertransport) between CPUs, or does it use the IP stack/network to
connect for cpus on the same machine?
LAM has less-than-optimal IB support, but it is functional. We have not
expressly tested on machines with hypertransport -- I actually don't
know if they export normal HCA interfaces (which LAM can support) or
whether a hypertransport-enabled MPI is required.
Ok, can I s/hypertransport/SMP/g on the message, which is more what I'm
thinking of.. the details of the memory subsys inside of a machine
aren't really all that important to this question.. but, will the LAM
that comes with Oscar use mem->mem copies, rather then using the IP
stack, or even worse, send it out to the network device itself?
Also, is it usually preferable to fill up a node, rather then spread
it out across the nodes? i.e. 16 cpu job over 16 nodes, or 2 8 cpu
nodes?
It is usually preferable to fill up a node before spreading to a second
node because latencies between processors can be smaller (e.g., via
using shared memory, or even using the HCA's DMA engine).
Cool, that's what I was thinking too.
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Oscar-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/oscar-devel