Oliver Geisler wrote:
This is the results of skampi pt2pt, first with shared memory allowed,
second shared memory excluded.
Thanks for the data. The TCP results are not very interesting... they
look reasonable.
The shared-memory data is rather straightforward: results are just
plain ridiculously bad. The results for "eager" messages (messages
shorter than 4Kbytes) are around 12 millisec. The results for
"rendezvous" messages (longer than 4 Kbytes, signal the receiver, wait
for an acknowledgement, then send the message) are about 30 millisec.
I was also curious about "long-message bandwidth", but since SKaMPI is
only going up to 16 Kbyte messages, we can't really tell.
But, maybe all that is irrelevent.
Why is shared-memory performance about four orders of magnitude slower
than it should be? The processes are communicating via memory that's
shared by having the processes all mmap the same file into their address
spaces. Is it possible that with the newer kernels, operations to that
shared file are going all the way out to disk? Maybe you don't know the
answer, but hopefully someone on this mail list can provide some insight.