Oliver Geisler wrote:

This is the results of skampi pt2pt, first with shared memory allowed,
second shared memory excluded.

Thanks for the data. The TCP results are not very interesting... they look reasonable.

The shared-memory data is rather straightforward: results are just plain ridiculously bad. The results for "eager" messages (messages shorter than 4Kbytes) are around 12 millisec. The results for "rendezvous" messages (longer than 4 Kbytes, signal the receiver, wait for an acknowledgement, then send the message) are about 30 millisec.

I was also curious about "long-message bandwidth", but since SKaMPI is only going up to 16 Kbyte messages, we can't really tell.

But, maybe all that is irrelevent.

Why is shared-memory performance about four orders of magnitude slower than it should be? The processes are communicating via memory that's shared by having the processes all mmap the same file into their address spaces. Is it possible that with the newer kernels, operations to that shared file are going all the way out to disk? Maybe you don't know the answer, but hopefully someone on this mail list can provide some insight.

Reply via email to