Re: [OMPI devel] RFC: [slightly] Optimize Fortran MPI_SEND / MPI_RECV

Jeff Squyres Sat, 7 Feb 2009 15:30:07 -0500

On Feb 7, 2009, at 12:23 PM, Brian W. Barrett wrote:

End result: I guess I'm a little surprised that the difference isthat clear -- does a function call really take 10ns? I'm alsosurprised that the layered C version has significantly more jitterthan the non-layered version; I can't really explain that. I'dwelcome anyone else replicating experiment and/or eyeballing mycode to make sure I didn't bork something up.
That is significantly higher than I would have expected for a singlefunction call. When I did all the component tests a couple yearsago, a function call into a shared library was about 5ns on an IntelXeon (pre-Core 2 design) and about 2.5 on an AMD Opteron.

Good; I'm not crazy for thinking that this is a little too obvious --it smells like I did something wrong. Could someone eyeball thesefiles and see if I missed anything obvious:


http://www.open-mpi.org/hg/hgwebdir.cgi/jsquyres/fortran/file/tip/ompi/mpi/f77/send_f.c
http://www.open-mpi.org/hg/hgwebdir.cgi/jsquyres/fortran/file/tip/ompi/mpi/f77/recv_f.c

--
Jeff Squyres
Cisco Systems

Re: [OMPI devel] RFC: [slightly] Optimize Fortran MPI_SEND / MPI_RECV

Reply via email to