We get this question so often that I really need to add it to the FAQ. :-\

Open MPI currently always spins while waiting for completion, for exactly the reason that Scott cites: lower latency.

Arguably, when using TCP, we could probably get a bit better performance by blocking and letting the kernel make more progress than a single quick pass through the sockets progress engine allows, but that introduces other difficulties, such as simultaneously allowing shared-memory progress. We have ideas for how to make this work, but it has unfortunately remained a lower priority: the performance difference isn't that great, and we've been focusing on the other, lower-latency interconnects (shmem, MX, verbs, etc.).
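
To make the trade-off a bit more concrete, here's a toy sketch of the two ways a rank can wait for a message on a TCP socket. This is emphatically not our actual progress code; try_progress_once() is just a made-up stand-in for one quick pass through a progress function:

    /* Illustration only; NOT Open MPI's progress engine.  A toy contrast
     * of "spin until done" vs. "block in the kernel until done" for a
     * single TCP socket, showing why the first makes top report ~100% CPU. */
    #include <poll.h>
    #include <stdbool.h>
    #include <sys/socket.h>

    /* made-up stand-in for one quick pass through a progress engine:
     * returns true once a byte is waiting on the socket */
    static bool try_progress_once(int fd)
    {
        char byte;
        return recv(fd, &byte, 1, MSG_DONTWAIT | MSG_PEEK) == 1;
    }

    /* what we do today: spin.  The process never sleeps, so the scheduler
     * always sees it as runnable and top shows ~100% CPU, even though it
     * is "just waiting". */
    static void wait_by_spinning(int fd)
    {
        while (!try_progress_once(fd))
            ;   /* burn cycles re-checking; lowest latency when data is imminent */
    }

    /* the blocking alternative for TCP: sleep in poll() until the kernel
     * says the socket is readable.  CPU drops to ~0%, but waking up costs
     * latency, and a single poll() can't also watch shared-memory queues. */
    static void wait_by_blocking(int fd)
    {
        struct pollfd pfd;
        pfd.fd = fd;
        pfd.events = POLLIN;
        while (!try_progress_once(fd))
            poll(&pfd, 1, -1);
    }

(If memory serves, setting the mpi_yield_when_idle MCA parameter makes the spin loop call sched_yield() between passes, which is friendlier on oversubscribed nodes, but it's still a spin rather than a true block, so top will still show high CPU usage.)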



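As for the original question about seeing real work as opposed to time spent waiting: top can't tell the difference while we're spinning, but it's easy enough to measure from inside the application. Here's a minimal sketch using MPI_Wtime around the barrier (compute_step() is just a made-up placeholder for your real work):

    /* Toy sketch: measure how much wall-clock time each rank spends in its
     * own work vs. waiting at a barrier. */
    #include <mpi.h>
    #include <stdio.h>

    /* dummy work so the example is self-contained */
    static void compute_step(int iter)
    {
        volatile double x = 0.0;
        int i;
        for (i = 0; i < 1000000; ++i)
            x += i * 1e-9 * (iter + 1);
    }

    int main(int argc, char **argv)
    {
        int rank, iter;
        double t0, t1, t2;
        double t_compute = 0.0, t_wait = 0.0;

        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        for (iter = 0; iter < 100; ++iter) {
            t0 = MPI_Wtime();
            compute_step(iter);
            t1 = MPI_Wtime();
            MPI_Barrier(MPI_COMM_WORLD);  /* spins here; top still says 100% */
            t2 = MPI_Wtime();

            t_compute += t1 - t0;
            t_wait    += t2 - t1;
        }

        printf("rank %d: compute %.3f s, waiting %.3f s\n",
               rank, t_compute, t_wait);
        MPI_Finalize();
        return 0;
    }

A PMPI-based profiler (mpiP, for example) can give you the same kind of breakdown without modifying the code.
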
On Jun 3, 2009, at 8:37 AM, Scott Atchley wrote:

On Jun 3, 2009, at 6:05 AM, tsi...@coas.oregonstate.edu wrote:

> Top always shows all the parallel processes at 100% in the %CPU
> field, although some of the time these must be waiting for a
> communication to complete. How can I see actual processing as
> opposed to waiting at a barrier?
>
> Thanks,
> Tiago

Using what interconnect?

For performance reasons (lower latency), the app and/or OMPI may be
polling on the completion. Are you using blocking or non-blocking
communication?

Scott



--
Jeff Squyres
Cisco Systems
