Re: [OMPI devel] Need help for semaphore in BML

Jeff Squyres Thu, 19 Apr 2007 20:39:35 -0400

On Apr 19, 2007, at 1:45 PM, [email protected] wrote:

I want to put a semaphore in bml.h (mca_bml_send), before and after calling
btl_send, so that when a process calls btl_send it first locks a global
variable X and then proceeds. Also, if an external TCP function wants to send
data, it should first lock global variable X and then proceed.
Can anyone tell me whether changing only bml.h is enough, or are there any
other files where I need to make changes?

This is likely to be a complex issue because there's the put and get functions as well. ob1 uses a fairly complex algorithm to decide when to call the bml interface functions -- I doubt that the use of a semaphore in a single location is going to do what you want.


(why a semaphore, anyway -- why not a mutex?)
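For illustration only, here is a minimal sketch of what the mutex variant of that idea looks like in plain POSIX threads. Everything in it is an assumption made for the example: `net_lock`, `fake_send`, `guarded_send` and `run_two_senders` are hypothetical names, and `fake_send` merely stands in for whatever send call would be guarded. None of it is Open MPI code.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

/* Hypothetical global lock shared by every sender (the "global variable X"). */
static pthread_mutex_t net_lock = PTHREAD_MUTEX_INITIALIZER;
static int  in_flight  = 0;   /* senders currently inside the guarded call */
static long total_sent = 0;   /* bytes "sent" so far, protected by net_lock */

/* Stand-in for the guarded send call; not an Open MPI function. */
static int fake_send(size_t len)
{
    in_flight++;
    assert(in_flight == 1);   /* holds only because callers take net_lock */
    total_sent += (long)len;
    in_flight--;
    return 0;
}

/* Lock, call the send routine, unlock on return. */
int guarded_send(size_t len)
{
    pthread_mutex_lock(&net_lock);
    int rc = fake_send(len);
    pthread_mutex_unlock(&net_lock);
    return rc;
}

/* Thread body: issue 1000 guarded sends of the given length. */
static void *sender(void *arg)
{
    size_t len = *(size_t *)arg;
    for (int i = 0; i < 1000; i++) {
        guarded_send(len);
    }
    return NULL;
}

/* Run two competing senders and return the total number of bytes sent. */
long run_two_senders(void)
{
    pthread_t a, b;
    size_t len_a = 1, len_b = 2;

    pthread_mutex_lock(&net_lock);
    total_sent = 0;
    pthread_mutex_unlock(&net_lock);

    pthread_create(&a, NULL, sender, &len_a);
    pthread_create(&b, NULL, sender, &len_b);
    pthread_join(a, NULL);
    pthread_join(b, NULL);
    return total_sent;
}
```

Build with `-pthread`. A mutex fits here because the same thread that takes the lock also releases it. A counting semaphore would only be needed if several senders were allowed in at once.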

(When I tried doing this and ran an MPI program, it gave me an ORTE timeout
error. Also, when I changed the file back to normal, it would not compile and
gave me an error in libmca_bml.la etc... Unfortunately, I deleted the entire
folder and downloaded a new version.)

Changing bml.h should have zero effect on the ORTE layer. ORTE is a whole different abstraction and wholly below the OMPI layer. There are a few places in the OMPI layer that interact with the lower ORTE layer, but the bml is not one of them.


I'm guessing that you had some other problem.

If you're going to be working continually with Open MPI, you might want to get a subversion checkout.

Can anyone please help me and tell me how I should go about implementing
locks/semaphores in the bml layer, so that all MPI processes access the lock
(with the same priority) and continue working, while TCP acquires it only when
the network is free (or there are a lot of serial operations between 2 MPI sends)?

I want to emphasize again that this won't give you what you have described in previous mails: the PML interface is designed to be asynchronous. So when you call send/put/get, it only (possibly) *starts* the communication transfer. When you unlock upon return, you're allowing the alternate communication mechanism to come in and start another communication method (via a different BTL, perhaps), but it does not change that there may still be activity occurring down in the kernel and/or hardware. Also, this scheme does not account for received message contention -- it only [tries to] account for sending contention.

So even if you get the locking working the way that you want, I don't think that you're going to get the overlap and multiplexing that you expect.
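To make the asynchrony point concrete, here is a toy model, not Open MPI code. All names (`toy_wire_t`, `toy_start_send`, `locked_start_send`, `toy_progress`) are hypothetical, and the "wire" is just an array that pretends to be the kernel/hardware. The send call only records the request and returns, so the lock is released while the data is still in flight and a second sender can start right away.

```c
#include <assert.h>
#include <pthread.h>
#include <stddef.h>

#define MAX_PENDING 8

/* Toy stand-in for the kernel/hardware side of a transport. */
typedef struct {
    size_t remaining[MAX_PENDING]; /* bytes still to move, per started transfer */
    int    count;                  /* transfers started but not yet complete */
} toy_wire_t;

static pthread_mutex_t send_lock = PTHREAD_MUTEX_INITIALIZER;

/* Like an asynchronous send: record the request and return immediately. */
static int toy_start_send(toy_wire_t *w, size_t len)
{
    assert(w != NULL);
    if (w->count == MAX_PENDING) {
        return -1;
    }
    w->remaining[w->count++] = len;
    return 0;
}

/* The proposed scheme: hold the lock only for the duration of the call. */
int locked_start_send(toy_wire_t *w, size_t len)
{
    pthread_mutex_lock(&send_lock);
    int rc = toy_start_send(w, len);
    pthread_mutex_unlock(&send_lock); /* the transfer is still in flight here */
    return rc;
}

/* Move up to `chunk` bytes of every pending transfer, drop the finished
 * ones, and return how many transfers are still in flight. */
int toy_progress(toy_wire_t *w, size_t chunk)
{
    assert(w != NULL);
    int live = 0;
    for (int i = 0; i < w->count; i++) {
        size_t step = w->remaining[i] < chunk ? w->remaining[i] : chunk;
        size_t left = w->remaining[i] - step;
        if (left > 0) {
            w->remaining[live++] = left;
        }
    }
    w->count = live;
    return live;
}
```

If one caller starts a 4096-byte transfer and another immediately starts a 1024-byte one, both calls succeed and `count` is 2: the lock serialized the two *calls*, not the two *transfers*. In this model, serializing the transfers would require holding the lock until `toy_progress` reports completion, which is not what locking around the call does.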


--
Jeff Squyres
Cisco Systems
