[OMPI users] Multi-threading with OpenMPI ?

2009-09-11 Thread Ashika Umanga Umagiliya
Greetings all, Please refer to the image at: http://i27.tinypic.com/mtqurp.jpg Here is the process illustrated in the image: 1) C++ Webservice loads the "libParallel.so" when it starts up. (dlopen) 2) When a new request comes from a client, a *new thread* is created, SOAP data is bound to C++ objects

[OMPI users] Timers

2009-09-11 Thread amjad ali
Hi all, I want to get the elapsed time from start to end of my parallel program (Open MPI based). It should always give the same time for the same problem, irrespective of whether the nodes are running some other programs or only that program. How can I do this? Regards.

Re: [OMPI users] undefined symbol error when built as a shared library

2009-09-11 Thread Jeff Squyres
On Sep 10, 2009, at 9:42 PM, Ashika Umanga Umagiliya wrote: That fixed the problem ! You are indeed a voodoo master... could you explain the spell behind your magic :) The problem has to do with how plugins (aka dynamic shared objects, DSO's) are loaded. When a DSO is loaded into a Linux

Re: [OMPI users] Timers

2009-09-11 Thread jody
Hi, I'm not sure if I completely understand your requirements, but have you tried MPI_Wtime? Jody On Fri, Sep 11, 2009 at 7:54 AM, amjad ali wrote: > Hi all, > I want to get the elapsed time from start to end of my parallel program > (OPENMPI based). It should give same time
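
A minimal sketch of the distinction the question above turns on, assuming a POSIX system. MPI_Wtime returns wall-clock seconds, so an elapsed time measured with it grows when other programs compete for the node; a per-process CPU clock does not count time spent waiting. The sketch uses plain POSIX clocks rather than MPI so that it builds without an MPI installation. The helper names (wall_seconds, cpu_seconds, busy_work, idle_ms) are hypothetical and are not part of Open MPI.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <assert.h>
#include <time.h>

/* Wall-clock seconds from a monotonic clock. This is the same kind of
 * quantity MPI_Wtime reports: elapsed real time, not CPU time. */
double wall_seconds(void)
{
    struct timespec ts;
    int rc = clock_gettime(CLOCK_MONOTONIC, &ts);
    assert(rc == 0);
    (void)rc;
    return (double)ts.tv_sec + (double)ts.tv_nsec * 1e-9;
}

/* CPU seconds consumed by this process only; time spent sleeping or
 * waiting for the CPU is not counted. */
double cpu_seconds(void)
{
    struct timespec ts;
    int rc = clock_gettime(CLOCK_PROCESS_CPUTIME_ID, &ts);
    assert(rc == 0);
    (void)rc;
    return (double)ts.tv_sec + (double)ts.tv_nsec * 1e-9;
}

/* Hypothetical stand-in for the real computation: sums 0.5 * i for
 * i in [0, n). */
double busy_work(long n)
{
    volatile double acc = 0.0;
    for (long i = 0; i < n; ++i) {
        acc += (double)i * 0.5;
    }
    return acc;
}

/* Sleep for ms milliseconds, imitating time lost while the process is
 * not running: wall time advances, CPU time barely moves. */
void idle_ms(long ms)
{
    struct timespec req;
    req.tv_sec = ms / 1000;
    req.tv_nsec = (ms % 1000) * 1000000L;
    nanosleep(&req, NULL);
}
```

To use it, read both clocks before and after the region of interest and subtract. In an MPI program the wall_seconds() calls would be replaced by MPI_Wtime(), commonly with an MPI_Barrier before each reading so that all ranks time the same region. Neither clock guarantees identical numbers from run to run; the CPU clock is simply less sensitive to other load on the node.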

[OMPI users] Bad MPI_Bcast behaviour when running over openib

2009-09-11 Thread Ake Sandgren
Hi! The following code shows bad behaviour when running over openib. Open MPI: 1.3.3 With openib it dies with "error polling HP CQ with status WORK REQUEST FLUSHED ERROR status number 5", with tcp or shmem it works as expected. #include #include #include #include "mpi.h" int main(int

Re: [OMPI users] undefined symbol error when built as a shared library

2009-09-11 Thread Reuti
Am 11.09.2009 um 12:14 schrieb Jeff Squyres: On Sep 10, 2009, at 9:42 PM, Ashika Umanga Umagiliya wrote: That fixed the problem ! You are indeed a voodoo master... could you explain the spell behind your magic :) The problem has to do with how plugins (aka dynamic shared objects, DSO's)

Re: [OMPI users] undefined symbol error when built as a shared library

2009-09-11 Thread Jeff Squyres
On Sep 11, 2009, at 7:26 AM, Reuti wrote: > OMPI's --disable-dlopen configure option has Open MPI build in a > different way. Aha - this might also explain what I faced some time ago. I tried to compile an application called Molpro with GlobalArrays which I compiled with Open MPI. I faced

Re: [OMPI users] Bad MPI_Bcast behaviour when running over openib

2009-09-11 Thread Rolf Vandevaart
Hi, how exactly do you run this to get this error? I tried and it worked for me. burl-ct-x2200-16 50 =>mpirun -mca btl_openib_warn_default_gid_prefix 0 -mca btl self,sm,openib -np 2 -host burl-ct-x2200-16,burl-ct-x2200-17 -mca btl_openib_ib_timeout 16 a.out I am 0 at 1252670691 I am 1 at

Re: [OMPI users] Bad MPI_Bcast behaviour when running over openib

2009-09-11 Thread Jeff Squyres
Cisco is no longer an IB vendor, but I seem to recall that these kinds of errors typically indicated a fabric problem. Have you run layer 0 and 1 diagnostics to ensure that the fabric is clean? On Sep 11, 2009, at 8:09 AM, Rolf Vandevaart wrote: Hi, how exactly do you run this to get

[OMPI users] application hangs when checkpointing application

2009-09-11 Thread Jean Potsam
Hi Everyone, I wrote a small program with a function to trigger the checkpointing mechanism as follows: #include #include #include #include #include void trigger_checkpoint(); int main(int argc, char **argv) { int

Re: [OMPI users] Bad MPI_Bcast behaviour when running over openib

2009-09-11 Thread Ake Sandgren
On Fri, 2009-09-11 at 13:18 +0200, Ake Sandgren wrote: > Hi! > > The following code shows a bad behaviour when running over openib. Oops. Red Face big time. I happened to run the IB test between two systems that don't have IB connectivity. Goes and hides in a dark corner... -- Ake Sandgren,

[OMPI users] Application hangs when checkpointing application (update)

2009-09-11 Thread Jean Potsam
Hi Everyone, I noticed that it hangs just before displaying the following while trying to checkpoint the application. [sun06:15252] orte_checkpoint: notify_hnp: Requested a checkpoint of jobid [INVALID] ### Can it be

[OMPI users] OpenMPI on OS X - file is not of required architecture

2009-09-11 Thread Andreas Haselbacher
I've built Open MPI version 1.3.3 on a Mac Pro with OS X 10.5.8 and the Intel 10.1.006 Fortran compiler and gcc 4.0. As far as I can tell, the configure and make commands completed fine. There are some warnings, but it's not clear to me that they are critical - or the explanation for what's not

Re: [OMPI users] Disable use of Torque at run-time

2009-09-11 Thread jgans
Hi Ralph, Thank you for your help. This is exactly what I wanted! Regards, Jason Ralph Castain wrote: Hmmm...well, here is one way to do it: mpirun -n 1 -host n0 ./master_worker : -n N-1 -host +e ./master_worker What this will do is put rank 0 on the first node in your allocation, and then

Re: [OMPI users] OpenMPI on OS X - file is not of required architecture

2009-09-11 Thread Jeff Squyres
On Sep 11, 2009, at 10:05 AM, Andreas Haselbacher wrote: I've built openmpi version 1.3.3 on a MacPro with OS X 10.5.8 and the Intel 10.1.006 Fortran compiler and gcc 4.0. As far as I can tell, the configure and make commands completed fine. There are some warnings, but it's not clear to

Re: [OMPI users] OpenMPI on OS X - file is not of required architecture

2009-09-11 Thread Andreas Haselbacher
On Fri, Sep 11, 2009 at 5:10 PM, Jeff Squyres wrote: > On Sep 11, 2009, at 10:05 AM, Andreas Haselbacher wrote: > > I've built openmpi version 1.3.3 on a MacPro with OS X 10.5.8 and the >> Intel 10.1.006 Fortran compiler and gcc 4.0. As far as I can tell, the >> configure

Re: [OMPI users] OpenMPI on OS X - file is not of required architecture

2009-09-11 Thread Doug Reeder
Andreas, Have you checked that ifort is creating 64-bit objects? If I remember correctly, with 10.1 the default was to create 32-bit objects. Doug Reeder On Sep 11, 2009, at 3:25 PM, Andreas Haselbacher wrote: On Fri, Sep 11, 2009 at 5:10 PM, Jeff Squyres wrote: On