Re: [OMPI users] deadlock in openmpi 1.5rc5

2010-08-09 Thread Jeff Squyres
gv=0x7fff6e0c09b8) > at > /wg/stor5/wgsim/hsu/projects/cturtle_mpi/wg-ros-pkg-unreleased/stacks/mpi/mpi_test/src/mpi_test.cpp:38 > (gdb) > > > > > On Mon, Aug 9, 2010 at 6:39 AM, Jeff Squyres <jsquy...@cisco.com> wrote: > In your first mail, you mentioned

Re: [OMPI users] MPI_Bcast issue

2010-08-10 Thread Jeff Squyres
bug? >>> >> >>> >> Is there any multicast or N to N broadcast where sender processes can >>> >> avoid participating when they don’t need to? >>> >> >>> >> Thanks in advance >>> >> Randolph >>> >> >>> >> >>> >> >>> >> ___ >>> >> users mailing list >>> >> us...@open-mpi.org >>> >> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> > >>> >>> -- >>> Sent from my mobile device >>> >>> David Zhang >>> University of California, San Diego >>> >>> ___ >>> users mailing list >>> us...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >>> ___ >>> users mailing list >>> us...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/users >> >> >> -Inline Attachment Follows- >> >> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users >> >> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > > -Inline Attachment Follows- > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] MPI_Bcast issue

2010-08-11 Thread Jeff Squyres
(I'll say that OMPI's ALLGATHER algorithm is probably not well optimized for massive data transfers like you describe) -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] MPI_Bcast issue

2010-08-11 Thread Jeff Squyres
On Aug 11, 2010, at 9:54 AM, Jeff Squyres wrote: > (I'll say that OMPI's ALLGATHER algorithm is probably not well optimized for > massive data transfers like you describe) Wrong wrong wrong -- I should have checked the code before sending. I made the incorrect assumption that OMPI stil

Re: [OMPI users] MPI_Bcast issue

2010-08-11 Thread Jeff Squyres
gorithms allow themselves to be used (e.g., O(N) may not matter if there's a small number of peers with small messages). -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] MPI_Bcast issue

2010-08-12 Thread Jeff Squyres
P963 -- 2455 South Road -- Poughkeepsie, NY 12601 > > Tele (845) 433-7846 Fax (845) 433-8363 > > > > -Inline Attachment Follows- > > > ___ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > > > ___ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] problem with .bashrc stetting of openmpi

2010-08-13 Thread Jeff Squyres
loading shared libraries: libmpi.so.0: cannot open > shared object file: No such file or directory > > > > Thanks for any help. > Best regards, > Sunita > > ___ > users mailing

Re: [OMPI users] Abort

2010-08-13 Thread Jeff Squyres
t have their coredumpsize automatically reset to 0, etc. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Abort

2010-08-16 Thread Jeff Squyres
il thread, it's not entirely clear: are you calling abort() or MPI_Abort()? MPI_Abort() won't drop a core file. abort() should. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] [openib] segfault when using openib btl

2010-08-16 Thread Jeff Squyres
>>>>>>>> parallel computation involving the openib btl and > > > > >>>>>>>> OpenMPI-1.4.2 (the same issue can be observed with > > > > >>>>>>>> OpenMPI-1.3.3). > > > > >>>>>>>> > > > > >>>>>>>>mpirun (Open MPI) 1.4.2 > > > > >>>>>>>>Report bugs to http://www.open-mpi.org/community/help/ > > > > >>>>>>>>[pbn08:02624] *** Process received signal *** > > > > >>>>>>>>[pbn08:02624] Signal: Segmentation fault (11) > > > > >>>>>>>>[pbn08:02624] Signal code: Address not mapped (1) > > > > >>>>>>>>[pbn08:02624] Failing at address: (nil) > > > > >>>>>>>>[pbn08:02624] [ 0] /lib64/libpthread.so.0 [0x349540e4c0] > > > > >>>>>>>>[pbn08:02624] *** End of error message *** > > > > >>>>>>>>sh: line 1: 2624 Segmentation fault > > > > >>>>>>>> > > > > >>>>>>>> \/share\/hpc3\/actran_suite\/Actran_11\.0\.rc2\.41872\/RedHatE > > > > >>>>>>>> L\ -5 \/ x 86 _6 4\ /bin\/actranpy_mp > > > > >>>>>>>> '--apl=/share/hpc3/actran_suite/Actran_11.0.rc2.41872/RedHatEL > > > > >>>>>>>> -5 /x 86 _ 64 /A c tran_11.0.rc2.41872' > > > > >>>>>>>> '--inputfile=/work/st25652/LSF_130073_0_47696_0/Case1_3Dreal_m > > > > >>>>>>>> 4_ n2 .d a t' > > > > >>>>>>>> '--scratch=/scratch/st25652/LSF_130073_0_47696_0/scratch' > > > > >>>>>>>> '--mem=3200' '--threads=1' '--errorlevel=FATAL' '--t_max=0.1' > > > > >>>>>>>> '--parallel=domain' > > > > >>>>>>>> > > > > >>>>>>>> If I choose not to use the openib btl (by using --mca btl > > > > >>>>>>>> self,sm,tcp on the command line, for instance), I don't > > > > >>>>>>>> encounter any problem and the parallel computation runs > > > > >>>>>>>> flawlessly. > > > > >>>>>>>> > > > > >>>>>>>> I would like to get some help to be able: > > > > >>>>>>>> - to diagnose the issue I'm facing with the openib btl > > > > >>>>>>>> - understand why this issue is observed only when using the > > > > >>>>>>>> openib btl and not when using self,sm,tcp > > > > >>>>>>>> > > > > >>>>>>>> Any help would be very much appreciated. > > > > >>>>>>>> > > > > >>>>>>>> The outputs of ompi_info and the configure scripts of OpenMPI > > > > >>>>>>>> are enclosed to this email, and some information on the > > > > >>>>>>>> infiniband drivers as well. > > > > >>>>>>>> > > > > >>>>>>>> Here is the command line used when launching a parallel > > > > >>>>>>>> computation > > > > >>>>>>>> > > > > >>>>>>>> using infiniband: > > > > >>>>>>>>path_to_openmpi/bin/mpirun -np $NPROCESS --hostfile > > > > >>>>>>>>host.list --mca > > > > >>>>>>>> > > > > >>>>>>>> btl openib,sm,self,tcp --display-map --verbose --version > > > > >>>>>>>> --mca mpi_warn_on_fork 0 --mca btl_openib_want_fork_support 0 > > > > >>>>>>>> [...] > > > > >>>>>>>> > > > > >>>>>>>> and the command line used if not using infiniband: > > > > >>>>>>>>path_to_openmpi/bin/mpirun -np $NPROCESS --hostfile > > > > >>>>>>>>host.list --mca > > > > >>>>>>>> > > > > >>>>>>>> btl self,sm,tcp --display-map --verbose --version --mca > > > > >>>>>>>> mpi_warn_on_fork 0 --mca btl_openib_want_fork_support 0 [...] > > > > >>>>>>>> > > > > >>>>>>>> Thanks, > > > > >>>>>>>> Eloi > > > > >>>>>> > > > > >>>>>> ___ > > > > >>>>>> users mailing list > > > > >>>>>> us...@open-mpi.org > > > > >>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/users > > -- > > > Eloi Gaudry > > Free Field Technologies > Company Website: http://www.fft.be > Company Phone: +32 10 487 959 > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Abort

2010-08-16 Thread Jeff Squyres
he entire MPI job. abort() drops core when I'm running > on 1 processor, but not in a multiprocessor run. In addition, a node > calling abort() doesn't lead to the entire run being killed off. > > David > O > n Mon, 2010-08-16 at 08:51 -0700, Jeff Squyres wrote: >> On Aug 13,

Re: [OMPI users] [openib] segfault when using openib btl

2010-08-16 Thread Jeff Squyres
gt;>>>> switch to the basic linear algorithm. Anyway whatever the >>>>>>>>>>>> algorithm used, the segmentation fault remains. >>>>>>>>>>>> >>>>>>>>>>>> Does anyone could give some advice on ways to diagnose the issue >>>>>>>>>>>> I'm facing ? >>>>>>>>>>>> >>>>>>>>>>>> Regards, >>>>>>>>>>>> Eloi >>>>>>>>>>>> >>>>>>>>>>>> On Monday 12 July 2010 10:53:58 Eloi Gaudry wrote: >>>>>>>>>>>>> Hi, >>>>>>>>>>>>> >>>>>>>>>>>>> I'm focusing on the MPI_Bcast routine that seems to randomly >>>>>>>>>>>>> segfault when using the openib btl. I'd like to know if there >>>>>>>>>>>>> is any way to make OpenMPI switch to a different algorithm >>>>>>>>>>>>> than the default one being selected for MPI_Bcast. >>>>>>>>>>>>> >>>>>>>>>>>>> Thanks for your help, >>>>>>>>>>>>> Eloi >>>>>>>>>>>>> >>>>>>>>>>>>> On Friday 02 July 2010 11:06:52 Eloi Gaudry wrote: >>>>>>>>>>>>>> Hi, >>>>>>>>>>>>>> >>>>>>>>>>>>>> I'm observing a random segmentation fault during an internode >>>>>>>>>>>>>> parallel computation involving the openib btl and >>>>>>>>>>>>>> OpenMPI-1.4.2 (the same issue can be observed with >>>>>>>>>>>>>> OpenMPI-1.3.3). >>>>>>>>>>>>>> >>>>>>>>>>>>>>mpirun (Open MPI) 1.4.2 >>>>>>>>>>>>>>Report bugs to http://www.open-mpi.org/community/help/ >>>>>>>>>>>>>>[pbn08:02624] *** Process received signal *** >>>>>>>>>>>>>>[pbn08:02624] Signal: Segmentation fault (11) >>>>>>>>>>>>>>[pbn08:02624] Signal code: Address not mapped (1) >>>>>>>>>>>>>>[pbn08:02624] Failing at address: (nil) >>>>>>>>>>>>>>[pbn08:02624] [ 0] /lib64/libpthread.so.0 [0x349540e4c0] >>>>>>>>>>>>>>[pbn08:02624] *** End of error message *** >>>>>>>>>>>>>>sh: line 1: 2624 Segmentation fault >>>>>>>>>>>>>> >>>>>>>>>>>>>> \/share\/hpc3\/actran_suite\/Actran_11\.0\.rc2\.41872\/RedHatE >>>>>>>>>>>>>> L\ -5 \/ x 86 _6 4\ /bin\/actranpy_mp >>>>>>>>>>>>>> '--apl=/share/hpc3/actran_suite/Actran_11.0.rc2.41872/RedHatEL >>>>>>>>>>>>>> -5 /x 86 _ 64 /A c tran_11.0.rc2.41872' >>>>>>>>>>>>>> '--inputfile=/work/st25652/LSF_130073_0_47696_0/Case1_3Dreal_m >>>>>>>>>>>>>> 4_ n2 .d a t' >>>>>>>>>>>>>> '--scratch=/scratch/st25652/LSF_130073_0_47696_0/scratch' >>>>>>>>>>>>>> '--mem=3200' '--threads=1' '--errorlevel=FATAL' '--t_max=0.1' >>>>>>>>>>>>>> '--parallel=domain' >>>>>>>>>>>>>> >>>>>>>>>>>>>> If I choose not to use the openib btl (by using --mca btl >>>>>>>>>>>>>> self,sm,tcp on the command line, for instance), I don't >>>>>>>>>>>>>> encounter any problem and the parallel computation runs >>>>>>>>>>>>>> flawlessly. >>>>>>>>>>>>>> >>>>>>>>>>>>>> I would like to get some help to be able: >>>>>>>>>>>>>> - to diagnose the issue I'm facing with the openib btl >>>>>>>>>>>>>> - understand why this issue is observed only when using the >>>>>>>>>>>>>> openib btl and not when using self,sm,tcp >>>>>>>>>>>>>> >>>>>>>>>>>>>> Any help would be very much appreciated. >>>>>>>>>>>>>> >>>>>>>>>>>>>> The outputs of ompi_info and the configure scripts of OpenMPI >>>>>>>>>>>>>> are enclosed to this email, and some information on the >>>>>>>>>>>>>> infiniband drivers as well. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Here is the command line used when launching a parallel >>>>>>>>>>>>>> computation >>>>>>>>>>>>>> >>>>>>>>>>>>>> using infiniband: >>>>>>>>>>>>>>path_to_openmpi/bin/mpirun -np $NPROCESS --hostfile >>>>>>>>>>>>>>host.list --mca >>>>>>>>>>>>>> >>>>>>>>>>>>>> btl openib,sm,self,tcp --display-map --verbose --version >>>>>>>>>>>>>> --mca mpi_warn_on_fork 0 --mca btl_openib_want_fork_support 0 >>>>>>>>>>>>>> [...] >>>>>>>>>>>>>> >>>>>>>>>>>>>> and the command line used if not using infiniband: >>>>>>>>>>>>>>path_to_openmpi/bin/mpirun -np $NPROCESS --hostfile >>>>>>>>>>>>>>host.list --mca >>>>>>>>>>>>>> >>>>>>>>>>>>>> btl self,sm,tcp --display-map --verbose --version --mca >>>>>>>>>>>>>> mpi_warn_on_fork 0 --mca btl_openib_want_fork_support 0 [...] >>>>>>>>>>>>>> >>>>>>>>>>>>>> Thanks, >>>>>>>>>>>>>> Eloi >>>>>>>>>>>> ___ >>>>>>>>>>>> users mailing list >>>>>>>>>>>> us...@open-mpi.org >>>>>>>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> -- >>> >>> >>> Eloi Gaudry >>> >>> Free Field Technologies >>> Company Website: http://www.fft.be >>> Company Phone: +32 10 487 959 >>> >>> ___ >>> users mailing list >>> us...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/user > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Does OpenMPI 1.4.1 support the MPI_IN_PLACE designation ...

2010-08-16 Thread Jeff Squyres
ies.c:179: > restoreTL: Assertion `n >= 0 && n < rl->max' failed. " > > Any help is appreciated, > > Thanks, > > -- > Gokhan > > Think green before you print this email. > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] mem lock limit & ulimit

2010-08-18 Thread Jeff Squyres
suggestions, it would be appreciated. >> Many thanks. >> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] [openib] segfault when using openib btl

2010-08-18 Thread Jeff Squyres
not believe that the IB/iWARP vendors have cleaned up the openib BTL much in this regard recently. But then again, we branched for v1.3 a long time ago, so I don't remember offhand if any valgrind cleanups occurred since then... -- Jeff Squyres jsquy...@cisco.com For corporate legal information

Re: [OMPI users] Open-MPI 1.4.2 : mpirun core-dumps when "-npernode N" is used at command line

2010-08-23 Thread Jeff Squyres
ashes. > > Is this a known issue? I found related problem (of around May, 2010) when > people were using the same option but in a SLURM environment. > > regards > > Michael > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] is there a way to bring to light _all_ configure options in a ready installation?

2010-08-24 Thread Jeff Squyres
; -- > Dipl.-Inform. Paul Kapinos - High Performance Computing, > RWTH Aachen University, Center for Computing and Communication > Seffenter Weg 23, D 52074 Aachen (Germany) > Tel: +49 241/80-24915 > _______ > users mailing list > us...

Re: [OMPI users] Open-MPI 1.4.2 : mpirun core-dumps when "-npernode N" is used at command line

2010-08-24 Thread Jeff Squyres
gt; |> >> |> but it failed. I am attaching the configure and make logs. >> |> >> |> regards >> |> >> |> Michael >> |> >> |> >> |> On 08/23/10 20:53, Ralph Castain wrote: >> |>> >> |>&g

Re: [OMPI users] IMB-MPI broadcast test stalls for large core counts: debug ideas?

2010-08-24 Thread Jeff Squyres
ariable. i.e. 10GigE might be eth0 on one and > eth2 on the next. In case this matters. I was told this shouldn't be > an issue. Are all the eth0's on one subnet and all the eth2's on a different subnet? Or are all eth0's and eth2's all on the same subnet? -- Jeff Squyres jsquy...@cisco

Re: [OMPI users] IMB-MPI broadcast test stalls for large core counts: debug ideas?

2010-08-25 Thread Jeff Squyres
all 1GigE's are on 10.0.x.x It would simplify testing if you could get all the eth0's to be of one type and on the same subnet, and the same for eth1. Once you do that, try using just one of the networks by telling OMPI to use only one of the devices, something like this: mpirun --mca btl_tcp_if

Re: [OMPI users] Problem including C MPI code from C++ using C linkage

2010-09-02 Thread Jeff Squyres
se then your is safe for both C and C++, and you keep contained inside it (assumedly preserving some abstraction barriers in your code by keeping the MPI prototypes bundled with ). -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] spin-wait backoff

2010-09-03 Thread Jeff Squyres
a long time but never implemented. Are your jobs hanging because of deadlock (i.e., application error), or infrastructure error? If they're hanging because of deadlock, there are some PMPI-based tools that might be able to help. -- Jeff Squyres jsquy...@cisco.com For corporate legal i

Re: [OMPI users] compiler upgrades require openmpi rebuild?

2010-09-03 Thread Jeff Squyres
easiest / least total time spent to just build another copy of OMPI against 10.8. :-\ -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Memory allocation error when linking with MPI libraries

2010-09-03 Thread Jeff Squyres
So setenv'ing it in your current shell and then submitting to the scheduler might be sufficient (e.g,. Torque and SLURM do this). -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] users Digest, Vol 1674, Issue 1

2010-09-09 Thread Jeff Squyres
>> > > > > > > ___ >>> > users mailing list >>> > us...@open-mpi.org >>> > http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >> -- next part -- >> HTML attachment scrubbed and removed >> > > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Strange Segmentation Fault inside MPI_Init

2010-09-11 Thread Jeff Squyres
, argv=0x7fffe008) > at test.c:4 > > I can't figure out what's going on here! It says MPI_Init is segfaulting, but > I think it is probably some kind of misconfiguration. > I have tried reinstalling the openmpi package. I have an AMD Turion X2 > M500(64 bit) process

Re: [OMPI users] send and receive buffer the same on root

2010-09-16 Thread Jeff Squyres
us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] BTL layer

2010-09-22 Thread Jeff Squyres
t-to-point functions (MPI_SEND and the like). They may *also* act as the lowest layers for all of the collective functions. Some collective algorithms call MPI_SEND to do their individual sends, for example. Others may invoke BTL send/receive functions directly. Others don't use the BTLs at all. -

Re: [OMPI users] multipath support for infiniband

2010-09-22 Thread Jeff Squyres
as to enable automatic path migration over > lmc, but this is only for failover, if I remember rightly. > > Regards, > Jens > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users --

Re: [OMPI users] PathScale problems persist

2010-09-22 Thread Jeff Squyres
PI 1.4.2 is built with GCC (GCC 4.1.2). > No problems are found with Open MPI 1.2.6 and PathScale either. > > Best regards, > > Rafa > > -- > Rafael Arco Arredondo > Centro de Servicios de Informática y Redes de Comunicaciones > Campus de Fuentenueva - Edificio Mecenas > Universidad de Granada > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] OpenMPI on the ARM processor architecture?

2010-09-22 Thread Jeff Squyres
uld be welcome! :-) I believe that the first step would be to get some assembly for the ARM platform for some of OMPI's key routines (locks, atomics, etc.). Beyond that, it *might* "just work"...? -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] multipath support for infiniband

2010-09-22 Thread Jeff Squyres
matter), you may not see much difference. That being said, if you were watching OMPI's behavior with regard to LIDs and only saw the base LIDs go by, then perhaps something is wrong. Shrug. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.c

Re: [OMPI users] MPI_Reduce performance

2010-09-22 Thread Jeff Squyres
ne > to know the specifics. FWIW: here's a brief writeup of MPI_Ibarrier - http://blogs.cisco.com/ciscotalk/performance/comments/ibarrier/ -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Continued functionality across a SLES10 to SLES11 upgrade ...

2010-09-22 Thread Jeff Squyres
does, relax, have a beer^H^H^H^Hnon-cafinated tea - if it does not work, you may have to go the recompile-everything route -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Continued functionality across a SLES10 to SLES11 upgrade ...

2010-09-22 Thread Jeff Squyres
erious ways (e.g., if a struct changed size somewhere that OMPI uses to interact with the system, then it may work in some cases and may fail in other seemingly unrelated cases. Such things are a total bugger to track down). -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to

Re: [OMPI users] function fgets hangs a mpi program when it is used ompi-ps command

2010-09-22 Thread Jeff Squyres
puts (mystring); > } > pclose (pFile); >} > } > > MPI_Finalize(); >return 0; > } > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] OpenMPI on the ARM processor architecture?

2010-09-22 Thread Jeff Squyres
Yes, the built-in GCC atomics might work. I don't know if anyone has tried them; they would be most useful because they would allow us to use multiple different platforms. Patches would definitely be appreciated here. On Sep 22, 2010, at 12:25 PM, Dave Love wrote: > Jeff Squyres <

Re: [OMPI users] BTL layer

2010-09-22 Thread Jeff Squyres
that will do all the same things that MPI_SEND does: eager sends for short messages, striping across multiple devices, using RDMA when appropriate, etc. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] PathScale problems persist

2010-09-23 Thread Jeff Squyres
> > On Wed, 2010-09-22 at 14:54 +0200, Ake Sandgren wrote: >> On Wed, 2010-09-22 at 14:16 +0200, Ake Sandgren wrote: >>> On Wed, 2010-09-22 at 07:42 -0400, Jeff Squyres wrote: >>>> This is a problem with the Pathscale compiler and old versions of >> GCC. See: >&

Re: [OMPI users] Question about Asynchronous collectives

2010-09-23 Thread Jeff Squyres
on-blocking collectives on a single communicator -- it doesn't matter if they are the same or different collective operations. They will each be unique instances and will be satisfied in order. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Question about Asynchronous collectives

2010-09-23 Thread Jeff Squyres
1: MPI_IBcast(MPI_COMM_WORLD, request_2) // first Bcast MPI_IBcast(MPI_COMM_WORLD, request_1) // second Bcast MPI_Wait(_1, ...); MPI_Wait(_2, ...); That may/will deadlock. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Question about Asynchronous collectives

2010-09-23 Thread Jeff Squyres
Wait call will > simply find that the first MPI_Ibcast is already done. The second MPI_Wait > call becomes, effectively, a query function. Er -- yes, you're right. I was incorrectly thinking about overtaking issues -- but they don't apply here. -- Jeff Squyres jsquy...@cisco.com For corporate le

Re: [OMPI users] How to know which process is running on which core?

2010-09-24 Thread Jeff Squyres
> ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/list

Re: [OMPI users] Display in terminal of error message using throw std::runtime_error on distant node...

2010-09-24 Thread Jeff Squyres
t; > ( I don't think these information are relevant here, but just in case: I am > using openMPI 1.4.2, on a Mandriva 2008 system ) > > Thanks in advance for any help/info/advice. > > Olivier > ___ > users mailing li

Re: [OMPI users] How to know which process is running on which core?

2010-09-24 Thread Jeff Squyres
uses hwloc internally, but you could also link your application against hwloc and call its C functions to get information about your process' locality, etc. On Sep 24, 2010, at 8:14 AM, Jeff Squyres wrote: > On the OMPI SVN trunk, we have an "Open MPI extension" call named > O

Re: [OMPI users] Porting Open MPI to ARM: How essential is the opal_sys_timer_get_cycles() function?

2010-09-27 Thread Jeff Squyres
On Sep 23, 2010, at 1:24 PM, Ken Mighell wrote: > Would a hack written in C suffice? Assembly is always better, but C should be fine. If you really want to, could you write it in C and have the compiler generate optimized assembly for you. -- Jeff Squyres jsquy...@cisco.com For corpor

Re: [OMPI users] Potential developer to reinstate Xgrid support

2010-10-01 Thread Jeff Squyres
0D > 2401 E. Pilot Plant Rd. M/S 1109 > China Lake, CA 93555 > daniel.bea...@navy.mil > (760)939-7097 > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squy

Re: [OMPI users] location of ompi libraries

2010-10-05 Thread Jeff Squyres
__ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] hdf5 build error using openmpi and Intel Fortran

2010-10-06 Thread Jeff Squyres
trip these from the command. This might well be directly related to the version of Libtool that HDF bootstraps their build process with -- you really should talk to them. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] hdf5 build error using openmpi and Intel Fortran

2010-10-08 Thread Jeff Squyres
nd. Strange. Be sure to see: http://www.open-mpi.org/faq/?category=mpi-apps#cant-use-wrappers -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Pros and cons of --enable-heterogeneous

2010-10-08 Thread Jeff Squyres
's not too much that OMPI can do there. The heterogeneous flag is more traditionally used for endian flipping kinds of things. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Pros and cons of --enable-heterogeneous

2010-10-08 Thread Jeff Squyres
e I think we have done basic testing to ensure this stuff works, but it is used so infrequently (in reality, most people don't mix and match too often) that we don't stress test it, nor do we test it in exotic environments. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to

Re: [OMPI users] Pros and cons of --enable-heterogeneous

2010-10-08 Thread Jeff Squyres
On Oct 8, 2010, at 2:21 PM, Jeff Squyres wrote: > No. Open MPI does not have MPD -- I think you're thinking of a different MPI > implementation. :-) Someone pointed out to me off-list that I wasn't quite clear here... *If* all of your architectures are the same endian and oth

Re: [OMPI users] nonblocking send/receive question

2010-10-12 Thread Jeff Squyres
s doing this >> from the glut display call back causing the problem? Any suggestions would >> be greatly appreciated. >> Thanks, >> Ed >> >> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > >

Re: [OMPI users] nonblocking send/receive question

2010-10-12 Thread Jeff Squyres
roblem was but it wasn't the called to MPI_Send > blocking. > Ed > > > From: users-boun...@open-mpi.org on behalf of Jeff Squyres > Sent: Tue 10/12/2010 6:52 AM > To: Open MPI Users > Subject: Re: [OMPI users] nonblocking send/receive question > > On Oct 11, 2010, at 1

Re: [OMPI users] Send data structures

2010-10-12 Thread Jeff Squyres
er, PACKBUFFER_SIZE, , MPI_COMM_WORLD); > MPI_PACK(data->subData3->data1, sizeof(data->subDat2->data1), MPI_FLOAT, > packBuffer, PACKBUFFER_SIZE, , MPI_COMM_WORLD); > MPI_Send(packBuffer, packSize, MPI_PACKED, 1, 100, , MPI_COMM_WORLD); > > Ed > > >

Re: [OMPI users] Bug in MPI_Allreduce man page

2010-10-12 Thread Jeff Squyres
> root process and a "root" parameter name (which doesn't exist for > MPI_Allreduce). Should I add this to Trac? > > -- Jeremiah Willcock > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/m

Re: [OMPI users] Problem in installing; lib files missing

2010-10-14 Thread Jeff Squyres
cluster. See http://www.open-mpi.org/faq/?category=building#where-to-install. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] How to time data transfers?

2010-10-14 Thread Jeff Squyres
tion* that the process will never move and might be able to use the native x86 timer (there's some complications here, but we might be able to figure it out). Bad Things could happen if the process ever moved, though (e.g., if the application ever manually changed the binding). -- Jeff Squyres j

Re: [OMPI users] Problem with MPI_Comm_accept in a dynamic client/server application

2010-10-14 Thread Jeff Squyres
have > found had to do with different version of OpenMPI installed, but I only have > one version, and I believe it is the one being used. > > Thank you, > Kalin > _______ > users mailing list > us...@open-mpi.org >

Re: [OMPI users] Build failure with OMPI-1.5 (clang-2.8, gcc-4.5.1 with debug options)

2010-10-14 Thread Jeff Squyres
going through the pain right now of upgrading to Autoconf 2.68 / Libtool 2.4; hopefully we'll have fixes on the trunk RSN for the newest-of-the-new autotools. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Build failure with OMPI-1.5 (clang-2.8, gcc-4.5.1 with debug options)

2010-10-14 Thread Jeff Squyres
explicitly listed in the link line, which should contain MPI_Abort. Can you nm on that file and ensure that it is actually listed there? VT guys -- any ideas here? -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Build failure with OMPI-1.5 (clang-2.8, gcc-4.5.1 with debug options)

2010-10-14 Thread Jeff Squyres
On Oct 14, 2010, at 5:38 PM, Jed Brown wrote: > $ nm -D > /home/jed/src/openmpi-1.5/bclang/ompi/contrib/vt/vt/../../../.libs/libmpi.so > |grep MPI_Abort > 00074380 T PMPI_Abort > > In contrast, with gcc: > > $ nm -D >

Re: [OMPI users] self-messaging questions

2010-10-15 Thread Jeff Squyres
d-to-self case, but it should perform well. You might want to benchmark memcpy vs. MPI_Sendrecv() to self (for example) and see if there's a noticeable difference. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] MPI_STARTALL doesn't compile with mpif90 and a multi-dimensional request array

2010-10-15 Thread Jeff Squyres
iting MPI programs > in Fortran90. >From my understanding of Fortran, that would violate the MPI spec. You could, I think, use an array subsection when you call MPI_STARALL that would give you a 1D array of integers, right? -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] MPI_STARTALL doesn't compile with mpif90 and a multi-dimensional request array

2010-10-15 Thread Jeff Squyres
mo' betta than the existing MPI F90 bindings. We plan to prototype the F08 module in OMPI when we finish specifying it in the Forum, and revamp the F90 support in OMPI at more-or-less the same time. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Typo in man page for MPI_File_iwrite_at

2010-10-18 Thread Jeff Squyres
the rendered text. > > -- Jeremiah Willcock > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisc

Re: [OMPI users] openmpi 1.5 build from rpm fails: --program-prefix now checked in configure

2010-10-19 Thread Jeff Squyres
rogram-suffix or --program-transform-name. Users > are recommended to instead use --prefix with a unique directory and make > symbolic links as desired for renaming." >&2;} > > If I remove the new control on --program-prefix in openmpi-1.5 configure > script, the 1.5 build

Re: [OMPI users] Open MPI dynamic data structure error

2010-10-20 Thread Jeff Squyres
______ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Question about MPI_Barrier

2010-10-21 Thread Jeff Squyres
pty machines? -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] segmentation fault in mpiexec (Linux, Oracle/Sun C)

2010-10-21 Thread Jeff Squyres
. > > --td > > > On 10/21/2010 06:47 AM, Terry Dontje wrote: >> On 10/21/2010 06:43 AM, Jeff Squyres (jsquyres) wrote: >>> Also, i'm not entirely sure what all the commands are that you are showing. >>> Some of those warnings (eg in config.log) are normal. >

Re: [OMPI users] Question about MPI_Barrier

2010-10-21 Thread Jeff Squyres
code A seems to run faster. I do have a > lot of communications in code B too. It involves 500 procs. I had thought > code B should have no effect on the time spent on code A if I use MPI_Barrier. > > Linbao > On Thu, Oct 21, 2010 at 5:17 AM, Jeff Squyres <jsquy...@cisco.com>

Re: [OMPI users] Some problems

2010-10-22 Thread Jeff Squyres
erence to 'MPI::Datatype::Free()' > > So could I get some help? Thanks! > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For co

Re: [OMPI users] OPEN MPI data transfer error

2010-10-22 Thread Jeff Squyres
s the possible reason ? > > > Any help is appreciated. > > thanks > > jack > > Oct. 22 1010 > _______ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] OPEN MPI data transfer error

2010-10-22 Thread Jeff Squyres
a little suspect of what you sent already (e.g., you didn't include the waitall, which is kinda important :-) ). -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Open MPI program cannot complete

2010-10-25 Thread Jeff Squyres
t the ZOMBIE_PID ? > > "ps" from the command line, or getpid() from C code. > > Jed > > ___ users mailing list > us...@open-mpi.org http://www.open-mpi.org/mailman/listinfo.cgi/users > __

Re: [OMPI users] Open MPI program cannot complete

2010-10-25 Thread Jeff Squyres
> > > better in a non-interactive environment. > > > > > > Where should I put the (gdb --batch -ex 'bt full' -ex 'info reg' -pid > > > ZOMBIE_PID) in the script ? > > > > > > Is control returning to your script after rank 0 has exited? In

Re: [OMPI users] Cross compiling for 32 bit from a 64 bit machine

2010-10-25 Thread Jeff Squyres
elp would be greatly > regarded, since I'm supposed to meet a deadline, and half of my semester has > been spent trying to get either LAM-MPI or Open MPI (which I was suggested I > shift to) to work. > Thank you very much. > > Regards, > Saahil___

Re: [OMPI users] Fwd: Cross compiling for 32 bit from a 64 bit machine

2010-10-25 Thread Jeff Squyres
;>>> /usr/bin/ld: final link failed: Invalid operation >>> >>>>> collect2: ld returned 1 exit status >>> >>>>> make[2]: *** [libopen-pal.la] Error 1 >>> >>>>> make[2]: Leaving directory `/home/wolf/Downloads/openmpi-1.5/opal' >>> >>>>> make[1]: *** [all-recursive] Error 1 >>> >>>>> make[1]: Leaving directory `/home/wolf/Downloads/openmpi-1.5/opal' >>> >>>>> make: *** [all-recursive] Error 1 >>> >>>>> >>> >>>>> >>> >>>>> Any idea as to how to go around fixing this? Any help would be greatly >>>>> regarded, since I'm supposed to meet a deadline, and half of my semester >>>>> has been spent trying to get either LAM-MPI or Open MPI (which I was >>>>> suggested I shift to) to work. >>> >>>>> Thank you very much. >>> >>>>> >>> >>>>> Regards, >>> >>>>> Saahil___ >>> >>>> users mailing list >>> >>>> us...@open-mpi.org >>> >>>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> >>> >>> >>> >>> >>> ___ >>> >>> users mailing list >>> >>> us...@open-mpi.org >>> >>> http://www.open-mpi.org/mailman/listinfo.cgi/users >>> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

[OMPI users] MPI-2.2: do you care?

2010-10-26 Thread Jeff Squyres
tures!" (to be fair, they're somewhat obscure features). Other than not having the obvious "OMPI is MPI-2.2 compliant" checkmark for marketing reasons, is there anyone who *needs* the functionality represented by those still-open tickets? Thanks for your input. -- Jeff Squyres jsquy...@cis

Re: [OMPI users] mixed versions of openmpi ? (1.4.1 and 1.4.3)

2010-10-29 Thread Jeff Squyres
> =NwEP > -END PGP SIGNATURE- > > _______ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] mixed versions of openmpi ? (1.4.1 and 1.4.3)

2010-10-29 Thread Jeff Squyres
and libmpi.so from v1.4.3 on other nodes). This is absolutely not guaranteed to work -- we don't even try to maintain this kind of compatibility. Does that make sense? -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] link problem on 64bit platform

2010-11-01 Thread Jeff Squyres
not CXXFLAGS to the configure script? -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] link problem on 64bit platform

2010-11-02 Thread Jeff Squyres
On Nov 2, 2010, at 4:57 AM, jody wrote: > So i guess the basic question is: > is it permitted to rename openMPI installations, and if yes how is > this porperly done (since a simple mv doesn't work) Yes: http://www.open-mpi.org/faq/?category=building#installdirs -- Jeff Squy

Re: [OMPI users] Configure error - ompi/mca/io/romio/romio/localdefs: No such file or directory

2010-11-02 Thread Jeff Squyres
** > ** This warning was added by the IU Computer Science Dept. mail scanner. ** > * > > > > ___ > users mailing list > us...@

Re: [OMPI users] Question about collective messages implementation

2010-11-02 Thread Jeff Squyres
ssions about this topic, but have never really addressed it -- the need for low latency has been greater than the need for blocking/not-consuming-CPU progress. -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Building OpenMPI 1.5.x

2010-11-02 Thread Jeff Squyres
__ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Building OpenMPI 1.5.x

2010-11-02 Thread Jeff Squyres
to proceed? (the specific version of openmpi in >> this case is the one available for download on Nov 2 from the main >> website; obtained via the src.rpm download) >> --Jim >> ___ >> users mailing list >> us..

Re: [OMPI users] open MPI please recommend a debugger for open MPI

2010-11-03 Thread Jeff Squyres
-CW-MPI-Mechanic.pdf :-) -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] open MPI please recommend a debugger for open MPI

2010-11-03 Thread Jeff Squyres
hem. But seriously, I try to use printf's for only truly trivial things; I use various kinds of debugging tools for everything else (e.g., I'll frequently use gdb to examine corefiles and/or attach to individual processes in parallel jobs to do heavy-lifting debugging). -- Jeff Squyres jsquy...@cisco.

Re: [OMPI users] Creating 64-bit PPC objects with xlc?

2010-11-09 Thread Jeff Squyres
f OpenMPI, and it seems to correctly support my > existing software. > > So, now for my question: > > Should FCFLAGS show up on these links lines and, if so, how would I get > 64-bit objects? > > Thanks, > > Brian Price > > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Creating 64-bit objects?

2010-11-10 Thread Jeff Squyres
nks, > > Brian Price > > > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Open MPI 1.5 is not detecting oversubscription

2010-11-10 Thread Jeff Squyres
parameter is not being set in cases of oversubscription, > with or without an explicit hostfile. > > Jed > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Jeff Squyres
d9c60 in PMPI_Send () from /home/gmaj/openmpi/lib/libmpi.so.0 > #8 0x0804e900 in main () > -- > > If anybody knows what may cause this problem or what may I do to find > the reason, any help is appreciated. > > My open-mpi is version 1.4.1. > > > Regards, > Grzegorz Maj > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > -- > David Zhang > University of California, San Diego > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Jeff Squyres
on both hosts (i.e., eth1 on hostA is reachable by eth1 on hostB; ditto for eth2). See the FAQ for more details on other ways of setting MCA parameters (environment vars, files, etc.). -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Jeff Squyres
s much better debug/error messages, but I've never had the time to finish it (indeed, I think there's a bug in that development branch right now, otherwise I'd recommend giving it a whirl). :-\ -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

[OMPI users] Open MPI at SC'10

2010-11-12 Thread Jeff Squyres
s on MPI-3. http://sc10.supercomputing.org/schedule/event_detail.php?evid=bof122 Many other Open MPI organizations who will also have a presence at SC; be sure to stop by their booths and say hello. See you there! -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www

Re: [OMPI users] Infiniband error

2010-11-12 Thread Jeff Squyres
find what the error codes mean? > > Thanks, > Ondrej Marsalek > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Deprecated parameter: plm_rsh_agent

2010-11-12 Thread Jeff Squyres
___ >>> users mailing list >>> us...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/users >> ___ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/

Re: [OMPI users] Calling MPI_Test() too many times results in a time spike

2010-11-12 Thread Jeff Squyres
Total time: " << total_time << " secs\n" > << "MPI_Test calls: " << time_res.size() << '\n' > << std::fixed << std::setprecision(10) > << "Average: " << average << &quo

  1   2   3   4   5   6   7   8   9   10   >