Re: [OMPI users] libnuma.so error

2023-07-20 Thread Gus Correa via users
to pull their system image, separate from yum/dnf/apt/] Gus On Thu, Jul 20, 2023 at 4:00 AM Luis Cebamanos via users < users@lists.open-mpi.org> wrote: > Hi Gus, > > Yeap, I can see softlink is missing on the compute nodes. > > Thanks! > Luis >

Re: [OMPI users] libnuma.so error

2023-07-19 Thread Gus Correa via users
with: yum list | grep numa (CentOS 7, RHEL 7) dnf list | grep numa (CentOS 8, RHEL 8, RockyLinux 8, Fedora, etc) apt list | grep numa (Debian, Ubuntu) If not, you can install (or ask the system administrator to do it). I hope this helps, Gus Correa On Wed, Jul 19, 2023 at 11:55 AM Jeff Squyres
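The three listing commands above differ only in the package manager. A minimal sketch, assuming a POSIX shell, that picks whichever one is installed; the function name numa_list_cmd is made up for illustration and is not part of any distribution:

```shell
#!/bin/sh
# Hypothetical helper: print the package-listing command that matches
# the package manager found on this machine, or "none" if none is found.
numa_list_cmd() {
    if command -v dnf >/dev/null 2>&1; then
        echo "dnf list"
    elif command -v yum >/dev/null 2>&1; then
        echo "yum list"
    elif command -v apt >/dev/null 2>&1; then
        echo "apt list"
    else
        echo "none"
    fi
}

# Usage (commented out so that sourcing this file has no side effects):
# cmd=$(numa_list_cmd)
# [ "$cmd" != "none" ] && $cmd 2>/dev/null | grep -i numa
```

Run it on a compute node as well as on the head node, since the two may not have the same packages installed.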

Re: [OMPI users] Using OSU benchmarks for checking Infiniband network

2022-02-07 Thread Gus Correa via users
This may have changed since, but these used to be relevant points. Overall, the Open MPI FAQ has lots of good suggestions: https://www.open-mpi.org/faq/ some specific to performance tuning: https://www.open-mpi.org/faq/?category=tuning https://www.open-mpi.org/faq/?category=openfabrics 1) Make

Re: [OMPI users] stdout scrambled in file

2021-12-05 Thread Gus Correa via users
processes are talking. I hope this helps, Gus Correa On Sun, Dec 5, 2021 at 1:12 PM Jeff Squyres (jsquyres) via users < users@lists.open-mpi.org> wrote: > FWIW: Open MPI 4.1.2 has been released -- you can probably stop using an > RC release. > > I think you're probably run

Re: [OMPI users] Error with building OMPI with PGI

2021-01-14 Thread Gus Correa via users
.com/users@lists.open-mpi.org/msg08962.html https://www.mail-archive.com/users@lists.open-mpi.org/msg10375.html I hope this helps, Gus Correa On Thu, Jan 14, 2021 at 5:45 PM Passant A. Hafez via users < users@lists.open-mpi.org> wrote: > Hello, > > > I'm having an error when trying to

Re: [OMPI users] 4.0.5 on Linux Pop!_OS

2020-11-07 Thread Gus Correa via users
>> Core(s) per socket: 8 > "4. If none of a hostfile, the --host command line parameter, or an RM is > present, Open MPI defaults to the number of processor cores" Have you tried -np 8? On Sun, Nov 8, 2020 at 12:25 AM Paul Cizmas via users < users@lists.open-mpi.org> wrote: >

Re: [OMPI users] mpirun on Kubuntu 20.4.1 hangs

2020-10-21 Thread Gus Correa via users
-hostfile https://www.open-mpi.org/faq/?category=running I hope this helps, Gus Correa On Tue, Oct 20, 2020 at 4:47 PM Jorge SILVA via users < users@lists.open-mpi.org> wrote: > Hello, > > I installed kubuntu20.4.1 with openmpi 4.0.3-0ubuntu in two different > computers in the stand

Re: [OMPI users] Code failing when requesting all "processors"

2020-10-13 Thread Gus Correa via users
Can you use taskid after MPI_Finalize? Isn't it undefined/deallocated at that point? Just a question (... or two) ... Gus Correa > MPI_Finalize(); > > printf("END OF CODE from task %d\n", taskid); On Tue, Oct 13, 2020 at 10:34 AM Jeff Squyres (jsquyres) via users

Re: [OMPI users] MPI is still dominant paradigm?

2020-08-07 Thread Gus Correa via users
"The reports of MPI death are greatly exaggerated." [Mark Twain] And so are the reports of Fortran death (despite the efforts of many CS departments to make their students Fortran- and C-illiterate). IMHO the level of abstraction of MPI is adequate, and actually very well designed. Higher levels

Re: [OMPI users] Moving an installation

2020-07-24 Thread Gus Correa via users
+1 In my experience moving software, especially something of the complexity of (Open) MPI, is much more troublesome (and often just useless frustration) and time consuming than recompiling it. Hardware, OS, kernel, libraries, etc, are unlikely to be compatible. Gus Correa On Fri, Jul 24, 2020

Re: [OMPI users] mca_oob_tcp_recv_handler: invalid message type: 15

2019-12-10 Thread Gus Correa via users
Open MPI 4.0.2 here: /home/guido/libraries/compiled_with_gcc-7.3.0/openmpi-4.0.2/ Have you tried this instead? LD_LIBRARY_PATH=$HOME/libraries/compiled_with_gcc-7.3.0/openmpi-4.0.2/lib:$LD_LIBRARY_PATH I hope this helps, Gus Correa On Tue, Dec 10, 2019 at 4:40 PM Guido granda muñoz via users

Re: [OMPI users] know which CPU has the maximum value

2018-08-10 Thread Gus Correa
are great, knows nothing about the MPI Forum protocols and activities, but hopes the Forum pays attention to users' needs. Gus Correa PS - Jeff S.: Please, bring Diego's request to the Forum! Add my vote too. :) On 08/10/2018 02:19 PM, Jeff Squyres (jsquyres) via users wrote: Jeff H

Re: [OMPI users] know which CPU has the maximum value

2018-08-10 Thread Gus Correa
On 08/10/2018 01:27 PM, Jeff Squyres (jsquyres) via users wrote: It is unlikely that MPI_MINLOC and MPI_MAXLOC will go away any time soon. As far as I know, Nathan hasn't advanced a proposal to kill them in MPI-4, meaning that they'll likely continue to be in MPI for at least another 10

Re: [OMPI users] know which CPU has the maximum value

2018-08-10 Thread Gus Correa
if it strips off useful functionality. My cheap 2 cents from a user. Gus Correa On 08/10/2018 01:52 PM, Jeff Hammond wrote: This thread is a perfect illustration of why MPI Forum participants should not flippantly discuss feature deprecation in discussion with users.  Users who are not familiar

Re: [OMPI users] OpenMPI 3.0.1 - mpirun hangs with 2 hosts

2018-05-14 Thread Gus Correa
(as opposed to append) OpenMPI to your PATH? Say: export PATH='/home/user/openmpi_install/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/local/games:/snap/bin' I hope this helps, Gus Correa On 05/14/2018 12:40 PM, Max Mellette wrote: John, Thanks
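Prepending rather than appending can be sketched as a small helper, assuming a POSIX shell. The name prepend_dir and the directories in the usage lines are illustrative only. The empty-list case is handled separately because a trailing colon in PATH or LD_LIBRARY_PATH is read as the current directory:

```shell
#!/bin/sh
# Hypothetical helper: put a directory first in a colon-separated list.
# $1 = current list (may be empty), $2 = directory to put first
prepend_dir() {
    if [ -n "$1" ]; then
        echo "$2:$1"
    else
        # Avoid a trailing ":" when the list was empty.
        echo "$2"
    fi
}

# Usage (illustrative install location):
# PATH=$(prepend_dir "$PATH" "$HOME/openmpi_install/bin"); export PATH
# LD_LIBRARY_PATH=$(prepend_dir "$LD_LIBRARY_PATH" "$HOME/openmpi_install/lib")
# export LD_LIBRARY_PATH
```

With the Open MPI directory first, "which mpirun" should report the intended installation even if the system ships another MPI in /usr/bin.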

Re: [OMPI users] OpenMPI with-tm is not obeying torque

2017-10-06 Thread Gus Correa
que to a job, if any, or when Torque is configured without cpuset support, to somehow still bind the MPI processes to cores/processors/sockets/etc. I hope this helps, Gus Correa On 10/06/2017 02:22 AM, Anthony Thyssen wrote: Sorry r...@open-mpi.org as Gilles Gouai

Re: [OMPI users] Fwd: Make All error regarding either "Conflicting" or "Previous Declaration" among others

2017-09-21 Thread Gus Correa
&1 | tee my_make_install.log ** If using csh/tcsh: ./configure CC=gcc CXX=g++ F77=gfortran FC=gfortran --prefix=/usr/local/openmpi |& tee my_configure.log make |& tee my_make.log make install |& tee my_make_install.log I hope this helps, Gus Correa For what i

Re: [OMPI users] -host vs -hostfile

2017-07-31 Thread Gus Correa
pirun in a short Torque script: #PBS -l nodes=4:ppn=1 ... mpirun hostname The output should show all four nodes. Good luck! Gus Correa On 07/31/2017 02:41 PM, Mahmood Naderan wrote: Well it is confusing!! As you can see, I added four nodes to the host file (the same nodes are used by PBS). The -

Re: [OMPI users] -host vs -hostfile

2017-07-31 Thread Gus Correa
S_NODEFILE. However, that doesn't seem to be the case here, as the mpirun command line in the various emails has a single executable "a.out". I hope this helps. Gus Correa On 07/31/2017 12:43 PM, Elken, Tom wrote: “4 threads” In MPI, we refer to this as 4 ranks or 4 processes. So w

Re: [OMPI users] Q: Basic invoking of InfiniBand with OpenMPI

2017-07-17 Thread Gus Correa
On 07/17/2017 01:06 PM, Gus Correa wrote: Hi Boris The nodes may have standard Gigabit Ethernet interfaces, besides the Infiniband (RoCE). You may want to direct OpenMPI to use the Infiniband interfaces, not Gigabit Ethernet, by adding something like this to "--mca btl self,vader,self"

Re: [OMPI users] Q: Basic invoking of InfiniBand with OpenMPI

2017-07-17 Thread Gus Correa
aq/?category=all#tcp-selection BTW, some of your questions (and others that you may hit later) are covered in the OpenMPI FAQ: https://www.open-mpi.org/faq/?category=all I hope this helps, Gus Correa On 07/17/2017 12:43 PM, Boris M. Vulovic wrote: Gus, Gilles, Russell, John: Thanks very much f

Re: [OMPI users] Q: Basic invoking of InfiniBand with OpenMPI

2017-07-13 Thread Gus Correa
Have you tried: -mca btl vader,openib,self or -mca btl sm,openib,self by chance? That adds a btl for intra-node communication (vader or sm). On 07/13/2017 05:43 PM, Boris M. Vulovic wrote: I would like to know how to invoke InfiniBand hardware on CentOS 6x cluster with OpenMPI (static

Re: [OMPI users] Help

2017-04-27 Thread Gus Correa
: command not found” I am following the instructions from here: https://na-inet.jp/na/pccluster/centos_x86_64-en.html Any help is much appreciated. Corina You need to install openmpi.x86_64 also, not only openmpi-devel.x86_64. That is the minimum. I hope this helps, Gus Correa

Re: [OMPI users] MPI_ABORT was invoked on rank 0 in communicator compute with errorcode 59

2016-11-15 Thread Gus Correa
e more user friendly. You could also compile it with the flag -traceback (or -fbacktrace, the syntax depends on the compiler, check the compiler man page). This at least will tell you the location in the program where the segmentation fault happened (in the STDERR file of your job). I hope this h

Re: [OMPI users] how to build with memchecker using valgrind, preferable linux distro install of valgrind?

2016-07-14 Thread Gus Correa
Maybe just --with-valgrind or --with-valgrind=/usr would work? On 07/14/2016 11:32 AM, David A. Schneider wrote: I thought it would be a good idea to build a debugging version of openmpi 1.10.3. Following the instructions in the FAQ:

Re: [OMPI users] Restart after code hangs

2016-06-16 Thread Gus Correa
er/cluster), but in your case it can be adjusted to how often the program fails. All atmosphere/ocean/climate/weather_forecast models work this way (that's what we mostly run here). I guess most CFD, computational Chemistry, etc, programs also do. I hope this helps, Gus Correa On 06/16/2016 05:25 PM, A

Re: [OMPI users] "failed to create queue pair" problem, but settings appear OK

2016-06-15 Thread Gus Correa
to (#18 in tuning runtime MPI to OpenFabrics) regards the OFED kernel module parameters log_num_mtt and log_mtts_per_seg, not to the openib btl mca parameters. They may default to a less-than-optimal value. https://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem Gus Correa (not Chuck

Re: [OMPI users] "failed to create queue pair" problem, but settings appear OK

2016-06-15 Thread Gus Correa
) See also this FAQ related to registered memory. I set these parameters in /etc/modprobe.d/mlx4_core.conf, but where they're set may depend on the Linux distro/release and the OFED you're using. https://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem I hope this helps, Gus Correa On
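As I read the FAQ entry linked above, the registerable memory is (2^log_num_mtt) * (2^log_mtts_per_seg) * page size. A minimal sketch of that arithmetic in POSIX shell follows; the function name and the sample values (20 and 3) are illustrative, not recommendations:

```shell
#!/bin/sh
# Hypothetical helper: maximum registerable memory in bytes.
# $1 = log_num_mtt, $2 = log_mtts_per_seg, $3 = page size in bytes
max_reg_mem_bytes() {
    echo $(( (1 << $1) * (1 << $2) * $3 ))
}

# Usage (illustrative values; getconf PAGE_SIZE reports this node's page size):
# max_reg_mem_bytes 20 3 "$(getconf PAGE_SIZE)"
#
# A matching line in /etc/modprobe.d/mlx4_core.conf would look like:
#   options mlx4_core log_num_mtt=20 log_mtts_per_seg=3
```

With 4096-byte pages, the sample values give 2^35 bytes, i.e. 32 GiB. The FAQ suggests comparing the result against the physical memory of the node.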

Re: [OMPI users] [slightly off topic] hardware solutions with monetary cost in mind

2016-05-20 Thread Gus Correa
hope this helps, Gus Correa

Re: [OMPI users] No core dump in some cases

2016-05-10 Thread Gus Correa
ncarnation of an OpenMPI 1.6.5 question similar to yours (where .btr stands for backtrace): http://stackoverflow.com/questions/25275450/cause-all-processes-running-under-openmpi-to-dump-core Could this be due to a (unlikely) mix of OpenMPI 1.10 with 1.6.5? Gus Correa On Mon, May 9, 2016 at 12:04 PM,

Re: [OMPI users] No core dump in some cases

2016-05-09 Thread Gus Correa
this on the pbs_mom daemon init script (I am still before the systemd era, that lovely POS). And set the hard/soft limits on /etc/security/limits.conf as well. I hope this helps, Gus Correa On 05/07/2016 12:27 PM, Jeff Squyres (jsquyres) wrote: I'm afraid I don't know what a .btr file

Re: [OMPI users] Segmentation Fault (Core Dumped) on mpif90 -v

2016-05-05 Thread Gus Correa
you may need also to make the locked memory unlimited: ulimit -l unlimited I hope this helps, Gus Correa On 05/05/2016 05:15 AM, Giacomo Rossi wrote: gdb /opt/openmpi/1.10.2/intel/16.0.3/bin/mpif90 GNU gdb (GDB) 7.11 Copyright (C) 2016 Free Software Foundation, Inc. License GPLv3+: GNU GPL

Re: [OMPI users] MPIRUN SEGMENTATION FAULT

2016-04-25 Thread Gus Correa
events the core file from being created when the program crashes, but on the upside also prevents the disk from filling up with big core files that are forgotten and hang around forever. [ulimit -a will tell.] I hope this helps, Gus Correa On 04/23/2016 07:06 PM, Gilles Gouaillardet wrote: If you bu

Re: [OMPI users] Problems in compiling a code with dynamic linking

2016-03-24 Thread Gus Correa
on the nodes' /opt, which *probably* will work: https://software.intel.com/en-us/articles/intelr-composer-redistributable-libraries-by-version ** I hope this helps, Gus Correa On 03/24/2016 12:01 AM, Gilles Gouaillardet wrote: Elio, usually, /opt is a local filesystem, so it is possible /opt/intel

Re: [OMPI users] cleaning up old ROMIO (MPI-IO) drivers

2016-01-05 Thread Gus Correa
Hi Rob Your email says you'll keep PVFS2. However, on your blog PVFS2 is not mentioned (on the "Keep" list). I suppose it will be kept, right? Thank you, Gus Correa On 01/05/2016 12:31 PM, Rob Latham wrote: I'm itching to discard some of the little-used file system drivers in ROMIO,

Re: [OMPI users] pbs vs openmpi node allocation

2015-08-03 Thread Gus Correa
node file would be $PBS_NODEFILE. [You don't need to do it if Open MPI was built with Torque support.] I hope this helps. Gus Correa Thank you. -- Abhisek Mondal, Research Fellow, Structural Biology and Bioinformatics, Indian Institute of Chemical Biology, Kolkata 700032, INDIA
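Before handing the node file to mpirun, it can help to see what Torque actually allocated. A minimal sketch, assuming the node file lists one host name per allocated slot; the function name nodefile_summary is made up for illustration:

```shell
#!/bin/sh
# Hypothetical helper: summarize a PBS/Torque node file.
# $1 = path to the node file (one host name per allocated slot)
nodefile_summary() {
    slots=$(grep -c . "$1")              # non-empty lines = slots
    nodes=$(sort -u "$1" | grep -c .)    # distinct host names = nodes
    echo "$nodes nodes, $slots slots"
}

# Usage inside a job script, when Open MPI lacks Torque support
# (./a.out stands for your own executable):
# nodefile_summary "$PBS_NODEFILE"
# mpirun -np "$(grep -c . "$PBS_NODEFILE")" -hostfile "$PBS_NODEFILE" ./a.out
```

If the summary does not match the "#PBS -l nodes=...:ppn=..." request, the problem is in the allocation, not in Open MPI.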

Re: [OMPI users] shared memory performance

2015-07-22 Thread Gus Correa
n this case, I guess the mpirun options would be: mpirun --machinefile machine_mpi_bug.txt --mca btl self,vader,tcp I am not even sure if with "vader" the "self" btl is needed, as it was the case with "sm". An OMPI developer could jump into this conversation and

Re: [OMPI users] mpirun fails across cluster

2015-02-27 Thread Gus Correa
that is a common cause of trouble. OpenMPI needs PATH and LD_LIBRARY_PATH at runtime also. I hope this helps, Gus Correa On Fri, Feb 27, 2015 at 10:44 PM, Syed Ahsan Ali <ahsansha...@gmail.com> wrote: Dear Gus Thanks once again for suggestion. Yes I did that before installation to new path

Re: [OMPI users] mpirun fails across cluster

2015-02-27 Thread Gus Correa
Hi Syed Ahsan Ali To avoid any leftovers and further confusion, I suggest that you completely delete the old installation directory. Then start fresh from the configure step with --prefix=/share/apps/openmpi-1.8.4_gcc-4.9.2 I hope this helps, Gus Correa On 02/27/2015 12

Re: [OMPI users] mpirun fails across cluster

2015-02-27 Thread Gus Correa
cure about this, not making clear the difference between /export/apps and /share/apps. Issuing the Rocks commands: "tentakel 'ls -d /export/apps'" "tentakel 'ls -d /share/apps'" may show something useful. I hope this helps, Gus Correa On 02/27/2015 11:47 AM, Syed Ahsan Ali wrote: I am try

Re: [OMPI users] How to handle strides in MPI_Create_type_subarray - Re: MPI_type_create_struct + MPI_Type_vector + MPI_Type_contiguous

2015-01-16 Thread Gus Correa
Hi George Many thanks for your answer and interest in my questions. ... so ... more questions inline ... On 01/16/2015 03:41 PM, George Bosilca wrote: Gus, Please see my answers inline. On Jan 16, 2015, at 14:24 , Gus Correa <g...@ldeo.columbia.edu> wrote: Hi George It is still not

[OMPI users] How to handle strides in MPI_Create_type_subarray - Re: MPI_type_create_struct + MPI_Type_vector + MPI_Type_contiguous

2015-01-16 Thread Gus Correa
ere any simple example of how to achieve stride effect with MPI_Type_create_subarray in a multi-dimensional array? BTW, when are you gentlemen going to write an updated version of the "MPI - The Complete Reference"? :) Thank you, Gus Correa (Hijacking Diego Avesani's thread, apologies t

Re: [OMPI users] MPI_type_create_struct + MPI_Type_vector + MPI_Type_contiguous

2015-01-15 Thread Gus Correa
I/content6.html Gus Correa On 01/15/2015 06:53 PM, Diego Avesani wrote: dear George, dear Gus, dear all, Could you please tell me where I can find a good example? I am sorry but I can not understand the 3D array. Really Thanks Diego On 15 January 2015 at 20:13, George Bosilca <bosi..

Re: [OMPI users] MPI_type_create_struct + MPI_Type_vector + MPI_Type_contiguous

2015-01-13 Thread Gus Correa
(as you did in your previous code, with all the surprises regarding alignment, etc), not array sections. Also, MPI type vector should be more easy going (and probably more efficient) than MPI type struct, with fewer memory alignment problems. I hope this helps, Gus Correa PS - These books have

Re: [OMPI users] MPI_Type_Create_Struct + MPI_TYPE_CREATE_RESIZED

2015-01-08 Thread Gus Correa
Hi Diego *EITHER* declare your QQ and PR (?) structure components as DOUBLE PRECISION *OR* keep them REAL(dp) but *fix* your "dp" definition, as George Bosilca suggested. Gus Correa On 01/08/2015 06:36 PM, Diego Avesani wrote: Dear Gus, Dear All, so are you suggesting to

Re: [OMPI users] MPI_Type_Create_Struct + MPI_TYPE_CREATE_RESIZED

2015-01-08 Thread Gus Correa
suggested a while back. I hope this helps, Gus Correa Thanks again Diego On 8 January 2015 at 23:24, George Bosilca <bosi...@icl.utk.edu> wrote: Diego, Please find below the corrected example. There were several issues but the mos

Re: [OMPI users] libpsm_infinipath issues?

2015-01-08 Thread Gus Correa
Hi Michael, Andrew, list knem doesn't work in OMPI 1.8.3. See this thread: http://www.open-mpi.org/community/lists/users/2014/10/25511.php A fix was promised for OMPI 1.8.4: http://www.open-mpi.org/software/ompi/v1.8/ Have you tried it? I hope this helps, Gus Correa On 01/08/2015 04:36 PM

Re: [OMPI users] Icreasing OFED registerable memory

2015-01-06 Thread Gus Correa
ent you before for more details. I hope this helps, Gus Correa On 01/06/2015 01:37 PM, Deva wrote: Hi Waleed, -- Memlock limit: 65536 -- such a low limit should be due to per-user lock memory limit . Can you make sure it is set to "unlimited" on all nodes ( "

Re: [OMPI users] Icreasing OFED registerable memory

2014-12-30 Thread Gus Correa
=openfabrics#ib-locked-pages-more http://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem *** Having said that, a question remains unanswered: Why is Infiniband such a nightmare? *** I hope this helps, Gus Correa On 12/30/2014 09:16 AM, Waleed Lotfy wrote: Thank Devendar for your response. I'll

Re: [OMPI users] Open mpi based program runs as root and gives SIGSEGV under unprivileged user

2014-12-10 Thread Gus Correa
number of open files is yet another hurdle. And if you're using InfiniBand, the max locked memory size should be unlimited. Check /etc/security/limits.conf and "ulimit -a". I hope this helps, Gus Correa On 12/10/2014 08:28 AM, Gilles Gouaillardet wrote: Luca, your email mentions ope
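The two limits mentioned above can be checked one at a time instead of reading the whole "ulimit -a" listing. A minimal sketch, assuming a shell whose ulimit builtin accepts -l and -n (bash does); the function name check_limit and the threshold in the usage line are illustrative:

```shell
#!/bin/sh
# Hypothetical helper: report whether a soft limit meets a minimum.
# $1 = ulimit flag (-l locked memory, -n open files), $2 = minimum value
check_limit() {
    value=$(ulimit "$1")
    if [ "$value" = "unlimited" ] || [ "$value" -ge "$2" ]; then
        echo "ok ($value)"
    else
        echo "low ($value)"
    fi
}

# Usage (the 4096 threshold is only an example):
# check_limit -n 4096
# [ "$(ulimit -l)" = "unlimited" ] || echo "locked memory is limited"
```

Run the check from inside a batch job as well as from a login shell, since the limits seen by processes started through the resource manager may differ from the interactive ones.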

Re: [OMPI users] How OMPI picks ethernet interfaces

2014-11-13 Thread Gus Correa
(... well, I don't have voting rights on that, but I'll vote anyway ...) is to keep the current approach. It is wise and flexible, and easy to adjust and configure to specific machines with their own oddities, via MCA parameters, as I tried to explain in previous postings. My two cents, Gus Corre

Re: [OMPI users] How OMPI picks ethernet interfaces

2014-11-12 Thread Gus Correa
e sensible. :) Cheers, Gus Correa It tries so independent from the internal or external name of the headnode given in the machinefile - I hit ^C then. I attached the output of Open MPI 1.8.1 for this setup too. -- Reuti

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-30 Thread Gus Correa
questions below (specially the 12 vader parameters). Many thanks, Gus Correa On Oct 30, 2014, at 4:24 PM, Gus Correa <g...@ldeo.columbia.edu> wrote: Hi Nathan Thank you very much for addressing this problem. I read your notes on Jeff's blog about vader, and that clarified many

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-30 Thread Gus Correa
time with the btl_vader_single_copy_mechanism parameter? Or must OMPI be configured with only one memory copy mechanism? Many thanks, Gus Correa On 10/30/2014 05:44 PM, Nathan Hjelm wrote: I want to close the loop on this issue. 1.8.5 will address it in several ways: - knem support in btl/sm has been fixed. A sanity c

Re: [OMPI users] New ib locked pages behavior?

2014-10-21 Thread Gus Correa
this (but apparently no solution): http://www.open-mpi.org/community/lists/users/2013/02/21430.php Maybe Mellanox has more information about this? Gus Correa On 10/21/2014 08:15 PM, Bill Broadley wrote: On 10/21/2014 04:18 PM, Gus Correa wrote: Hi Bill Maybe you're missing these settings in /etc

Re: [OMPI users] New ib locked pages behavior?

2014-10-21 Thread Gus Correa
Hi Bill Maybe you're missing these settings in /etc/modprobe.d/mlx4_core.conf ? http://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem I hope this helps, Gus Correa On 10/21/2014 06:36 PM, Bill Broadley wrote: I've setup several clusters over the years with OpenMPI. I often get

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-17 Thread Gus Correa
omatically) * -mca btl openib,self (and vader will come along automatically) * -mca btl openib,self,vader (because vader is default only for 1-node jobs) * something else (or several alternatives) Whatever happened to the "self" btl in this new context? Gone? Still there? Many thanks, Gus Corr

Re: [OMPI users] Open MPI 1.8.3 openmpi-mca-params.conf: old and new parameters

2014-10-17 Thread Gus Correa
rocess placement conceptual model, along with its syntax and examples. Thank you, Gus Correa On 10/17/2014 12:10 AM, Ralph Castain wrote: I know this commit could be a little hard to parse, but I have updated the mpirun man page on the trunk and will port the change over to the 1.8 serie

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
th-tm=/opt/torque/4.2.5/gnu-4.4.7 \ --with-verbs=/usr \ --with-knem=/opt/knem-1.1.1 \ 2>&1 | tee configure_${build_id}.log Many thanks, Gus On Oct 16, 2014, at 4:24 PM, Gus Correa <g...@ldeo.columbia.edu> wrote: On 10/16/2014 05:38 PM, Nathan Hjelm wrote: On Thu, Oct 16, 2014 a

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
. On Oct 16, 2014, at 4:06 PM, Gus Correa <g...@ldeo.columbia.edu> wrote: Hi All Back to the original issue of knem in Open MPI 1.8.3. It really seems to be broken. I launched the Intel MPI benchmarks (IMB) job both with '-mca btl ^vader,tcp', and with '-mca btl sm,self,openib'. Both syntaxe

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
On 10/16/2014 05:38 PM, Nathan Hjelm wrote: On Thu, Oct 16, 2014 at 05:27:54PM -0400, Gus Correa wrote: Thank you, Aurelien! Aha, "vader btl", that is new to me! I thought Vader was that man dressed in black in Star Wars, Obi-Wan Kenobi's nemesis. That was a while ago, my kids wer

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
me any incentive to upgrade our production codes to OMPI 1.8. Will this be fixed in the next Open MPI 1.8 release? Thank you, Gus Correa PS - Many thanks to Aurelien Boutelier for pointing out the existence of the vader btl. Without his tip I would still be in the dark side. On 10/16/2014 05:46

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
their MPI applications running in production mode, hopefully with Open MPI 1.8, can somebody explain more clearly what "vader" is about? Thank you, Gus Correa On Thu, Oct 16, 2014 at 01:49:09PM -0700, Ralph Castain wrote: FWIW: vader is the default in 1.8 On Oct 16, 2014, at 1:40 PM

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
openib, etc)? How does it affect knem? What are vader's pros/cons w.r.t. using the other btls? In which conditions is it good or bad to use it vs. the other btls? What do I gain/lose if I do "btl = sm,self,openib" (which presumably will knock off tcp and "vader"), or maybe "btl=^tcp,^v

Re: [OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
? I am on CentOS 6.5, stock kernel 2.6.32, no 3.1, no CMA Linux, so I believe I need knem for now. I tried '-mca btl_base_verbose 30' but no knem information came out. Many thanks, Gus Correa On 10/16/2014 04:40 PM, Aurélien Bouteiller wrote: Are you sure you are not using the vader BTL ? Setting

[OMPI users] knem in Open MPI 1.8.3

2014-10-16 Thread Gus Correa
vidence I have that knem is active in 1.6.5 but not in 1.8.3 comes only from the statistics in /dev/knem. *** Thank you, Gus Correa *** PS - As an aside, I also have some questions on the knem setup, which I mostly copied from the knem web site (hopefully Brice Goglin is listening ...): -

Re: [OMPI users] Open MPI 1.8.3 openmpi-mca-params.conf: old and new parameters

2014-10-16 Thread Gus Correa
mpiexec options: -bind-to-core, rmaps_base_schedule_policy, orte_process_binding, etc. Thank you, Gus Correa On 10/15/2014 11:10 PM, Ralph Castain wrote: On Oct 15, 2014, at 11:46 AM, Gus Correa <g...@ldeo.columbia.edu> wrote: Thank you Ralph and Jeff fo

Re: [OMPI users] Hybrid OpenMPI/OpenMP leading to deadlocks?

2014-10-16 Thread Gus Correa
+ short job queue time policy is very common out there. Here most problems with long runs (we have some non-restartable serial code die-hards), happen due to NFS issues (busy, slow response, etc), and code with poorly designed IO. My two cents, Gus Correa On 10/16/2014 10:16 AM, McGrattan, Kevin B. Dr

Re: [OMPI users] Open MPI 1.8.3 openmpi-mca-params.conf: old and new parameters

2014-10-15 Thread Gus Correa
, Gus Correa On 10/15/2014 11:12 AM, Jeff Squyres (jsquyres) wrote: We talked off-list -- fixed this on master and just filed https://github.com/open-mpi/ompi-release/pull/33 to get this into the v1.8 branch. On Oct 14, 2014, at 7:39 PM, Ralph Castain <r...@open-mpi.org> wrote: On

Re: [OMPI users] General question about running single-node jobs.

2014-10-02 Thread Gus Correa
and $OMPI/lib to LD_LIBRARY_PATH and are these environment variables propagated to the job execution nodes (specially those that are failing)? Anyway, just a bunch of guesses ... Gus Correa * QCSCRATCH Defines the directory in which Q-Chem will store

Re: [OMPI users] About debugging and asynchronous communication

2014-09-18 Thread Gus Correa
There is no guarantee that the messages will be received in the same order that they were sent. Use tags or another mechanism to match the messages on send and recv ends. On 09/18/2014 10:42 AM, XingFENG wrote: I have found some thing strange. Basically, in my codes, processes send and receive

Re: [OMPI users] compilation problem with ifort

2014-09-04 Thread Gus Correa
, lapack, fft) and to build them. At least that is what seems to have happened on my computer. So, I don't think you need any other libraries. Good luck, Gus Correa On 09/04/2014 04:17 PM, Elio Physics wrote: Dear Gus, Firstly I really need to thank you for the effort you are doing to help me

Re: [OMPI users] compilation problem with ifort

2014-09-04 Thread Gus Correa
that it needs. And this is *exactly what the error message in your first email showed*, a bunch of object files that were not found. *** Sorry, but I cannot do any better than this. I hope this helps, Gus Correa On 09/03/2014 08:59 PM, Elio Physics wrote: Ray and Gus, Thanks a lot for your help. I fo

Re: [OMPI users] compilation problem with ifort

2014-09-03 Thread Gus Correa
d top EPW directory (which per the recipe is right below the top QE) plays a role. Anyway, phonons are not my playground, just trying to help two-cent-wise, although this is not really an MPI or OpenMPI issue, more of a Makefile/configure issue specific to QE and EPW. Thanks, Gus Correa On 09/03/2014 07:

Re: [OMPI users] compilation problem with ifort

2014-09-03 Thread Gus Correa
he EPW web site? http://epw.org.uk/Main/DownloadAndInstall ** I hope this helps, Gus Correa On 09/03/2014 06:48 PM, Elio Physics wrote: I have already done all of the steps you mentioned. I have installed the older version of quantum espresso, configured it and followed all the steps on the E

Re: [OMPI users] compilation problem with ifort

2014-09-03 Thread Gus Correa
they have a mailing list or bulletin board where you could get specific help for their software? (Either on EPW or on Quantum ESPRESSO (which seems to be required): http://www.quantum-espresso.org/) That would probably be the right forum to ask your questions. My two cents, Gus Correa On 09/03

Re: [OMPI users] compilation problem with ifort

2014-09-03 Thread Gus Correa
Was the error that you listed the *first* error? Apparently various object files are missing from the ../../Modules/ directory, and were not compiled, suggesting something is amiss even before the compilation of the executable (epw.x). On 09/03/2014 05:20 PM, Elio Physics wrote: Dear all, I

Re: [OMPI users] building openmpi 1.8.1 with intel 14.0.1

2014-08-21 Thread Gus Correa
Hi Peter If I remember right from my compilation of OMPI on a Mac years ago, you need to have Xcode installed, in case you don't. If vampir-trace is the only problem, you can disable it when you configure OMPI (--disable-vt). My two cents, Gus Correa On 08/21/2014 03:35 PM, Bosler, Peter

Re: [OMPI users] Newbie query - mpirun will not run if it's previously been killed with Control-C

2014-08-07 Thread Gus Correa
On 08/07/2014 11:49 AM, Ralph Castain wrote: On Aug 7, 2014, at 8:47 AM, Reuti <re...@staff.uni-marburg.de> wrote: On 07.08.2014 at 17:28, Gus Correa wrote: I guess Control-C will kill only the mpirun process. You may need to kill the (tw

Re: [OMPI users] Newbie query - mpirun will not run if it's previously been killed with Control-C

2014-08-07 Thread Gus Correa
On 08/07/2014 11:28 AM, Gus Correa wrote: I guess Control-C will kill only the mpirun process. You may need to kill the (two) jules.exe processes separately, say, with kill -9. ps -u "yourname" will show what you have running. Something may have been left behind by Control-C

Re: [OMPI users] Newbie query - mpirun will not run if it's previously been killed with Control-C

2014-08-07 Thread Gus Correa
I guess Control-C will kill only the mpirun process. You may need to kill the (two) jules.exe processes separately, say, with kill -9. ps -u "yourname" will show what you have running. On 08/07/2014 11:16 AM, Jane Lewis wrote: Hi all, This is a really simple problem (I hope) where I’ve

Re: [OMPI users] How to keep multiple installations at same time

2014-08-05 Thread Gus Correa
und the official one to do what you need/want, is a pain. Anyway, this is the OMPI list, not a place for advocacy of either package, so I am going to stop here. I just wanted to set the record straight that: - the Environment Modules package is not dead, - it has a large user base, and - it is sooo good

Re: [OMPI users] How to keep multiple installations at same time

2014-08-05 Thread Gus Correa
the same exact thing that they currently have, and in the end gain little if any relevant/useful/new functionality. My two cents of opinion Gus Correa On 08/05/2014 12:54 PM, Ralph Castain wrote: Check the repo - hasn't been touched in a very long time On Aug 5, 2014, at 9:42 AM, Fabricio Cannini

Re: [OMPI users] How to keep multiple installations at same time

2014-08-05 Thread Gus Correa
from each of these directories, using the appropriate compilers, and pointing to two distinct *installation directories* (with configure --prefix). My two cents, Gus Correa On 08/04/2014 11:54 PM, Andrew Caird wrote: Hi Ahsan, We, and I think many people, use the Environment Modules software

Re: [OMPI users] openmpi 1.8.1 gfortran not working

2014-08-04 Thread Gus Correa
ix? (CC, CXX, FC) Then "make distclean; configure; make; make install". Gus Correa On 08/04/2014 04:10 PM, Dan Shell wrote: Ralph Ok I will give that a try Thanks Dan Shell -Original Message- From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Mo

Re: [OMPI users] Configuring openib on openmpi 1.8.1

2014-07-30 Thread Gus Correa
not be mixed. The OMPI implementations should be the same on all machines as well. Running "which mpirun" on those machines may help. These user environment problems often cause confusion. My two cents, Gus Correa On 07/30/2014 09:56 AM, Ralph Castain wrote: Does "polaris"

Re: [OMPI users] mpifort wrapper.txt

2014-07-29 Thread Gus Correa
ta.F90 -- No underlying compiler was specified in the wrapper compiler data file (e.g., mpicc-wrapper-data.txt) The error message is complaining about mpicc, not mpifort. I wonder if this may be due to a Makefile misconfiguration again. My two cents, Gus Correa

Re: [OMPI users] Trying to use openmpi with MOM getting a compile error

2014-07-25 Thread Gus Correa
On 07/25/2014 03:02 PM, Jeff Squyres (jsquyres) wrote: On Jul 25, 2014, at 1:14 PM, Gus Correa <g...@ldeo.columbia.edu> wrote: Change the mkmf.template file and replace the Fortran compiler name (gfortran) by the Open MPI (OMPI) Fortran compiler wrapper: mpifortran (or mpif90 if it

Re: [OMPI users] Trying to use openmpi with MOM getting a compile error

2014-07-25 Thread Gus Correa
(e.g. to MPICH libraries and include files). Then rebuild the Makefile and compile MOM again. I hope this helps. Gus Correa On 07/25/2014 12:37 PM, Dan Shell wrote: OpenMOM-mpi I am trying to compile MOM and have installed openmpi 1.8.1 getting an installation error below Looking for some help

Re: [OMPI users] configure fails to detect missing libcrypto

2014-07-24 Thread Gus Correa
lem. Could your libcrypto be in an unusual location? Maybe you need to load a Torque environment module to add it to your LD_LIBRARY_PATH before you build OMPI? Gus Correa On 07/24/2014 05:18 PM, Jeff Hammond wrote: That could be the case. I've reported the missing libcrypto issue to NERSC alre

Re: [OMPI users] Problem moving from 1.4 to 1.6

2014-06-27 Thread Gus Correa
, env |grep PATH and env |grep LD_LIBRARY_PATH may hint if you have a mixed environment and mixed MPI implementations and versions. I hope this helps, Gus Correa PS - BTW, unless your company's policies forbid, you can install OpenMPI on a user directory, say, your /home directory

Re: [OMPI users] Problem moving from 1.4 to 1.6

2014-06-27 Thread Gus Correa
cause of the problem. If you want to use openib, switch to --mca btl openib,sm,self Another thing to check is whether there is a mixup of environment variables, PATH and LD_LIBRARY_PATH perhaps pointing to the old OMPI version you may have installed. My two cents, Gus Correa On 06/27/2014 12

Re: [OMPI users] openib segfaults with Torque

2014-06-11 Thread Gus Correa
, etc), to inherit those limits. Or not? Gus Correa On 06/11/2014 06:20 PM, Jeff Squyres (jsquyres) wrote: +1 On Jun 11, 2014, at 6:01 PM, Ralph Castain <r...@open-mpi.org> wrote: Yeah, I think we've seen that somewhere before too... On Jun 11, 2014, at 2:59 PM, Joshua Ladd &l

Re: [OMPI users] Determining what parameters a scheduler passes to OpenMPI

2014-06-06 Thread Gus Correa
ue parlance). This mostly matters if there is more than one job running on a node. However, Torque doesn't bind processes/MPI_ranks to cores or sockets or whatever. As Ralph said, Open MPI does that. I believe Open MPI doesn't use the cpuset info from Torque. (Ralph, please correct me if I am w

Re: [OMPI users] intermittent segfaults with openib on ring_c.c

2014-06-04 Thread Gus Correa
ferred transport layer for intra-node communication. Gus Correa On 06/04/2014 11:13 AM, Ralph Castain wrote: Thanks!! Really appreciate your help - I'll try to figure out what went wrong and get back to you On Jun 4, 2014, at 8:07 AM, Fischer, Greg A. <fisch...@westinghouse.com <

Re: [OMPI users] openmpi configuration error?

2014-05-21 Thread Gus Correa
Linux2_x 007FD3A0 Unknown Unknown Unknown CCTM_V5g_Linux2_x 007BA9A2 Unknown Unknown Unknown CCTM_V5g_Linux2_x 00759288 Unknown Unknown Unknown ... On Wed, May 21, 2014 at 2:08 PM, Gus Correa <g...@ldeo.columbia.edu <mailto:g

Re: [OMPI users] openmpi configuration error?

2014-05-21 Thread Gus Correa
een marked to conflict with 1.4.4. Is it? Anyway, you may want to do a 'which mpiexec' to see which one is taking precedence in your environment (1.6.5 or 1.4.4) Probably 1.6.5. Does the code work now, or does it continue to fail? I hope this helps, Gus Correa On 05/21/2014 02:36 PM, Ben Lash wr

Re: [OMPI users] openmpi configuration error?

2014-05-16 Thread Gus Correa
Probably just module swap openmpi/1.4.4-intel openmpi/1.6.5-intel) You may need to tweak with the Makefile, if it hardwires the MPI wrappers/binary location, or the library and include paths. Some do, some don't. Gus Correa [bl10@login2 ~]$ echo $PATH /home/bl10/rlib/deps/bin:/opt/apps/netcdf

Re: [OMPI users] openmpi configuration error?

2014-05-16 Thread Gus Correa
directory, then it is probably out of date. Why don't you try to recompile the code with the current Open MPI installed in the cluster? module avail will show everything, and you can pick the latest, load it, and try to recompile the program with that. Gus Correa On Fri, May 16, 2014 at 5:07

Re: [OMPI users] openmpi configuration error?

2014-05-16 Thread Gus Correa
lib:$LD_LIBRARY_PATH if csh to .cshrc setenv PATH /opt/apps/openmpi/retired/1.4.4-intel/bin:$PATH setenv LD_LIBRARY_PATH /opt/apps/openmpi/retired/1.4.4-intel/lib:$LD_LIBRARY_PATH I hope this helps, Gus Correa On 05/16/2014 05:39 PM, Ben Lash wrote: My cluster has just upgraded to a new ver
