Re: [OMPI devel] Process placement

2016-05-05 Thread Aurélien Bouteiller
OWN = dancer01 dancer00 dancer01 dancer01 dancer01 dancer00 dancer00 dancer00 dancer00 dancer00 dancer00 dancer00 -- Aurélien Bouteiller, Ph.D. ~~ https://icl.cs.utk.edu/~bouteill/ > On 13 Apr 2016, at 13:38, R

Re: [OMPI devel] Confusion about slots

2016-03-23 Thread Aurélien Bouteiller
-- Aurélien Bouteiller, Ph.D. ~~ https://icl.cs.utk.edu/~bouteill/ > On 23 Mar 2016, at 16:24, Ralph Castain <r...@open-mpi.org> wrote: > > “Slots” are an abstraction commonly used by schedulers as a way of indicating > how many processes are a

[OMPI devel] use-mpi mpiext?

2016-02-24 Thread Aurélien Bouteiller
mpi/libmpiext_blabla_usempi.la' make[1]: *** [libmpi_usempi_ignore_tkr.la] Error 1 -- Aurélien Bouteiller, Ph.D. ~~ https://icl.cs.utk.edu/~bouteill/ smime.p7s Description: S/MIME cryptographic signature

Re: [OMPI devel] Remote orted verbosity

2015-11-23 Thread Aurélien Bouteiller
Federico, Just add -debug-daemons to the mpirun command options. Aurélien -- Aurélien Bouteiller, Ph.D. ~~ https://icl.cs.utk.edu/~bouteill/ > On 23 Nov 2015, at 08:55, Federico Reghenzani > <federico1.reghenz...@mail.polimi.it>

[OMPI devel] smcuda higher exclusivity than anything else?

2015-05-20 Thread Aurélien Bouteiller
should be identical to the regular sm component (as long as no GPU operations are required). This is not the case: there is some performance penalty with smcuda compared to sm. Aurelien -- Aurélien Bouteiller ~~ https://icl.cs.utk.edu/~bouteill/ signature.asc Description: Message signed

[OMPI devel] 1.8.5rc1 and OOB on Cray XC30

2015-04-16 Thread Aurélien Bouteiller
r:16915] mca: base: close: unloading component tcp -- Aurélien Bouteiller ~ https://icl.cs.utk.edu/~bouteill/ signature.asc Description: Message signed with OpenPGP using GPGMail

Re: [OMPI devel] RFC: "v1.9.0" (vs. "v1.9")

2014-09-22 Thread Aurélien Bouteiller
a nightly build. Reserving a particular number (like 1.9.0) for all non-release versions of a general series could help avoid this. -- ~~~ Aurélien Bouteiller, Ph.D. ~~~ ~ Research Scientist @ ICL ~ The University of Tennessee, Innovative Computing Laboratory 1122 Volunteer

Re: [OMPI devel] RFC: "v1.9.0" (vs. "v1.9")

2014-09-22 Thread Aurélien Bouteiller
Could also start at 1.9.1 instead of 1.9.0. That gives a free number for the “trunk” nightly builds. -- ~~~ Aurélien Bouteiller, Ph.D. ~~~ ~ Research Scientist @ ICL ~ The University of Tennessee, Innovative Computing Laboratory 1122 Volunteer Blvd, suite 309, Knoxville

Re: [OMPI devel] KNEM + user-space hybrid for sm BTL

2013-07-18 Thread Aurélien Bouteiller
, PhD – High Performance Computing Team > RWTH Aachen University, Center for Computing and Communication > Rechen- und Kommunikationszentrum der RWTH Aachen > Seffenter Weg 23, D 52074 Aachen (Germany) > > > _______ > devel mailing list >

Re: [OMPI devel] [EXTERNAL] Re: RFC: support for Mellanox's "libhcoll" library

2013-06-18 Thread Aurélien Bouteiller
istinfo.cgi/devel >> >> >> -- >> Jeff Squyres >> jsquy...@cisco.com >> For corporate legal information go to: >> http://www.cisco.com/web/about/doing_business/legal/cri/ >> >> >> ___ >&

Re: [OMPI devel] June OMPI developer's meeting

2013-05-10 Thread Aurélien Bouteiller
> US Geological Survey > 650-329-5608 > ba...@usgs.gov > > > > ___ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel -- * Dr. Aurélien Bouteiller * Researcher at Innovative Computing Laboratory *

Re: [OMPI devel] enabling ft-enable-cr + vprotocol

2012-07-23 Thread Aurélien Bouteiller
> http://www.open-mpi.org/mailman/listinfo.cgi/devel -- * Dr. Aurélien Bouteiller * Researcher at Innovative Computing Laboratory * University of Tennessee * 1122 Volunteer Boulevard, suite 309b * Knoxville, TN 37996 * 865 974 9375 signature.asc Description: Message signed with OpenPGP using GPGMail

Re: [OMPI devel] Pessimist Event Logger

2012-01-27 Thread Aurélien Bouteiller
Hugo, It seems you want to implement some sort of remote pessimistic logging -a la MPICH-V1- ? MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes -- George Bosilca, Aurélien Bouteiller, Franck Cappello, Samir Djilali, Gilles Fédak, Cécile Germain, Thomas Hérault, Pierre

Re: [OMPI devel] Pessimist Event Logger

2012-01-27 Thread Aurélien Bouteiller
e something that i've got to turn on, or i will have to modify this > behavior manually to connect and send messages to the EL? > > Thanks in advance. > > Hugo Meyer > ___ > devel mailing list > de...@open-mpi.org > http://ww

Re: [OMPI devel] [OMPI svn] svn:open-mpi r23931

2010-10-25 Thread Aurélien Bouteiller
Ralph, In file included from ../../../../../trunk/opal/mca/event/libevent207/libevent207_module.c:44: ../../../../../trunk/opal/mca/event/libevent207/libevent/event.h:165:33: error: event2/event-config.h: No such file or directory Looks like you forgot some files. Aurelien On 25 Oct.

[OMPI devel] orte does not compile on XT5 (pgcc)

2010-09-29 Thread Aurélien Bouteiller
Here is the problem. The PGI compiler is especially paranoid regarding post-declared structure typedefs. It looks like the include ordering causes the nidmap.h file to be included before the orte_jmap_t typedef and its siblings have been declared. /opt/cray/xt-asyncpe/4.0/bin/cc: INFO: linux target is

Re: [OMPI devel] Autogen.pl, romio and autoconf 2.66

2010-09-28 Thread Aurélien Bouteiller
On 28 Sep 2010, at 18:10, Aurélien Bouteiller wrote: > > On 28 Sep 2010, at 17:55, Jeff Squyres wrote: > >> On Sep 28, 2010, at 5:30 PM, Aurélien Bouteiller wrote: >> >>> Hi there, >>> >>> has anybody tried to compile ompi trunk wit

Re: [OMPI devel] Autogen.pl, romio and autoconf 2.66

2010-09-28 Thread Aurélien Bouteiller
On 28 Sep 2010, at 17:55, Jeff Squyres wrote: > On Sep 28, 2010, at 5:30 PM, Aurélien Bouteiller wrote: > >> Hi there, >> >> has anybody tried to compile ompi trunk with autoconf 2.66 ? It fails when >> configuring romio with the following error: >>

[OMPI devel] Autogen.pl, romio and autoconf 2.66

2010-09-28 Thread Aurélien Bouteiller
Hi there, has anybody tried to compile ompi trunk with autoconf 2.66 ? It fails when configuring romio with the following error: === Processing subdir: /nics/c/home/bouteill/ompi/trunk/ompi/mca/io/romio/romio --- Found configure.in|ac; running autoreconf... autoreconf: Entering directory `.'

Re: [OMPI devel] what's the relationship between proc, endpoint and btl?

2010-02-24 Thread Aurélien Bouteiller
A btl is the component responsible for a particular type of fabric. An endpoint is, roughly, the instantiation of a btl used to reach a particular destination on a particular fabric; a proc holds the generic name and properties of a destination. Aurelien On 24 Feb 2010, at 09:59, hu yaohui wrote: > Could
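The relationship described in this reply can be pictured with a few C structs. This is only a sketch under assumptions: every type, field, and function name below is hypothetical and chosen for illustration, and none of them are Open MPI's actual definitions.

```c
#include <stddef.h>

/* All names below are hypothetical and for illustration only;
 * they are not Open MPI's real data structures. */

/* proc: the generic name and properties of a destination process. */
struct example_proc {
    int         rank;
    const char *hostname;
};

/* btl: the component responsible for one particular type of fabric. */
struct example_btl {
    const char *fabric;   /* e.g. "tcp" or "sm" */
};

/* endpoint: ties one btl to one proc, i.e. "how this fabric
 * reaches that destination". */
struct example_endpoint {
    const struct example_btl  *btl;
    const struct example_proc *proc;
};

/* Build the endpoint a given btl would use to reach a given proc. */
struct example_endpoint
example_endpoint_for(const struct example_btl *btl,
                     const struct example_proc *proc)
{
    struct example_endpoint ep;
    ep.btl  = btl;
    ep.proc = proc;
    return ep;
}
```

In this picture a single proc that is reachable over two fabrics ends up with two endpoints, one per btl, while the proc itself is shared between them.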

Re: [OMPI devel] Vprotocol pessimist - Open MPI 1.4.1 and 1.4.2a1r22558

2010-02-24 Thread Aurélien Bouteiller
Hi, The instructions you found are now obsolete. I'll update them, thank you for pointing that out. The new procedure to use uncoordinated checkpointing is now mpirun -mca vprotocol pessimist -mca pml ob1,v [regular arguments]. The version available in trunk does not support actual restart due to

[OMPI devel] MCA component dependency

2009-03-25 Thread Aurélien Bouteiller
Hi everyone, I'm trying to state that a particular component depends on another that should therefore be dlopened automatically when it is loaded. I found some code doing exactly that in mca_base_component_find:open_component, but can't find any example of how to actually trigger that

Re: [OMPI devel] [OMPI svn] svn:open-mpi r20196

2009-01-05 Thread Aurélien Bouteiller
Addendum to the previous message concerning this discussion: I think we should stick with including opal_stdint everywhere instead of inttypes.h (this file does not always exist on ANSI pedantic compilers). Aurelien On 4 Jan 09, at 00:09, timat...@osl.iu.edu wrote: Author: timattox

Re: [OMPI devel] [OMPI svn] svn:open-mpi r20196

2009-01-05 Thread Aurélien Bouteiller
Tim, To answer your question in ticket #869: the only known missing feature in opal_stdint.h is that there is no portable way to printf size_t. Its type is subject to so many changes depending on the platform and compiler that it is impossible to be sure that PRI_size_t is not
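One common workaround for the size_t formatting problem mentioned in this reply is to cast to a fixed, known integer type before printing. The sketch below is not the approach taken in opal_stdint.h; the function name is hypothetical, and the cast assumes the value fits in unsigned long, which does not hold for large values on platforms where unsigned long is 32 bits and size_t is 64 bits (such as 64-bit Windows). Where C99 is available, the `%zu` conversion handles size_t directly.

```c
#include <stdio.h>
#include <stddef.h>

/* Hypothetical helper, for illustration only: format a size_t by
 * casting it to unsigned long, whose printf conversion (%lu) is
 * well defined everywhere. Assumes the value fits in unsigned long. */
int example_format_size(char *buf, size_t buflen, size_t value)
{
    return snprintf(buf, buflen, "%lu", (unsigned long)value);
}
```

The return value follows snprintf: the number of characters that the formatted value needs, not counting the terminating null byte.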

Re: [OMPI devel] Should visibility and memchecker abort configure?

2008-10-03 Thread Aurélien Bouteiller
Hi Ralph, 1. No. Having visibility turned off without knowing it is the best way for us to commit bugs in the trunk without noticing, I mean before somebody else gets a leg caught in the "not-compiling-trunk trap". I have had more than my share of responsibility for that kind of problem in

Re: [OMPI devel] [OMPI svn] svn:open-mpi r19653

2008-09-29 Thread Aurélien Bouteiller
nks Ralph Aurelien On Sep 28, 2008, at 8:33 AM, Aurélien Bouteiller wrote: Ralph, I just split the existing static function from inside the dpm and exposed it to the outside world. The idea is that the dpm creates the (opaque) port strings and therefore knows how they are supposed t

Re: [OMPI devel] trunk temporarily closed

2008-09-25 Thread Aurélien Bouteiller
Any idea of a timeframe for the problem to get fixed ? Aurelien On 25 Sep 08, at 14:03, Jeff Squyres wrote: On Sep 25, 2008, at 1:44 PM, Jeff Squyres (jsquyres) wrote: The SVN trunk has been temporarily closed due to what may be an accidental commit. The entire OMPI SVN is now

Re: [OMPI devel] gdb libmpi.dylib on Leopard

2008-09-19 Thread Aurélien Bouteiller
iendly side effect compilation time should improve a lot. Thanks, -- * Dr. Aurélien Bouteiller * Sr. Research Associate at Innovative Computing Laboratory * University of Tennessee * 1122 Volunteer Boulevard, suite 350 * Knoxville, TN 37996 * 865 974 6321 (on behalf of the Open MPI development commun

Re: [OMPI devel] gdb libmpi.dylib on Leopard

2008-09-19 Thread Aurélien Bouteiller
e "/data/jsquyres/svn/ompi/ ompi/.libs/libmpi.lax/libdatatype.a/copy_functions_heterogeneous.o" - no debug information available for "copy_functions_heterogeneous.c". ----- On Sep 19, 2008, at 2:31 PM, Ralf Wildenhues wrote: * Aurélien Bouteiller wrote on Fri, Sep 19, 2008

Re: [OMPI devel] gdb libmpi.dylib on Leopard

2008-09-19 Thread Aurélien Bouteiller
Thanks Ralf for the support. I upgraded to libtool 2.2.6 and it didn't solve the problem though. Still looking for somebody to confirm whether it's working or not working on their Mac. Aurelien On 17 Sep 08, at 12:39, Ralf Wildenhues wrote: Hello Aurélien, * Aurélien Bouteiller wrote

[OMPI devel] gdb libmpi.dylib on Leopard

2008-09-17 Thread Aurélien Bouteiller
essful at debugging Open MPI on Leopard ? Is this a bug of Open MPI or a bug in libtool/ gdb ? Any known fix ? Aurelien -- * Dr. Aurélien Bouteiller * Sr. Research Associate at Innovative Computing Laboratory * University of Tennessee * 1122 Volunteer Boulevard, suite 350 * Knoxville, TN 37996 * 865 974 6321

Re: [OMPI devel] PLM consistency: priority

2008-07-11 Thread Aurélien Bouteiller
We don't want the user to have to select by hand the best PML. The logic inside the current selection process selects the best pml for the underlying network. However changing the priority is pretty meaningless from the user's point of view. So while retaining the selection process

Re: [OMPI devel] [OMPI svn] svn:open-mpi r18804

2008-07-03 Thread Aurélien Bouteiller
Thanks Ralph, this fix does the trick. Aurelien On 3 Jul 08, at 13:53, r...@osl.iu.edu wrote: Author: rhc Date: 2008-07-03 13:53:37 EDT (Thu, 03 Jul 2008) New Revision: 18804 URL: https://svn.open-mpi.org/trac/ompi/changeset/18804 Log: Repair the MPI-2 dynamic operations. This includes:

Re: [OMPI devel] PML selection logic

2008-06-23 Thread Aurélien Bouteiller
The first approach sounds fair enough to me. We should avoid 2 and 3 as the pml selection mechanism used to be more complex before we reduced it to accommodate a major design bug in the BTL selection process. When using the complete PML selection, BTL would be initialized several times,

Re: [OMPI devel] "__printf__" attribute

2008-05-08 Thread Aurélien Bouteiller
They refer to the parameters of the function. In the linked example, 2 means that fmt is the second argument of the function, and 3 is the index of the first variadic argument checked against the fmt string. Aurelien On 8 May 08, at 18:24, Jeff Squyres wrote: Rainer -- What do the numeric arguments refer to in
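The argument numbering explained in this reply can be illustrated with a small C sketch. The macro and function names below are hypothetical (they are not the Open MPI ones from the linked example), and the attribute is only enabled for compilers that define `__GNUC__`; on other compilers the macro expands to nothing.

```c
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical wrapper macro: on GCC-compatible compilers it attaches
 * the format attribute; elsewhere it expands to nothing. */
#if defined(__GNUC__)
#define EXAMPLE_PRINTF_ATTR(fmt_idx, first_arg) \
    __attribute__((__format__(__printf__, fmt_idx, first_arg)))
#else
#define EXAMPLE_PRINTF_ATTR(fmt_idx, first_arg)
#endif

/* Parameters are counted from 1: stream is 1, fmt is 2, and the
 * variadic arguments start at 3. Hence (2, 3). */
int example_log(FILE *stream, const char *fmt, ...) EXAMPLE_PRINTF_ATTR(2, 3);

/* Hypothetical logging helper: forwards its arguments to vfprintf
 * and returns the number of characters written. */
int example_log(FILE *stream, const char *fmt, ...)
{
    va_list ap;
    int written;

    va_start(ap, fmt);
    written = vfprintf(stream, fmt, ap);
    va_end(ap);
    return written;
}
```

With the attribute in place, a call such as `example_log(stderr, "%s", 42)` should draw a format warning from compilers that support it, exactly as a mismatched printf call would.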

Re: [OMPI devel] [OMPI svn] svn:open-mpi r18303

2008-04-25 Thread Aurélien Bouteiller
closing the port? Or do we automatically close any open ports when finalize is called? Or do we automatically close the port after the connect/accept is completed? Thanks Ralph On 4/25/08 3:13 PM, "Aurélien Bouteiller" <boute...@eecs.utk.edu> wrote: Actually, the port was

Re: [OMPI devel] MPI_Comm_connect/Accept

2008-04-08 Thread Aurélien Bouteiller
oper format of the orterun cmd-line option. The syntax is: -ompi-server uri or -ompi-server file:filename-where-uri-exists Problem here is that you gave it a uri of "test", which means nothing. ;-) Should have it up-and-going soon. Ralph On 4/4/08 12:02 PM, "Aurélien Bo

Re: [OMPI devel] MPI_Comm_connect/Accept

2008-04-04 Thread Aurélien Bouteiller
erver" and see if it shows up. Holler if you have a question - not sure I documented it very thoroughly at the time. On 4/3/08 3:10 PM, "Aurélien Bouteiller" <boute...@eecs.utk.edu> wrote: Ralph, I am using trunk. Is there a documentation for ompi-server ? Sounds exact

Re: [OMPI devel] MPI_Comm_connect/Accept

2008-04-03 Thread Aurélien Bouteiller
he same mechanism, I would have expected connect/accept to work as well. If you are talking about 1.2.x, then the story is totally different. Ralph On 4/3/08 2:29 PM, "Aurélien Bouteiller" <boute...@eecs.utk.edu> wrote: Hi everyone, I'm trying to figure out how complete i

[OMPI devel] MPI_Comm_connect/Accept

2008-04-03 Thread Aurélien Bouteiller
t); MPI_Waitall(nc, reqs, MPI_STATUSES_IGNORE); for(i = 0; i < nc; i++) { printf("event[%d] = %d\n", i, event[i]); MPI_Comm_disconnect(_nodes[i]); printf("Disconnect %d\n", i); } } MPI_Finalize();

Re: [OMPI devel] Fault tolerance

2008-03-07 Thread Aurélien Bouteiller
We now use the errmgr. Aurelien On 6 Mar 08, at 13:38, Aurélien Bouteiller wrote: Aside from what Josh said, we are working right now at UTK on orted/MPI recovery (without killing/respawning all). For now we had no use of the errmgr, but I'm quite sure this would be the smartest place

Re: [OMPI devel] OMPI and Mac Leopard

2008-02-23 Thread Aurélien Bouteiller
Trunk works fine on Leopard in both static and DSO builds. I didn't try the tmp branch on Leopard though. Aurelien On 22 Feb 08, at 23:17, Ralph Castain wrote: I have confirmed that my tmp branch now builds and works on the Mac Leopard OS, at least on an Intel arch. It is really

[OMPI devel] PML V will be enabled again

2008-02-08 Thread Aurélien Bouteiller
plan to remove the ignore tag on wed. feb. 6. Thanks, Aurelien -- Dr. Aurélien Bouteiller Sr. Research Associate - Innovative Computing Laboratory Suite 350, 1122 Volunteer Boulevard Knoxville, TN 37996 865 974 6321

Re: [OMPI devel] orte_ns_base_select failed: returned value -1 instead of ORTE_SUCCESS

2008-01-31 Thread Aurélien Bouteiller
regularly. On 1/30/08 2:54 PM, "Aurélien Bouteiller" <boute...@eecs.utk.edu> wrote: I get a runtime error in static build on Mac OS 10.5 (automake 1.10, autoconf 2.60, gcc-apple-darwin 4.01, libtool 1.5.22). The error does not occur in dso builds, and everything seems to work fine

[OMPI devel] orte_ns_base_select failed: returned value -1 instead of ORTE_SUCCESS

2008-01-30 Thread Aurélien Bouteiller
MPI developer): ompi_mpi_init: orte_init_stage1 failed --> Returned "Error" (-1) instead of "Success" (0) -- *** An error occurred in MPI_Init *** before MPI was initialized *** MPI_ERRORS_ARE_FATAL (g

Re: [OMPI devel] RES: v pml question

2008-01-29 Thread Aurélien Bouteiller
I just agree with Josh. We thought about it a bit, and nothing should prevent using both. Aurelien On 29 Jan 08, at 15:01, Josh Hursey wrote: At the moment I do not plan on joining the crcpw and v_protocol. However those two components may currently work just fine together. They are both

Re: [OMPI devel] Trunk borked

2008-01-29 Thread Aurélien Bouteiller
DSO build also fails. ../../../../../../trunk/ompi/contrib/vt/vt/vtlib/vt_comp_gnu.c:312:5: warning: "VT_BFD" is not defined ../../../../../../trunk/ompi/contrib/vt/vt/vtlib/vt_comp_gnu.c:312:5: warning: "VT_BFD" is not defined /usr/bin/ld: cannot find -lz collect2: ld returned 1 exit status

Re: [OMPI devel] Fwd: === CREATE FAILURE ===

2008-01-24 Thread Aurélien Bouteiller
According to POSIX, tar should not limit the file name length. Only the v7 implementation of tar is limited to 99 characters. GNU tar has never been limited in the number of characters file names can have. You should check with tar --help that tar on your machine defaults to format=gnu or

Re: [OMPI devel] RES: v pml question

2008-01-23 Thread Aurélien Bouteiller
Hi, Actually it might already work. We have never tried it, but nothing should prevent it. The symlinks are necessary to trick the autogen and configure stages. This is required to avoid code replication from autogen.sh. If you look carefully you will see that the symlinks are created only

Re: [OMPI devel] trunk breakage

2008-01-23 Thread Aurélien Bouteiller
Should be fixed with r17184. Thanks for the quick bug report! Aurelien On 23 Jan 08, at 14:08, Jeff Squyres wrote: The vprotocol pml does not compile for me. make[4]: Entering directory `/home/jsquyres/svn/ompi2/ompi/mca/pml/v/ vprotocol/pessimist' /bin/sh ../../../../../../libtool

Re: [OMPI devel] [OMPI svn] svn:open-mpi r17177

2008-01-23 Thread Aurélien Bouteiller
Undefined symbols: "_opal_carto_base_components_opened", referenced from: _opal_carto_base_components_opened$non_lazy_ptr in components.o "_opal_carto_base_open", referenced from: ompi_info::open_components() in components.o "_opal_carto_base_close", referenced from: