Re: [OMPI users] Error code for I/O operations

2021-07-01 Thread Aurelien Bouteiller via users
Eric, 

Error codes are **not** mandated to be consistent across ranks in the MPI 
standard. Although file errors are not expected to cause the program to abort, 
there is no special case for I/O in that regard in the MPI standard. The Open 
MPI implementation acts accordingly and will not automatically ‘reduce’ error 
codes.


> On Jun 30, 2021, at 23:53, Eric Chamberland via users 
>  wrote:
> 
> Hi,
> 
> I have a simple question about error codes returned by MPI_File_*_all* and 
> MPI_File_open/close functions:
> 
> If an error is returned will it be the same for *all* processes? In other 
> worlds, are error codes communicated under the hood so we, end users, can 
> avoid to add "reduce" on those error codes?
> 
> Thanks,
> 
> Eric
> 
> -- 
> Eric Chamberland, ing., M. Ing
> Professionnel de recherche
> GIREF/Université Laval
> (418) 656-2131 poste 41 22 42
> 



Best Regards, 
Aurelien

--
Aurelien Bouteiller, Ph.D. 
Innovative Computing Laboratory; The University of Tennessee; 
1122 Volunteer Blvd.; Claxton suite 203; 37996, Knoxville, TN, USA
+1 865 974 9308 (p); +1 865 974 8296 (f); Claxton 316 (f2f)








Re: [OMPI users] Are there any issues (performance or otherwise) building apps with different compiler from the one used to build openmpi?

2019-03-22 Thread Aurelien Bouteiller
You can set the environment variable OMPI_CC to change what compiler mpicc uses 
as the application compiler, irrespective of what compiler Open MPI was built 
with. 

That should work for C applications just fine.

For Fortran programs the story is more complicated, as the MPI Fortran module 
would be tied to the compiler version.


Best,
Aurelien

> On Mar 20, 2019, at 04:26, Daniel Letai  wrote:
> 
> Hello,
> 
> Assuming I have installed openmpi built with distro stock gcc(4.4.7 on rhel 
> 6.5), but an app requires a different gcc version (8.2 manually built on dev 
> machine).
> 
> Would there be any issues, or performance penalty, if building the app using 
> the more recent gcc with flags from wrapper compiler's --showme as 
> perhttps://www.open-mpi.org/faq/?category=mpi-apps#cant-use-wrappers 
>  ?
> Openmpi is built with both pmix and ucx enabled, all built with stock 
> gcc(4.4.7).
> 
> Since the constraint is the app, if the answer is yes I would have to build 
> openmpi using non-distro gcc which is a bit of a hassle.
> 
> Thanks in advance
> --Dani_L.
> ___
> users mailing list
> users@lists.open-mpi.org 
> https://lists.open-mpi.org/mailman/listinfo/users 
> 
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] Set oversubscribe as default

2019-02-15 Thread Aurelien Bouteiller
Florian,
this issue has already been addressed and will be part of future 4.x releases. 

https://github.com/open-mpi/ompi/pull/6139 


Best,
Aurelien

> On Feb 15, 2019, at 11:52, Florian Lindner  wrote:
> 
> Hello,
> 
> I used to have oversubscribe set a default using the environment variable 
> OMPI_MCA_rmaps_base_oversubscribe. However, since recently, probably since 
> 4.0.0, that doesn't seem to work anymore.
> 
> % echo $OMPI_MCA_rmaps_base_oversubscribe 
> 1
> % mpirun --version
> mpirun (Open MPI) 4.0.0
> 
> Report bugs to http://www.open-mpi.org/community/help/
> lindnefn@asaru ~ % mpirun -n 4 ls
> --
> There are not enough slots available in the system to satisfy the 4 slots
> that were requested by the application:
>  ls
> 
> Either request fewer slots for your application, or make more slots available
> for use.
> --
> % mpirun --oversubscribe -n 4 ls
> [works]
> 
> How can enable oversubscribe as default again?
> 
> Thanks,
> Florian
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] How to enable vprotocol + pessimistic message logging?

2019-01-22 Thread Aurelien Bouteiller
Hey Kiril, 

Indeed, the pessimist message logging does not support threaded accesses. That 
test is however overly cautious if you are initializing as multiple but then do 
not perform concurrent accesses. Please verify that he NBP3.3 do 
MPI_INIT_THREADS(MPI_THREAD_SINGLE), as doing other initialization will set 
MPI_THREAD_MULTIPLE implicitly. 

Best,
Aurelien

> On Jan 22, 2019, at 12:13, Kiril Dichev  wrote:
> 
> Hi,
> 
> I’m doing some research on message logging protocols. It seems that Vprotocol 
> in Open MPI can wrap around communication calls and log messages, if enabled. 
> Unfortunately, when I try to use it with Open MPI- 4.0.0, I get an error:
> 
> mpirun   --mca vprotocol pessimist-mca vprotocol_pessimist_priority 10  
> -n 4 $HOME/NPB3.3-MPI/bin/cg.B.4
> …
> vprotocol_pessimist: component_init: threads are enabled, and not supported 
> by vprotocol pessimist fault tolerant layer, will not load
> …
> 
> Unfortunately, it seems that actually disabling multi-threading is not 
> possible in 4.0.0 (MPI_THREAD_MULTIPLE is always used during compilation, and 
> in contrast to the README file, --enable-mpi-thread-multiple or 
> —disable-mpi-thread-multiple are not recognised as options). 
> 
> I’m pretty much stuck. Should I give up on the VProtocol as unusable then at 
> the moment?
> 
> Thanks,
> Kiril
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] Forcing MPI processes to end

2017-11-16 Thread Aurelien Bouteiller
Adam. Your MPI program is incorrect. You need to replace the finalize on
the process that found the error with MPIAbort

On Nov 16, 2017 10:38, "Adam Sylvester"  wrote:

> I'm using Open MPI 2.1.0 for this but I'm not sure if this is more of an
> Open MPI-specific implementation question or what the MPI standard
> guarantees.
>
> I have an application which runs across multiple ranks, eventually
> reaching an MPI_Gather() call.  Along the way, if one of the ranks
> encounters an error, it will call report the error to a log, call
> MPI_Finalize(), and exit with a non-zero return code.  If this happens
> prior to the other ranks making it to the gather, it seems like mpirun
> notices this and the process ends on all ranks.  This is what I want to
> happen - it's a legitimate error, so all processes should be freed up so
> the next job can run.  It seems like if the other ranks make it into the
> MPI_Gather() before the one rank reports an error, the other ranks wait in
> the MPI_Gather() forever.
>
> Is there something simple I can do to guarantee that if any process calls
> MPI_Finalize(), all my ranks terminate?
>
> Thanks.
> -Adam
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users
>
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] MPI Spawn functionality and core allocation questions

2016-06-29 Thread Aurelien Bouteiller
You may want to look at the —oversubscribe mpirun option. 

If you want more control, you can consider making a rankfile where you place 
explicitly processes. 

Aurélien 

> Le 29 juin 2016 à 11:50, Jason Maldonis  a écrit :
> 
> Hi everyone,
> 
> I am having trouble developing a complicated parallelization algorithm with 
> MPI and I'm hoping for some tips (I am using OpenMPI 1.10.2). I posted the 
> latest problem I ran into on Stack Overflow and got a response from someone 
> saying they don't think it is possible to do the spawn allocation that I want.
> 
> You have all been very helpful in the past, so I am hoping to get your 
> opinions.
> 
> Briefly, I am trying to oversubscribe cores to run spawned processes while 
> the main program's processes sit and wait in a barrier.  Because the syntax 
> highlighting is better, it might be easiest to look at my post on Stack 
> Overflow than me trying to paste it in an email. Please see this link:  
> http://stackoverflow.com/questions/38087327/mpi-spawn-not-enough-slots-available-all-which-nodes-are-allocated-for-this-j
>  
> 
> 
> Thanks for your help,
> Jason
> ___
> users mailing list
> us...@open-mpi.org
> Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2016/06/29561.php



smime.p7s
Description: S/MIME cryptographic signature


Re: [OMPI users] error with Vprotocol pessimist

2007-12-13 Thread Aurelien Bouteiller
If you want to use the pessimist message logging you have to use the "- 
mca vprotocol pessimist" flag on your command line. This should work  
despite the bug because if I understand correctly, the issue you  
experience should occur only when fault tolerance is disabled.
I have troubles to reproduce the particular bug you are experiencing.  
What compiler and what architecture are you using ?


Aurelien
Le 13 déc. 07 à 07:58, Thomas Ropars a écrit :


I still have the same error after update (r16951).

I have the lib/openmpi/mca_pml_v.so file in my builld and the command
line I use is: mpirun -np 4 my_application

Thomas


Aurelien Bouteiller wrote:

I could reproduce and fix the bug. It will be corrected in trunk as
soon as the svn is online again. Thanks for reporting the problem.

Aurelien

Le 11 déc. 07 à 15:02, Aurelien Bouteiller a écrit :



I cannot reproduce the error. Please make sure you have the lib/
openmpi/mca_pml_v.so file in your build. If you don't, maybe you
forgot to run autogen.sh at the root of the trunk when you
removed .ompi_ignore.

If this does not fix the problem, please let me know your command  
line

options to mpirun.

Aurelien

Le 11 déc. 07 à 14:36, Aurelien Bouteiller a écrit :



Mmm, I'll investigate this today.

Aurelien
Le 11 déc. 07 à 08:46, Thomas Ropars a écrit :



Hi,

I've tried to test the message logging component vprotocol
pessimist.
(svn checkout revision 16926)
When I run an mpi application, I get the following error :

mca: base: component_find: unable to open vprotocol pessimist:
/local/openmpi/lib/openmpi/mca_vprotocol_pessimist.so: undefined
symbol:
pml_v_output (ignored)


Regards

Thomas
___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users



--
Dr. Aurelien Bouteiller, Sr. Research Associate
Innovative Computing Laboratory - MPI group
+1 865 974 6321
1122 Volunteer Boulevard
Claxton Education Building Suite 350
Knoxville, TN 37996


___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users




___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users





Re: [OMPI users] error with Vprotocol pessimist

2007-12-12 Thread Aurelien Bouteiller
I could reproduce and fix the bug. It will be corrected in trunk as  
soon as the svn is online again. Thanks for reporting the problem.


Aurelien

Le 11 déc. 07 à 15:02, Aurelien Bouteiller a écrit :


I cannot reproduce the error. Please make sure you have the lib/
openmpi/mca_pml_v.so file in your build. If you don't, maybe you
forgot to run autogen.sh at the root of the trunk when you
removed .ompi_ignore.

If this does not fix the problem, please let me know your command line
options to mpirun.

Aurelien

Le 11 déc. 07 à 14:36, Aurelien Bouteiller a écrit :


Mmm, I'll investigate this today.

Aurelien
Le 11 déc. 07 à 08:46, Thomas Ropars a écrit :


Hi,

I've tried to test the message logging component vprotocol  
pessimist.

(svn checkout revision 16926)
When I run an mpi application, I get the following error :

mca: base: component_find: unable to open vprotocol pessimist:
/local/openmpi/lib/openmpi/mca_vprotocol_pessimist.so: undefined
symbol:
pml_v_output (ignored)


Regards

Thomas
___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users




--
Dr. Aurelien Bouteiller, Sr. Research Associate
Innovative Computing Laboratory - MPI group
+1 865 974 6321
1122 Volunteer Boulevard
Claxton Education Building Suite 350
Knoxville, TN 37996


___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users



___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users





Re: [OMPI users] error with Vprotocol pessimist

2007-12-11 Thread Aurelien Bouteiller
I cannot reproduce the error. Please make sure you have the lib/ 
openmpi/mca_pml_v.so file in your build. If you don't, maybe you  
forgot to run autogen.sh at the root of the trunk when you  
removed .ompi_ignore.


If this does not fix the problem, please let me know your command line  
options to mpirun.


Aurelien

Le 11 déc. 07 à 14:36, Aurelien Bouteiller a écrit :


Mmm, I'll investigate this today.

Aurelien
Le 11 déc. 07 à 08:46, Thomas Ropars a écrit :


Hi,

I've tried to test the message logging component vprotocol pessimist.
(svn checkout revision 16926)
When I run an mpi application, I get the following error :

mca: base: component_find: unable to open vprotocol pessimist:
/local/openmpi/lib/openmpi/mca_vprotocol_pessimist.so: undefined
symbol:
pml_v_output (ignored)


Regards

Thomas
___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users




--
Dr. Aurelien Bouteiller, Sr. Research Associate
Innovative Computing Laboratory - MPI group
+1 865 974 6321
1122 Volunteer Boulevard
Claxton Education Building Suite 350
Knoxville, TN 37996


___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users





Re: [OMPI users] error with Vprotocol pessimist

2007-12-11 Thread Aurelien Bouteiller

Mmm, I'll investigate this today.

Aurelien
Le 11 déc. 07 à 08:46, Thomas Ropars a écrit :


Hi,

I've tried to test the message logging component vprotocol pessimist.
(svn checkout revision 16926)
When I run an mpi application, I get the following error :

mca: base: component_find: unable to open vprotocol pessimist:
/local/openmpi/lib/openmpi/mca_vprotocol_pessimist.so: undefined  
symbol:

pml_v_output (ignored)


Regards

Thomas
___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users




--
Dr. Aurelien Bouteiller, Sr. Research Associate
Innovative Computing Laboratory - MPI group
+1 865 974 6321
1122 Volunteer Boulevard
Claxton Education Building Suite 350
Knoxville, TN 37996




Re: [OMPI users] Tmpdir work for first process only

2007-11-15 Thread Aurelien Bouteiller
sessions-kxc565@ac27_0/default-
universe-27928/0
[ac27:27928] unidir:
/home/565/kxc565/tmpdir/openmpi-sessions-kxc565@ac27_0/default-
universe-27928
[ac27:27928] top: openmpi-sessions-kxc565@ac27_0
[ac27:27928] tmp: ?
[ac27:27928] [0,0,0] contact_file
/home/565/kxc565/tmpdir/openmpi-sessions-kxc565@ac27_0/default-
universe-27928/universe-setup.txt
[ac27:27928] [0,0,0] wrote setup file
[ac27:27932] [0,0,1] setting up session dir with
[ac27:27932] universe default-universe-27928
[ac27:27932] user kxc565
[ac27:27932] host ac27
[ac27:27932] jobid 0
[ac27:27932] procid 1
[ac27:27932] procdir:
/tmp/openmpi-sessions-kxc565@ac27_0/default-universe-27928/0/1
[ac27:27932] jobdir:
/tmp/openmpi-sessions-kxc565@ac27_0/default-universe-27928/0
[ac27:27932] unidir:
/tmp/openmpi-sessions-kxc565@ac27_0/default-universe-27928
[ac27:27932] top: openmpi-sessions-kxc565@ac27_0
[ac27:27932] tmp: /tmp
[ac27:27932] [0,0,1] ORTE_ERROR_LOG: Out of resource in file
base/iof_base_setup.c at line 106
[ac27:27932] [0,0,1] ORTE_ERROR_LOG: Out of resource in file
odls_default_module.c at line 663
[ac27:27932] [0,0,1] ORTE_ERROR_LOG: Out of resource in file
odls_default_module.c at line 1191
[ac27:27932] [0,0,1] ORTE_ERROR_LOG: Out of resource in file
orted.c at
line 594
[ac27:27928] spawn: in job_state_callback(jobid = 1, state = 0x80)
mpirun noticed that job rank 0 with PID 0 on node ac27 exited on
signal
15 (Terminated).
[ac27:27932] sess_dir_finalize: job session dir not empty - leaving
[ac27:27932] sess_dir_finalize: proc session dir not empty -  
leaving
[ac27:27928] sess_dir_finalize: proc session dir not empty -  
leaving





--
Clement Kam Man Chu
Research Assistant
Faculty of Information Technology
Monash University, Caulfield Campus
Ph: 61 3 9903 2355

___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users








--
Clement Kam Man Chu
Research Assistant
Faculty of Information Technology
Monash University, Caulfield Campus
Ph: 61 3 9903 2355

___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users




--
Dr. Aurelien Bouteiller, Sr. Research Associate
Innovative Computing Laboratory - MPI group
+1 865 974 6321
1122 Volunteer Boulevard
Claxton Education Building Suite 350
Knoxville, TN 37996