Re: [OMPI users] Error handling

2023-07-18 Thread George Bosilca via users
Alex, How are your values "random" if you provide correct values ? Even for negative values you could use MIN to pick one value and return it. What is the problem with `MPI_Abort` ? it does seem to do what you want. George. On Tue, Jul 18, 2023 at 4:38 AM Alexander Stadik via users < users@li

[OMPI users] Error handling

2023-07-18 Thread Alexander Stadik via users
Hey everyone, I am working for longer time now with cuda-aware OpenMPI, and developed longer time back a small exceptions handling framework including MPI and CUDA exceptions. Currently I am using MPI_Abort with costum error numbers, to terminate everything elegantly, which works well, by just

Re: [OMPI users] Error Handling Problem

2006-10-27 Thread George Bosilca
On Oct 27, 2006, at 10:56 AM, laurent.po...@fr.thalesgroup.com wrote: I did change the default error handler (using Mpi_Comm_set_errhandler) in the main_exe program. I replaced it with a printf. My error handler is never called, but main_exe receives a SIGPIPE signal. So the only solution

Re: [OMPI users] Error Handling Problem

2006-10-27 Thread Laurent . POREZ
> From: George Bosilca > Subject: Re: [OMPI users] Error Handling Problem > To: Open MPI Users > Message-ID: > Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed > > How about changing the default error handler ? I did change the default

Re: [OMPI users] Error Handling Problem

2006-10-26 Thread George Bosilca
How about changing the default error handler ? It is not supposed to work, and if you find an MPI implementation that support this approach please tell me. I know the paper where you read about this, but even with their MPI library this approach does not work. Soon, Open MPI will be able

[OMPI users] Error Handling Problem

2006-10-26 Thread Laurent . POREZ
Hi, I developped a launcher application : a MPI application (say main_exe) lauches 2 MPI applications (say exe1 and exe2), using MPI_Comm_spawn_multiple. Now, I'm looking at the behavior when an exe crashes. What I can see is the following : 1) when everybody is launched, I see the following