Re: [OMPI devel] Retrying a MPI_SEND

2012-01-26 Thread Hugo Daniel Meyer
Hello @ll. I'm reviving this topic because i've done things as you propose, and i still can't catch the error mentioned before. I will put here some pieces of code to contextualize. I've set the error handler: *MPI_Errhandler_set(MPI_COMM_WORLD,MPI_ERRORS_RETURN);* Then do a send like this: *

Re: [OMPI devel] Retrying a MPI_SEND

2011-12-20 Thread Hugo Daniel Meyer
Sorry for the delay. I will try with the MPI_ERRORS_RETURN handler, maybe that is my problem. Thanks a lot for your help. I'll let you know how it goes. Best regards. Hugo 2011/12/16 George Bosilca > Setting the error handler to MPI_ERRORS_RETURN is the right solution for > mechanism using th

Re: [OMPI devel] Retrying a MPI_SEND

2011-12-16 Thread George Bosilca
Setting the error handler to MPI_ERRORS_RETURN is the right solution for mechanism using the PMPI interface. Hugo is one software layer below the MPI interface, so the error handler is not affecting his code. However, once he reacts to an error, he should reset the error (in the status attached

Re: [OMPI devel] Retrying a MPI_SEND

2011-12-16 Thread Jeff Squyres
I'm jumping into the middle of this conversation and probably don't have all the right context, so forgive me if this is a stupid question: did you set MPI_ERRORS_RETURN on the communicator in question? On Dec 14, 2011, at 10:43 AM, Hugo Daniel Meyer wrote: > Hello George and @ll. > > Sorry f

Re: [OMPI devel] Retrying a MPI_SEND

2011-12-14 Thread Hugo Daniel Meyer
Hello George and @ll. Sorry for the late answer, but i was doing some trace to see where is set the MPI_ERROR. I took a look to ompi_request_default_wait and try to see what happen with request. Well, i've noticed that all requests that are not inmediately solved go to ompi_request_wait_completio

Re: [OMPI devel] Retrying a MPI_SEND

2011-12-09 Thread George Bosilca
On Dec 9, 2011, at 06:59 , Hugo Daniel Meyer wrote: > Hello George and all. > > I've been adapting some of the code to copy the request, and now i think that > it is working ok. I'm storing the request as you do on the pessimist, but i'm > only logging received messages, as my approach is a pe

Re: [OMPI devel] Retrying a MPI_SEND

2011-12-09 Thread Hugo Daniel Meyer
Hello George and all. I've been adapting some of the code to copy the request, and now i think that it is working ok. I'm storing the request as you do on the pessimist, but i'm only logging received messages, as my approach is a pessimist log based on the receiver. I do have a question about how

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-19 Thread Hugo Daniel Meyer
2011/11/18 George Bosilca > > On Nov 18, 2011, at 11:50 , Hugo Daniel Meyer wrote: > > > 2011/11/18 George Bosilca > >> >> On Nov 18, 2011, at 11:14 , Hugo Daniel Meyer wrote: >> >> 2011/11/18 George Bosilca >> >>> >>> On Nov 18, 2011, at 07:29 , Hugo Daniel Meyer wrote: >>> >>> Hello again. >>

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-18 Thread George Bosilca
On Nov 18, 2011, at 11:50 , Hugo Daniel Meyer wrote: > > 2011/11/18 George Bosilca > > On Nov 18, 2011, at 11:14 , Hugo Daniel Meyer wrote: > >> 2011/11/18 George Bosilca >> >> On Nov 18, 2011, at 07:29 , Hugo Daniel Meyer wrote: >> >>> Hello again. >>> >>> I was doing some trace into de

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-18 Thread Hugo Daniel Meyer
2011/11/18 George Bosilca > > On Nov 18, 2011, at 11:14 , Hugo Daniel Meyer wrote: > > 2011/11/18 George Bosilca > >> >> On Nov 18, 2011, at 07:29 , Hugo Daniel Meyer wrote: >> >> Hello again. >> >> I was doing some trace into de PML_OB1 files. I start to follow a >> MPI_Ssend() trying to find w

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-18 Thread George Bosilca
On Nov 18, 2011, at 11:14 , Hugo Daniel Meyer wrote: > 2011/11/18 George Bosilca > > On Nov 18, 2011, at 07:29 , Hugo Daniel Meyer wrote: > >> Hello again. >> >> I was doing some trace into de PML_OB1 files. I start to follow a >> MPI_Ssend() trying to find where a message is stored (in the

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-18 Thread Hugo Daniel Meyer
2011/11/18 George Bosilca > > On Nov 18, 2011, at 07:29 , Hugo Daniel Meyer wrote: > > Hello again. > > I was doing some trace into de PML_OB1 files. I start to follow a > MPI_Ssend() trying to find where a message is stored (in the sender) if it > is not send until the receiver post the recv, bu

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-18 Thread George Bosilca
On Nov 18, 2011, at 07:29 , Hugo Daniel Meyer wrote: > Hello again. > > I was doing some trace into de PML_OB1 files. I start to follow a MPI_Ssend() > trying to find where a message is stored (in the sender) if it is not send > until the receiver post the recv, but i didn't find that place.

Re: [OMPI devel] Retrying a MPI_SEND

2011-11-18 Thread Hugo Daniel Meyer
Hello again. I was doing some trace into de PML_OB1 files. I start to follow a MPI_Ssend() trying to find where a message is stored (in the sender) if it is not send until the receiver post the recv, but i didn't find that place. I've noticed that the message to be sent enters in * mca_pml_ob1_rn

[OMPI devel] Retrying a MPI_SEND

2011-11-17 Thread Hugo Daniel Meyer
Hello @ll. I'm doing some changes in the communication framework. Right now i'm working on a "secure" MPI_Send, this send needs to know when an endpoint goes down, and then retry the communication constructing a new endpoint, or at least, overwriting the data of the old endpoint with the new addre