Galen, George and others who might have SM BTL interest.
While looking at MPI_Iprobe performance I found what I think is an issue. If an application using the SM BTL does a small message send (<= 256 bytes) followed by an MPI_Iprobe, the mca_btl_sm_component function that is eventually called from opal_progress will receive an ACK message from its earlier send and then return. The net effect is that the real message, which sits behind the ACK on the fifo, doesn't get read until a second MPI_Iprobe is made. It seems to me that mca_btl_sm_component should read all ACK messages from a particular fifo until it either finds a real send fragment or the fifo is empty. Otherwise, we are forcing calls like MPI_Iprobe to not return messages that are really there. I am not sure about IB, but I know the TCP BTL does not show this issue (which doesn't surprise me, since I imagine that BTL relies on TCP to handle this type of protocol work).
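To make the proposed change concrete, here is a minimal standalone sketch in C of the two behaviors. This is not the actual Open MPI code: fifo_t, frag_type_t, progress_once, and progress_drain_acks are all hypothetical stand-ins for the real SM BTL fifo and progress machinery.

    /* Simplified model of the proposed fix. All names here are
     * hypothetical stand-ins, not the actual Open MPI SM BTL API. */
    #include <stdio.h>
    #include <stddef.h>

    typedef enum { FRAG_NONE, FRAG_ACK, FRAG_SEND } frag_type_t;

    typedef struct {
        frag_type_t items[16];
        size_t head, tail;
    } fifo_t;

    static void fifo_push(fifo_t *f, frag_type_t t)
    {
        f->items[f->tail++ % 16] = t;
    }

    static frag_type_t fifo_pop(fifo_t *f)
    {
        if (f->head == f->tail) return FRAG_NONE;  /* fifo empty */
        return f->items[f->head++ % 16];
    }

    /* Current behavior (as described above): process exactly one
     * fragment per progress call, so an ACK "shadows" the real
     * message queued behind it. */
    static frag_type_t progress_once(fifo_t *f)
    {
        return fifo_pop(f);
    }

    /* Proposed behavior: keep consuming ACK fragments until we hit a
     * real send fragment or the fifo runs dry, so one MPI_Iprobe-driven
     * progress call can see a message sitting behind an ACK. */
    static frag_type_t progress_drain_acks(fifo_t *f)
    {
        frag_type_t t;
        while ((t = fifo_pop(f)) == FRAG_ACK) {
            /* ACK bookkeeping (e.g. returning the sent fragment to
             * the free list) would happen here; then check the next
             * item on the fifo. */
        }
        return t;  /* FRAG_SEND if a real message was found, else FRAG_NONE */
    }

    int main(void)
    {
        fifo_t f = { .head = 0, .tail = 0 };
        fifo_push(&f, FRAG_ACK);   /* ACK from our earlier small send */
        fifo_push(&f, FRAG_SEND);  /* the real incoming message */

        /* One-fragment-per-call progress sees only the ACK; a second
         * call (i.e. a second MPI_Iprobe) is needed for the message. */
        printf("first progress_once:  %d (1 = ACK)\n", progress_once(&f));
        printf("second progress_once: %d (2 = SEND)\n", progress_once(&f));

        /* The draining version reaches the real message in one call. */
        fifo_t g = { .head = 0, .tail = 0 };
        fifo_push(&g, FRAG_ACK);
        fifo_push(&g, FRAG_SEND);
        printf("progress_drain_acks:  %d (2 = SEND)\n", progress_drain_acks(&g));
        return 0;
    }

The draining loop is the part I'm proposing; everything else is scaffolding to make the example self-contained.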
Before I go munging with the code, I wanted to make sure I am not overlooking something here. One concern: if I change the code to drain all the ACK messages, will that disrupt performance elsewhere?
--td