amjad ali wrote:
Hi all.
(Sorry for the duplication, if any.)
I have to parallelize a CFD code using domain/grid/mesh partitioning
among the processes. Before running, we do not know:
(i) how many processes we will use (np is unknown);
(ii) how many neighbouring processes each process will have (my_nbrs = ?);
(iii) how many entries a process needs to send to a particular
neighbouring process.
But once the code runs, I can calculate all of this information easily.
The problem is to copy a number of entries into an array and then send
that array to a destination process. The same sender has to repeat this
work to send data to each of its neighbouring processes. Is the
following code fine?
   DO i = 1, my_nbrs
      DO j = 1, few_entries_for_this_neighbour
         send_array(j) = my_array(jth_particular_entry)
      ENDDO
      CALL MPI_ISEND(send_array(1:j), j, MPI_REAL8, dest(i), tag, &
                     MPI_COMM_WORLD, request1(i), ierr)
   ENDDO

Instead of "j" I assume you intended something like
"few_entries_for_this_neighbour": after the inner DO loop completes, j
already holds few_entries_for_this_neighbour + 1.
And the corresponding receives, at each process:

   DO i = 1, my_nbrs
      k = few_entries_from_this_neighbour
      CALL MPI_IRECV(recv_array(1:k), k, MPI_REAL8, source(i), tag, &
                     MPI_COMM_WORLD, request2(i), ierr)
      DO j = 1, few_from_source(i)
         received_data(j) = recv_array(j)
      ENDDO
   ENDDO
After the above, MPI_WAITALL is called.
I think this code will not work, both for sending and receiving. For
the non-blocking sends we cannot reuse send_array to send data to the
other processes as above (since we are not sure the application buffer
is available for reuse). Am I right?
A similar problem exists with recv_array: data from multiple processes
cannot be received into the same array as above. Am I right?
Correct for both send and receive. When you call MPI_Isend, the buffer
cannot be written until the MPI_Waitall. When you use MPI_Irecv, you
cannot read the data until MPI_Waitall. You're reusing both send and
receive buffers too often and too soon.
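To spell that rule out in code, here is a minimal sketch (the names buf,
rbuf, n, m, dest, source, tag and req are placeholders, and USE mpi is
assumed):

   ! Sketch only: when a non-blocking buffer may be touched again.
   CALL MPI_ISEND(buf, n, MPI_REAL8, dest, tag, MPI_COMM_WORLD, req, ierr)
   ! ... buf must not be modified here ...
   CALL MPI_WAIT(req, MPI_STATUS_IGNORE, ierr)
   ! only now is it safe to overwrite buf, e.g. to pack the next neighbour

   CALL MPI_IRECV(rbuf, m, MPI_REAL8, source, tag, MPI_COMM_WORLD, req, ierr)
   ! ... rbuf must not be read here ...
   CALL MPI_WAIT(req, MPI_STATUS_IGNORE, ierr)
   ! only now do rbuf(1:m) actually hold the received values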
The target is to hide communication behind computation, so we need
non-blocking communication. As we do not know the value of np or the
value of my_nbrs for each process, we cannot decide in advance how many
arrays to create. Please suggest a solution.
You can allocate memory dynamically, even in Fortran.
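For example, once each process has computed its counts at run time,
something like the following should do (a sketch only; send_counts,
recv_counts and the other names are assumptions, not taken from your
code):

   ! Sketch: size everything at run time, once my_nbrs and the
   ! per-neighbour counts send_counts(1:my_nbrs) / recv_counts(1:my_nbrs)
   ! are known.
   REAL(KIND=8), ALLOCATABLE :: send_array(:), recv_array(:)
   INTEGER, ALLOCATABLE :: request1(:), request2(:)
   INTEGER :: total_send, total_recv, ierr

   total_send = SUM(send_counts(1:my_nbrs))
   total_recv = SUM(recv_counts(1:my_nbrs))
   ALLOCATE(send_array(total_send), recv_array(total_recv))
   ALLOCATE(request1(my_nbrs), request2(my_nbrs))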
A more subtle solution that I can think of is the following:

   cc = 0
   DO i = 1, my_nbrs
      DO j = 1, few_entries_for_this_neighbour
         send_array(cc+j) = my_array(jth_particular_entry)
      ENDDO
      CALL MPI_ISEND(send_array(cc:cc+j), j, MPI_REAL8, dest(i), tag, &
                     MPI_COMM_WORLD, request1(i), ierr)
      cc = cc + j
   ENDDO
Same issue with j as before, but yes concatenating the various send
buffers in a one-dimensional fashion should work.
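Written out with explicit offsets, the send side would look roughly
like this (again a sketch; send_counts(i) and send_index(:) are assumed
names for the per-neighbour counts and gather indices):

   ! Sketch: each neighbour gets its own disjoint slice of send_array,
   ! so nothing is overwritten while the non-blocking sends are in flight.
   cc = 0
   DO i = 1, my_nbrs
      DO j = 1, send_counts(i)
         send_array(cc + j) = my_array(send_index(cc + j))
      ENDDO
      CALL MPI_ISEND(send_array(cc + 1), send_counts(i), MPI_REAL8, dest(i), &
                     tag, MPI_COMM_WORLD, request1(i), ierr)
      cc = cc + send_counts(i)
   ENDDO
   ! send_array must not be written again until
   ! CALL MPI_WAITALL(my_nbrs, request1, MPI_STATUSES_IGNORE, ierr)

Passing the first element of the slice plus an explicit count, rather
than an array section, also avoids any chance of the compiler handing
MPI a temporary copy of the section for the non-blocking call.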
And the corresponding receives, at each process:

   cc = 0
   DO i = 1, my_nbrs
      k = few_entries_from_this_neighbour
      CALL MPI_IRECV(recv_array(cc+1:cc+k), k, MPI_REAL8, source(i), tag, &
                     MPI_COMM_WORLD, request2(i), ierr)
      DO j = 1, k
         received_data(j) = recv_array(cc+j)
      ENDDO
      cc = cc + k
   ENDDO
Okay, but you're still reading the data before the MPI_Waitall call. If
you call MPI_Irecv(buffer,...), you cannot read the buffer's contents
until the corresponding MPI_Waitall (or variant).
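In other words, the receive side needs to look more like this (a
sketch; recv_counts(i) is an assumed name), with all the unpacking
moved after the wait:

   ! Sketch: post one receive per neighbour into its own disjoint slice,
   ! wait for all of them, and only then unpack.
   cc = 0
   DO i = 1, my_nbrs
      CALL MPI_IRECV(recv_array(cc + 1), recv_counts(i), MPI_REAL8, source(i), &
                     tag, MPI_COMM_WORLD, request2(i), ierr)
      cc = cc + recv_counts(i)
   ENDDO

   CALL MPI_WAITALL(my_nbrs, request2, MPI_STATUSES_IGNORE, ierr)

   ! only now is the data guaranteed to be in recv_array
   cc = 0
   DO i = 1, my_nbrs
      DO j = 1, recv_counts(i)
         received_data(cc + j) = recv_array(cc + j)   ! or copy straight into your halo cells
      ENDDO
      cc = cc + recv_counts(i)
   ENDDO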
After the above, MPI_WAITALL is called.
This means that send_array for all neighbours has a collected shape:

   send_array = [... entries for nbr 1 ..., ... entries for nbr 2 ...,
                 ..., ... entries for last nbr ...]

and the respective entries are sent to the respective neighbours as
above. Likewise, recv_array for all neighbours has a collected shape:

   recv_array = [... entries from nbr 1 ..., ... entries from nbr 2 ...,
                 ..., ... entries from last nbr ...]

and the entries from each process are received into the respective
location/portion of recv_array.
Is this scheme fine and correct? I am in search of an efficient one.
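As noted above, the scheme itself (disjoint slices of one send_array
and one recv_array, with the MPI_WAITALL before any unpacking) should
work. To actually hide the communication behind computation, the usual
shape is roughly the following sketch; compute_interior,
compute_boundary and the count/index arrays are placeholders for your
own routines and data:

   ! Sketch of the overall pattern (placeholder names throughout):
   ! 1) post all receives, 2) pack and post all sends, 3) do the work
   ! that needs no neighbour data, 4) wait for the receives and unpack,
   ! 5) finish the boundary work, 6) wait for the sends before reusing
   ! send_array.
   cc = 0
   DO i = 1, my_nbrs
      CALL MPI_IRECV(recv_array(cc + 1), recv_counts(i), MPI_REAL8, source(i), &
                     tag, MPI_COMM_WORLD, request2(i), ierr)
      cc = cc + recv_counts(i)
   ENDDO

   cc = 0
   DO i = 1, my_nbrs
      DO j = 1, send_counts(i)
         send_array(cc + j) = my_array(send_index(cc + j))
      ENDDO
      CALL MPI_ISEND(send_array(cc + 1), send_counts(i), MPI_REAL8, dest(i), &
                     tag, MPI_COMM_WORLD, request1(i), ierr)
      cc = cc + send_counts(i)
   ENDDO

   CALL compute_interior()    ! work that does not need neighbour data

   CALL MPI_WAITALL(my_nbrs, request2, MPI_STATUSES_IGNORE, ierr)
   ! unpack recv_array into the ghost/halo entries here
   CALL compute_boundary()    ! work that needed the neighbour data

   CALL MPI_WAITALL(my_nbrs, request1, MPI_STATUSES_IGNORE, ierr)
   ! only now may send_array be reused for the next exchange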