Measuring communication performance is a tricky process; there are a lot of factors involved. Check out this FAQ item:

    http://www.open-mpi.org/faq/?category=tuning#running-perf-numbers

You might want to use a well-known benchmark program (e.g., NetPIPE) to run pair-wise communication performance analysis rather than writing your own application; it's typically not as simple as just doing a few sends within a loop.

The issue is that MPI may make different decisions about how to send messages depending on factors such as:

- is this the first time you have sent between this peer pair?
- who are you sending to?
- what is the size of the message?
- are there other messages pending?
- are other messages incoming from different peers while you are sending?

Your simplistic loop below can cause some "bad" things to happen (i.e., it won't give a true/absolute measure of the maximum performance between a pair of peers) because it unintentionally steps on several of the things that Open MPI does behind the scenes (e.g., we don't make network connections until the first time a message is sent between a given peer pair).
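
If you do want to roll a quick test by hand, the usual approach is a ping-pong: do a few untimed warm-up iterations first (so connection setup isn't counted), time a large block of round trips with a single pair of Wtime() calls, and report half the average round trip as the one-way time. Here's a rough sketch of that idea using the plain C bindings; the message size and iteration counts are just placeholder values, and it assumes exactly two processes:

    // Minimal ping-pong timing sketch -- not a substitute for NetPIPE et al.
    // Assumes exactly 2 ranks; buffer size and iteration counts are placeholders.
    #include <mpi.h>
    #include <cstdio>
    #include <vector>

    int main(int argc, char **argv)
    {
        MPI_Init(&argc, &argv);

        int rank, size;
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &size);
        if (size != 2) {
            if (rank == 0) std::fprintf(stderr, "run with exactly 2 processes\n");
            MPI_Finalize();
            return 1;
        }

        const int len   = 1024;   // doubles per message (placeholder)
        const int warm  = 10;     // untimed warm-up round trips (forces connection setup)
        const int iters = 1000;   // timed round trips

        std::vector<double> buf(len, 0.0);
        const int peer = 1 - rank;

        // Warm up: make sure the connection is established before timing.
        for (int k = 0; k < warm; ++k) {
            if (rank == 0) {
                MPI_Send(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD);
                MPI_Recv(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            } else {
                MPI_Recv(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
                MPI_Send(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD);
            }
        }

        // Time the whole block of round trips with one pair of Wtime() calls.
        double t0 = MPI_Wtime();
        for (int k = 0; k < iters; ++k) {
            if (rank == 0) {
                MPI_Send(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD);
                MPI_Recv(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            } else {
                MPI_Recv(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
                MPI_Send(&buf[0], len, MPI_DOUBLE, peer, 0, MPI_COMM_WORLD);
            }
        }
        double t1 = MPI_Wtime();

        if (rank == 0) {
            // Half the average round trip approximates the one-way time
            // for this message size.
            std::printf("avg one-way time: %g us\n",
                        (t1 - t0) / (2.0 * iters) * 1e6);
        }

        MPI_Finalize();
        return 0;
    }

Run it once with both processes on the same node and once with them on different nodes, and compare the two numbers; that comparison is much more meaningful once connection setup and per-call timer overhead are out of the picture.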

But on the flip side, there's a whole school of thought that micro benchmarks are only useful in a limited sense (because they test artificial scenarios), and the only thing that *really* matters is your application's performance. Hence, micro benchmarks are good input for guiding tuning efforts, but they are not the absolute measure of how well a given OS/middleware/network is performing. That being said, a poorly-written application will tend to perform poorly regardless of how well the OS/middleware/network performs.

And so on.

This is an age-old religious debate, and both sides have some good points. I won't re-hash the entire debate here. :-)


On Jun 4, 2007, at 10:00 AM, Allan, Mark (UK Filton) wrote:

Hi,

I'm new to this list and wonder if anyone can help. I'm trying to measure communication time between parallel processes using Open MPI. As an example, I might be running on 4 dual-core processors (8 processes in total). I was hoping that communication using shared memory (comms between the two cores on the same chip) would be faster than communication over the network. To measure communication time I'm sending a block of data to each process (from each process) using a blocking send, and timing how long it takes. I repeat this 50 times (for example) and take the average time. The code is something like:

 for (int i = 0; i < numProcs; i++)
   for (int j = 0; j < numProcs; j++)
     if (i != j)
     {
       // rank i sends to rank j; all other ranks just wait at the barrier
       double time = 0.0;
       for (int kk = 0; kk < 50; kk++)
       {
         if (i == my_rank)
         {
           double start = MPI::Wtime();
           MPI::COMM_WORLD.Send(&sendData[0], dataSize, MPI::DOUBLE, j, i);
           double end = MPI::Wtime();
           time += (end - start);
         }
         if (j == my_rank)
         {
           MPI::COMM_WORLD.Recv(&recvData[0], dataSize, MPI::DOUBLE, i, i);
         }
       }
       if (i == my_rank)
         out << i << " " << j << " " << time / 50.0 << std::endl;  // average time per send
       MPI::COMM_WORLD.Barrier();
     }

The problem I am having is that I'm not noticing any appreciable difference in communication times between shared memory and network protocols. I expected shared memory to be faster(!?!).

Does anyone have a better way of measuring communication times?

Thanks,

Mark.



--
Jeff Squyres
Cisco Systems
