date:20190320

Re: [OMPI devel] Memory performance with Bcast

2019-03-20 Thread George Bosilca

If you have FCA support, the collective may use the hardware offload. In any case, most of the bcast algorithms have logarithmic behavior in the number of processes P, so there will be at most O(log(P)) memory accesses on the root. If you want to take a look at the code in OMPI to understand
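
To make the logarithmic claim concrete, here is a minimal sketch of a binomial-tree broadcast schedule, one common tree shape for bcast. It is a plain Python simulation, not Open MPI's code: the function names `binomial_bcast_schedule` and `sends_from` are hypothetical, and which algorithm Open MPI actually picks for a given message size and communicator is not shown here.

```python
def binomial_bcast_schedule(num_ranks, root=0):
    """Return the rounds of a binomial-tree broadcast.

    Each round is a list of (sender, receiver) pairs. In every round, each
    rank that already holds the data forwards it to one new rank, so the
    number of ranks holding the data doubles until all are covered.
    """
    if num_ranks < 1:
        raise ValueError("num_ranks must be at least 1")
    rounds = []
    mask = 1
    while mask < num_ranks:
        pairs = []
        # Virtual ranks 0..mask-1 already hold the data (virtual rank 0 is the root).
        for vrank in range(mask):
            peer = vrank + mask
            if peer < num_ranks:
                # Shift virtual ranks back to real ranks relative to the root.
                sender = (vrank + root) % num_ranks
                receiver = (peer + root) % num_ranks
                pairs.append((sender, receiver))
        rounds.append(pairs)
        mask <<= 1
    return rounds


def sends_from(rank, schedule):
    """Count how many messages a given rank sends over the whole schedule."""
    return sum(1 for pairs in schedule for sender, _ in pairs if sender == rank)


if __name__ == "__main__":
    for p in (2, 8, 1000):
        schedule = binomial_bcast_schedule(p)
        print(p, "ranks:", len(schedule), "rounds,",
              sends_from(0, schedule), "sends from the root")
```

In this sketch the root sends once per round, so it reads its buffer ceil(log2(P)) times, compared with P-1 times if it sent to every other rank directly.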

[OMPI devel] Memory performance with Bcast

2019-03-20 Thread marcin.krotkiewski

Hi! I'm wondering about the details of the Bcast implementation in OpenMPI. I'm specifically interested in IB interconnects, but information about other architectures (and OpenMPI in general) would also be very useful. I am working with a code that sends the same (large) message to a bunch