Hi,
the hostnames (or their IPs) are only used to ssh orted.
if you use only the tcp btl :
TCP *MPI* communications (vs OOB management communications) are handled
by btl/tcp
by default, all usable interfaces are used, then messages are split
(iirc, by ob1 pml) and then "fragments"
are sent using all interfaces.
each interface has a latency and bandwidth that is used to split message
into fragments.
(assuming it is correctly configured, 90% of a large message is sent
over the 10GbE interface, and 10% is sent over the GbE interface)
if you can explicitly list/blacklist interface
mpirun --mca btl_tcp_if_include ...
or
mpirun --mca btl_tcp_if_exclude ...
(see ompi_info --all for the syntax)
but if you use several btls (for example tcp and openib), the btl(s)
with the lower exclusivity are not used.
(for example, a large message is *not* split and send using native ib,
IPoIB and GbE because the openib btl
has a higher exclusivity than the tcp btl)
did this answer your question ?
Cheers,
Gilles
On 4/8/2016 12:24 PM, dpchoudh . wrote:
Hello all
(Newbie warning! Sorry :-( )
Let's say my cluster has 7 nodes, connected via IP-over-Ethernet for
control traffic and some kind of raw verbs (or anything else such as
SRIO) interface for data transfer. Let's say my host file chooses 4
out of the 7 nodes for an MPI job, based on the IP address, which are
assigned to the Ethernet interfaces.
My question is: where in the code does this mapping between
IP-to-whatever_interface_is_used_for_MPI_Send/Recv is determined, such
as only those chosen nodes receive traffic over the verbs interface?
Thanks in advance
Durga
We learn from history that we never learn from history.
_______________________________________________
devel mailing list
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post:
http://www.open-mpi.org/community/lists/devel/2016/04/18746.php