Re: [OMPI devel] Fwd: Questions on rank-to-address mapping

2017-04-20 Thread Jeff Squyres (jsquyres)
On Apr 20, 2017, at 12:16 AM, Marc Cooper wrote: > > I am trying to understand how connections are established among MPI ranks. > Pardon for the list of questions. > > 1) Is there a global data structure that creates and stores rank to network > address (uri or port number) mapping It's quite

Re: [OMPI devel] openib oob module

2017-04-20 Thread r...@open-mpi.org
I’m not seeing any problem inside the OOB - the problem appears to be in the info being given to it: [host1:16244] 1 more process has sent help message help-mpi-btl-openib.txt / default subnet prefix [host1:16244] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error message

Re: [OMPI devel] openib oob module

2017-04-20 Thread Shiqing Fan
I have already started a bit on 1.10 branch with following command line: mpirun -np 2 -host 192.168.2.21,192.168.2.22 --mca btl_openib_use_eager_rdma 0 --mca btl openib,self --mca btl_openib_cpc_include oob -mca oob_tcp_if_include br0 -mca btl_base_verbose 100 /home/shiqing/workspace/netpipe-3.7

Re: [OMPI devel] openib oob module

2017-04-20 Thread r...@open-mpi.org
Yeah, I forgot that the 1.10 series still had the BTLs in OMPI. Should be able to restore it. I honestly don’t recall the bug, though :-( If you want to try reviving it, you can add some debug in there (plus turn on the OOB verbosity) and I’m happy to help you figure it out. Ralph > On Apr 20,

Re: [OMPI devel] openib oob module

2017-04-20 Thread Shiqing Fan
Hi Ralph, Yes, it’s been a long time. Hope you all are doing well (I believe so ☺ ). I’m working on a virtualization project, and need to run Open MPI on an unikernel OS (most of OFED is missing/unsupported). Actually I’m only focusing on 1.10.2, which still has oob in ompi. Probably it might

Re: [OMPI devel] openib oob module

2017-04-20 Thread Gilles Gouaillardet
Ralph, in v1.10 series, BTL is still in the OMPI layer. from v2, could the BTL directly use PMIx instead of rml (orte) ? Cheers, Gilles On Thursday, April 20, 2017, r...@open-mpi.org wrote: > Hi Shiqing! > > Been a long time - hope you are doing well. > > I see no way to bring the oob module

Re: [OMPI devel] openib oob module

2017-04-20 Thread r...@open-mpi.org
Hi Shiqing! Been a long time - hope you are doing well. I see no way to bring the oob module back now that the BTLs are in the OPAL layer - this is why it was removed as the oob is in ORTE, and thus not accessible from OPAL. Ralph > On Apr 20, 2017, at 6:02 AM, Shiqing Fan wrote: > > Dear al

[OMPI devel] openib oob module

2017-04-20 Thread Shiqing Fan
Dear all, I noticed that openib oob module has been removed since a long time ago, because it wasn't working anymore and nobody seemed need it. But for some special operating system, where the rdmacm, udcm or ibcm kernel support is missing, oob may still be necessary. I'm curious if it's possib