[hwloc-devel] hwloc nightly build: FAILURE

2017-04-21 Thread mpiteam
Successful builds: ['v1.11'] Skipped builds: [] Failed builds: ['master'] === Build output === Branches: ['v1.11', 'master'] Starting build for v1.11 Found new revision 646c39e v1.11 build of revision 646c39e completed successfully Starting build for master Found new revision 4aacd8b

Re: [OMPI devel] openib oob module

2017-04-21 Thread Shiqing Fan
The gap between these two versions is quite huge. I will first try to debug a bit more in 1.10. Regards, Shiqing -Original Message- From: devel [mailto:devel-boun...@lists.open-mpi.org] On Behalf Of r...@open-mpi.org Sent: Friday, April 21, 2017 4:02 PM To: OpenMPI Devel Subject: Re:

Re: [OMPI devel] openib oob module

2017-04-21 Thread r...@open-mpi.org
I’m not familiar with the openib code, but this looks to me like it may be caused by a change in the openib code itself. Have you looked to see what the diff might be between the two versions? > On Apr 21, 2017, at 6:45 AM, Shiqing Fan wrote: > > I've tried this out,

Re: [OMPI devel] openib oob module

2017-04-21 Thread Shiqing Fan
I've tried this out, and got the same problem as I sent before. With the same configuration and command line, 1.6.5 works for me, but the 1.10 series does not. Could it also be an IB configuration issue? (ib_write/read_bw/lat work fine across the two nodes) Error output below:

[OMPI devel] rpm filename (was: Program which runs with 1.8.3, fails with 2.0.2)

2017-04-21 Thread Jeff Squyres (jsquyres)
On Apr 19, 2017, at 10:12 PM, Kevin Buckley wrote: > > This observation may not belong here, but as there are some eyes on this > issue, I might as well raise it here, as I came across it in the wake of going > with Choice 2 > > If one wishes to take a

Re: [OMPI devel] openib oob module

2017-04-21 Thread Shiqing Fan
Thanks Gilles, I will try it out today and let you know if it's working for me or not. Regards, Shiqing -Original Message- From: devel [mailto:devel-boun...@lists.open-mpi.org] On Behalf Of Gilles Gouaillardet Sent: Friday, April 21, 2017 9:41 AM To: devel@lists.open-mpi.org Subject:

Re: [OMPI devel] openib oob module

2017-04-21 Thread Shiqing Fan
The last message was from my test output, it makes no sense anyway. It looks like some QP/CQ initialization problem, but it’s hard to find the exact place at the moment. I will try Gilles’ patch and see if it’s working for me. PS: Actually I made the patch from 1.10 series when OOB was removed.

Re: [OMPI devel] openib oob module

2017-04-21 Thread Gilles Gouaillardet
Folks, fwiw, i made https://github.com/open-mpi/ompi/pull/3393 and it works for me on a mlx4 cluster (Mellanox QDR) Cheers, Gilles On 4/21/2017 1:31 AM, r...@open-mpi.org wrote: I’m not seeing any problem inside the OOB - the problem appears to be in the info being given to it: