The timing on this is less than ideal for me.

To accommodate work on some high-voltage switching equipment, our building
will be without power over the weekend.
The system I use to autogen will be OFF from around 3pm today until perhaps
3pm on Monday.
I will also be busy with shutting down our group's systems gracefully today
and bringing them back on Monday.

The test platforms where I have reproduced the failures is NOT going to be
off-line.
So, I will be able to test only *tarballs* (but not patches to .m4 files)
until probably Monday evening.

Gilles,

I think it reasonable to suspect the lib could hold a stub that returns
ENOSYS for the deprecated function.
I suspect that checking for ibv_create_xrc_rcv_qp+IBV_QPT_XRC should work
for the rhel6.5 failure case described previously.
That way the checks for the two flavors both look for a function in the lib
and a constant in the header.


-Paul

On Fri, Jul 10, 2015 at 8:21 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com
> wrote:

> Yes, I seem to recall that this issue came up before... ah, here it is:
>
> commit 04bec4475e5a962432b73dd6254f62bb263703ab
> Author: Jeff Squyres <jsquy...@cisco.com>
> Date:   Fri Jan 16 18:13:31 2015 -0800
>
>     openib: check more thoroughly for XRC
>
>     Some systems have XRC symbols in their libibverbs libraries, but do
>     not have the appropriate XRC bits in their devel headers (cough cough
>     RHEL 6.5 libibverbs-rocee-*.x86-64.rpm cough cough).
>
>     So expand the XRC config checks to ensure that we can actually find
>     one of the XRC constants that we need to compile XRC code before
>     ruling that we can actually build XRC support.
>
>
>
> > On Jul 10, 2015, at 10:33 AM, Gilles Gouaillardet <
> gilles.gouaillar...@gmail.com> wrote:
> >
> > Sorry about that, and thanks for reverting the commit.
> >
> > Paul mentioned a patch I sent to the ml, and that worked for him.
> > The commit was supposed to be a more robust version.
> > For example, in rhel7, the deprecated function have been removed, but
> the xrc domains is fine.
> > Currently, xrc is not supported as it should.
> >
> > It seems rhel 6.5 has the deprecated function, but it is not in the
> header files are missing it among other things.
> >
> > I will fix that and post a issue a pr so you can test it on rhel6.5
> before I commit it.
> >
> > I noticed there is no infiniband/verbs.h on a lanl test cluster (the non
> cray one)
> > Is it possible to have it installed ?
> >
> > Cheers,
> >
> > Gilles
> >
> > On Friday, July 10, 2015, Jeff Squyres (jsquyres) <jsquy...@cisco.com>
> wrote:
> > On Jul 10, 2015, at 2:12 AM, Paul Hargrove <phhargr...@lbl.gov> wrote:
> > >
> > > The only "new" (non-cosmetic) problem I observed was the failure to
> detect "ConnectX XRC support".
> > > It looks like Gilles and I iterated on that issue until we have
> something that works now.
> >
> > 'fraid not.  :-(
> >
> > Per
> https://github.com/open-mpi/ompi-release/pull/384#issuecomment-120412836,
> the latest commit breaks on RHEL 6.5 systems that do not have MOFED
> installed.
> >
> > --
> > Jeff Squyres
> > jsquy...@cisco.com
> > For corporate legal information go to:
> http://www.cisco.com/web/about/doing_business/legal/cri/
> >
> > _______________________________________________
> > devel mailing list
> > de...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/07/17618.php
> > _______________________________________________
> > devel mailing list
> > de...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/07/17620.php
>
>
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to:
> http://www.cisco.com/web/about/doing_business/legal/cri/
>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2015/07/17623.php
>



-- 
Paul H. Hargrove                          phhargr...@lbl.gov
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department               Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900

Reply via email to