[OMPI devel] RFC: Merge PMIx branch to trunk

2014-08-14 Thread Ralph Castain
WHAT:Merge the PMIx branch into the devel repo, creating a new

   OPAL “lmix” framework to abstract PMI support for all RTEs.

   Replace the ORTE daemon-level collectives with a new PMIx

   server and update the ORTE grpcomm framework to support

   server-to-server collectives


WHY:  We’ve had problems dealing with variations in PMI implementations,

   and need to extend the existing PMI definitions to meet
exascale

   requirements.


WHEN:   Mon, Aug 25


WHERE:  https://github.com/rhc54/ompi-svn-mirror.git


Several community members have been working on a refactoring of the current
PMI support within OMPI. Although the APIs are common, Slurm and Cray
implement a different range of capabilities, and package them differently.
For example, Cray provides an integrated PMI-1/2 library, while Slurm
separates the two and requires the user to specify the one to be used at
runtime. In addition, several bugs in the Slurm implementations have caused
problems requiring extra coding.


All this has led to a slew of #if’s in the PMI code and bugs when the
corner-case logic for one implementation accidentally traps the other.
Extending this support to other implementations would have increased this
complexity to an unacceptable level.


Accordingly, we have:


* created a new OPAL “pmix” framework to abstract the PMI support, with
separate components for Cray, Slurm PMI-1, and Slurm PMI-2 implementations.


* Replaced the current ORTE grpcomm daemon-based collective operation with
an integrated PMIx server, and updated the grpcomm APIs to provide more
flexible, multi-algorithm support for collective operations. At this time,
only the xcast and allgather operations are supported.


* Replaced the current global collective id with a signature based on the
names of the participating procs. The allows an unlimited number of
collectives to be executed by any group of processes, subject to the
requirement that only one collective can be active at a time for a unique
combination of procs. Note that a proc can be involved in any number of
simultaneous collectives - it is the specific combination of procs that is
subject to the constraint


* removed the prior OMPI/OPAL modex code


* added new macros for executing modex send/recv to simplify use of the new
APIs. The send macros allow the caller to specify whether or not the BTL
supports async modex operations - if so, then the non-blocking “fence”
operation is used, if the active PMIx component supports it. Otherwise, the
default is a full blocking modex exchange as we currently perform.


* retained the current flag that directs us to use a blocking fence
operation, but only to retrieve data upon demand


Re: [OMPI devel] RFC: add opal/threads/spinlock.h

2014-08-14 Thread George Bosilca
SHARED is only supported when the pthread library does support spinlock,
while in all other case it falls back into using atomic locks. Providing
support only for a small fraction of environments without reporting errors
or providing any alternative on other systems is difficult to accept.

I think I prefer your suggestion to integrate this with the atomic locks
and provide a single mechanism to handle them. Regarding this do you think
there will be a need to separate the spinlocks and the atomic locks?

  George.



On Thu, Aug 14, 2014 at 11:40 AM, Dave Goodell (dgoodell) <
dgood...@cisco.com> wrote:

> WHAT: Add a new "opal/threads/spinlock.h" header to OPAL that will
> typically use the OS spinlock primitives if present.
>
> WHY: opal_mutex_t is too slow for some use cases and opal_atomic_lock_t is
> inefficiently implemented for most architectures
>
> WHEN: timeout is after next week's engineering call on Tuesday, 2014-08-19
>
>
> As discussed at the June developer meeting, I propose this patch to add
> spinlocks to OPAL.  There are at least a half dozen reasonable ways to
> implement spinlocks; which one is best will vary from platform to platform.
>  In general, the OS spinlock implementations are well tested and efficient.
>  We should usually be relying on those implementations instead of rolling
> our own.
>
>
> My primary criticism of this patch is that it muddies the waters a bit
> with opal_atomic_lock_t.  An alternative approach would be to spend some
> time working on improving the opal_atomic_lock_t implementation, but I have
> two concerns with this approach:
>
> 1) It's very difficult for me to measure the potential performance impact
> of opal_atomic_lock_t modifications on all of the various platforms that we
> currently run on.  Adding this new implementation allows component
> maintainers to decide if and when to convert to using the new facility.
>
> 2) There's a reasonable chance that I'll make a mistake.  Writing tests
> for this stuff helps to catch the really basic errors, but it doesn't help
> as much with the really subtle mistakes.
>
> -Dave
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15652.php
>


Re: [OMPI devel] [1.8.2rc4] OSHMEM fortran bindings with bad compilers

2014-08-14 Thread Paul Hargrove
Josh,

The specific compilers that caused the most problems are the older PGI
compilers (any before 13.x).
In this case the user has the option to update their compiler to 13.10 or
newer.

There are also issues with IBM's xlf.
For the IBM compiler I have never found a version that builds/links the MPI
f08 bindings, and now also find no version that can link the OSHMEM fortran
bindings.

-Paul

-Paul


On Thu, Aug 14, 2014 at 11:30 AM, Joshua Ladd  wrote:

> We will update the README accordingly. Thank you, Paul.
>
> Josh
>
>
> On Thu, Aug 14, 2014 at 10:00 AM, Jeff Squyres (jsquyres) <
> jsquy...@cisco.com> wrote:
>
>> Good points.
>>
>> Mellanox -- can you update per Paul's suggestions?
>>
>>
>> On Aug 13, 2014, at 8:26 PM, Paul Hargrove  wrote:
>>
>> > The following is NOT a bug report.
>> > This is just an observation that may deserve some text in the README.
>> >
>> > I've reported issues in the past with some Fortran compilers (mostly
>> older XLC and PGI) which either cannot build the "use mpi_f08" module, or
>> cannot correctly link to it (and sometimes this fails only if configured
>> with --enable-debug).
>> >
>> > Testing the OSHMEM Fortran bindings (enabled by default on Linux) I
>> have found several compilers which fail to link the examples
>> (hello_oshmemfh and ring_oshmemfh).  I reported one specific instance (with
>> xlc-11/xlf-13) back in February:
>> http://www.open-mpi.org/community/lists/devel/2014/02/14057.php
>> >
>> > So far I have these failures only on platforms where the Fortran
>> compiler is *known* to be broken for the MPI f90 and/or f08 bindings.
>> Specifically, all the failing platforms are ones on which either:
>> > + Configure determines (without my help) that FC cannot build the F90
>> and/or F08 modules.
>> > OR
>> > + I must pass --enable-mpi-fortran=usempi or --enable-mpi-fortran=mpifh
>> for cases configure cannot detect.
>> >
>> > So, I do *not* believe there is anything wrong with the OSHMEM code,
>> which is why I started this post with "The following is NOT a bug report".
>> However, I have two recommendations to make:
>> >
>> > 1) Documentation
>> >
>> > The README says just:
>> >
>> > --disable-oshmem-fortran
>> >   Disable building only the Fortran OSHMEM bindings.
>> >
>> > So, I recommend adding a sentence there referencing the "Compiler
>> Notes" section of the README which has details on some known bad Fortran
>> compilers.
>> >
>> > 2) Configure:
>> >
>> > As I noted above, at least some of the failures are on platforms where
>> configure has determined it cannot build the f08 MPI bindings.  So, maybe
>> there is something that could be done at configure time to disqualify some
>> Fortran compilers from building the OSHMEM fotran bindings, too.
>> >
>> > -Paul
>> >
>> > --
>> > Paul H. Hargrove  phhargr...@lbl.gov
>> > Future Technologies Group
>> > Computer and Data Sciences Department Tel: +1-510-495-2352
>> > Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
>> > ___
>> > devel mailing list
>> > de...@open-mpi.org
>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> > Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/08/15643.php
>>
>>
>> --
>> Jeff Squyres
>> jsquy...@cisco.com
>> For corporate legal information go to:
>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/08/15650.php
>>
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15653.php
>



-- 
Paul H. Hargrove  phhargr...@lbl.gov
Future Technologies Group
Computer and Data Sciences Department Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory Fax: +1-510-486-6900


Re: [OMPI devel] [1.8.2rc4] OSHMEM fortran bindings with bad compilers

2014-08-14 Thread Joshua Ladd
We will update the README accordingly. Thank you, Paul.

Josh


On Thu, Aug 14, 2014 at 10:00 AM, Jeff Squyres (jsquyres) <
jsquy...@cisco.com> wrote:

> Good points.
>
> Mellanox -- can you update per Paul's suggestions?
>
>
> On Aug 13, 2014, at 8:26 PM, Paul Hargrove  wrote:
>
> > The following is NOT a bug report.
> > This is just an observation that may deserve some text in the README.
> >
> > I've reported issues in the past with some Fortran compilers (mostly
> older XLC and PGI) which either cannot build the "use mpi_f08" module, or
> cannot correctly link to it (and sometimes this fails only if configured
> with --enable-debug).
> >
> > Testing the OSHMEM Fortran bindings (enabled by default on Linux) I have
> found several compilers which fail to link the examples (hello_oshmemfh and
> ring_oshmemfh).  I reported one specific instance (with xlc-11/xlf-13) back
> in February:
> http://www.open-mpi.org/community/lists/devel/2014/02/14057.php
> >
> > So far I have these failures only on platforms where the Fortran
> compiler is *known* to be broken for the MPI f90 and/or f08 bindings.
> Specifically, all the failing platforms are ones on which either:
> > + Configure determines (without my help) that FC cannot build the F90
> and/or F08 modules.
> > OR
> > + I must pass --enable-mpi-fortran=usempi or --enable-mpi-fortran=mpifh
> for cases configure cannot detect.
> >
> > So, I do *not* believe there is anything wrong with the OSHMEM code,
> which is why I started this post with "The following is NOT a bug report".
> However, I have two recommendations to make:
> >
> > 1) Documentation
> >
> > The README says just:
> >
> > --disable-oshmem-fortran
> >   Disable building only the Fortran OSHMEM bindings.
> >
> > So, I recommend adding a sentence there referencing the "Compiler Notes"
> section of the README which has details on some known bad Fortran compilers.
> >
> > 2) Configure:
> >
> > As I noted above, at least some of the failures are on platforms where
> configure has determined it cannot build the f08 MPI bindings.  So, maybe
> there is something that could be done at configure time to disqualify some
> Fortran compilers from building the OSHMEM fotran bindings, too.
> >
> > -Paul
> >
> > --
> > Paul H. Hargrove  phhargr...@lbl.gov
> > Future Technologies Group
> > Computer and Data Sciences Department Tel: +1-510-495-2352
> > Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
> > ___
> > devel mailing list
> > de...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> > Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15643.php
>
>
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to:
> http://www.cisco.com/web/about/doing_business/legal/cri/
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15650.php
>


[OMPI devel] RFC: add opal/threads/spinlock.h

2014-08-14 Thread Dave Goodell (dgoodell)
WHAT: Add a new "opal/threads/spinlock.h" header to OPAL that will typically 
use the OS spinlock primitives if present.

WHY: opal_mutex_t is too slow for some use cases and opal_atomic_lock_t is 
inefficiently implemented for most architectures

WHEN: timeout is after next week's engineering call on Tuesday, 2014-08-19


As discussed at the June developer meeting, I propose this patch to add 
spinlocks to OPAL.  There are at least a half dozen reasonable ways to 
implement spinlocks; which one is best will vary from platform to platform.  In 
general, the OS spinlock implementations are well tested and efficient.  We 
should usually be relying on those implementations instead of rolling our own.


My primary criticism of this patch is that it muddies the waters a bit with 
opal_atomic_lock_t.  An alternative approach would be to spend some time 
working on improving the opal_atomic_lock_t implementation, but I have two 
concerns with this approach:

1) It's very difficult for me to measure the potential performance impact of 
opal_atomic_lock_t modifications on all of the various platforms that we 
currently run on.  Adding this new implementation allows component maintainers 
to decide if and when to convert to using the new facility.

2) There's a reasonable chance that I'll make a mistake.  Writing tests for 
this stuff helps to catch the really basic errors, but it doesn't help as much 
with the really subtle mistakes.

-Dave



0001-add-opal-threads-spinlock.h.patch
Description: 0001-add-opal-threads-spinlock.h.patch


Re: [OMPI devel] 1.8.4rc4 is out

2014-08-14 Thread Jeff Squyres (jsquyres)
Woot -- thank you!

On Aug 14, 2014, at 1:19 AM, Ralph Castain  wrote:

> Fantastic - thanks Paul!!!
> 
> 
> 
> On Wed, Aug 13, 2014 at 9:18 PM, Paul Hargrove  wrote:
> I have completed testing the majority of the platforms I have access to.
> The only issue that is not already known to exist in earlier releases was the 
> missing osx/atomic.h, for which Ralph promptly merged George's fix.
> 
> If I include the re-tested osx-atomics (which passes w/ 1.8.2rc5r32531), I 
> have success on 75 distinct configurations which include x86, x86-64, 
> sparc-v8+, sparc64-v9, ppc32 and ppc64 ABIs with various releases of Linux, 
> Mac OS X, Solaris, FreeBSD, NetBSD and OpenBSD, with all sorts of compilers, 
> and static linking (w/o romio :-)) on at least one configuration for each OS.
> 
> I will have results on ia64, ARMv5, ARMv7 and 3 MIPS ABIs in the next day or 
> two.
> 
> Looks good to me.
> -Paul
> 
> 
> On Wed, Aug 13, 2014 at 1:37 PM, Jeff Squyres (jsquyres)  
> wrote:
> Please test!  Ralph would like to release after the teleconf next Tuesday:
> 
> http://www.open-mpi.org/software/ompi/v1.8/
> 
> Changes since last rc:
> 
> - Fix cascading/over-quoting in some cases with the rsh/ssh-based
>   launcher.  Thanks to multiple users for raising the issue.
> - Properly add support for gfortran 4.9 ignore TKR pragma (it was
>   erroneously only partially added in v1.7.5).  Thanks to Marcus
>   Daniels for raising the issue.
> - Update/improve help messages in the usnic BTL.
> - Resolve a race condition in MPI_Abort.
> - Fix obscure cases where static linking from wrapper compilers would
>   fail.
> - Clarify the configure --help message about when OpenSHMEM is
>   enabled/disabled by default.  Thanks to Paul Hargrove for the
>   suggestion.
> - Align pages properly where relevant.  Thanks to Paul Hargrove for
>   identifying the issue.
> - Various compiler warning and minor fixes for OpenBSD, FreeBSD, and
>   Solaris/SPARC.  Thanks to Paul Hargrove for the patches.
> 
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to: 
> http://www.cisco.com/web/about/doing_business/legal/cri/
> 
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/08/15640.php
> 
> 
> 
> -- 
> Paul H. Hargrove  phhargr...@lbl.gov
> Future Technologies Group
> Computer and Data Sciences Department Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
> 
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/08/15647.php
> 
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/08/15649.php


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/



Re: [OMPI devel] [1.8.2rc4] OSHMEM fortran bindings with bad compilers

2014-08-14 Thread Jeff Squyres (jsquyres)
Good points.

Mellanox -- can you update per Paul's suggestions?


On Aug 13, 2014, at 8:26 PM, Paul Hargrove  wrote:

> The following is NOT a bug report.
> This is just an observation that may deserve some text in the README.
> 
> I've reported issues in the past with some Fortran compilers (mostly older 
> XLC and PGI) which either cannot build the "use mpi_f08" module, or cannot 
> correctly link to it (and sometimes this fails only if configured with 
> --enable-debug).
> 
> Testing the OSHMEM Fortran bindings (enabled by default on Linux) I have 
> found several compilers which fail to link the examples (hello_oshmemfh and 
> ring_oshmemfh).  I reported one specific instance (with xlc-11/xlf-13) back 
> in February: http://www.open-mpi.org/community/lists/devel/2014/02/14057.php
> 
> So far I have these failures only on platforms where the Fortran compiler is 
> *known* to be broken for the MPI f90 and/or f08 bindings.  Specifically, all 
> the failing platforms are ones on which either:
> + Configure determines (without my help) that FC cannot build the F90 and/or 
> F08 modules.
> OR
> + I must pass --enable-mpi-fortran=usempi or --enable-mpi-fortran=mpifh for 
> cases configure cannot detect.
> 
> So, I do *not* believe there is anything wrong with the OSHMEM code, which is 
> why I started this post with "The following is NOT a bug report".  However, I 
> have two recommendations to make:
> 
> 1) Documentation
> 
> The README says just:
> 
> --disable-oshmem-fortran
>   Disable building only the Fortran OSHMEM bindings.
> 
> So, I recommend adding a sentence there referencing the "Compiler Notes" 
> section of the README which has details on some known bad Fortran compilers.
> 
> 2) Configure: 
> 
> As I noted above, at least some of the failures are on platforms where 
> configure has determined it cannot build the f08 MPI bindings.  So, maybe 
> there is something that could be done at configure time to disqualify some 
> Fortran compilers from building the OSHMEM fotran bindings, too.
> 
> -Paul
> 
> -- 
> Paul H. Hargrove  phhargr...@lbl.gov
> Future Technologies Group
> Computer and Data Sciences Department Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/08/15643.php


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/



Re: [OMPI devel] 1.8.4rc4 is out

2014-08-14 Thread Ralph Castain
Fantastic - thanks Paul!!!



On Wed, Aug 13, 2014 at 9:18 PM, Paul Hargrove  wrote:

> I have completed testing the majority of the platforms I have access to.
> The only issue that is not already known to exist in earlier releases was
> the missing osx/atomic.h, for which Ralph promptly merged George's fix.
>
> If I include the re-tested osx-atomics (which passes w/ 1.8.2rc5r32531), I
> have success on 75 distinct configurations which include x86, x86-64,
> sparc-v8+, sparc64-v9, ppc32 and ppc64 ABIs with various releases of Linux,
> Mac OS X, Solaris, FreeBSD, NetBSD and OpenBSD, with all sorts of
> compilers, and static linking (w/o romio :-)) on at least one configuration
> for each OS.
>
> I will have results on ia64, ARMv5, ARMv7 and 3 MIPS ABIs in the next day
> or two.
>
> Looks good to me.
> -Paul
>
>
> On Wed, Aug 13, 2014 at 1:37 PM, Jeff Squyres (jsquyres) <
> jsquy...@cisco.com> wrote:
>
>> Please test!  Ralph would like to release after the teleconf next Tuesday:
>>
>> http://www.open-mpi.org/software/ompi/v1.8/
>>
>> Changes since last rc:
>>
>> - Fix cascading/over-quoting in some cases with the rsh/ssh-based
>>   launcher.  Thanks to multiple users for raising the issue.
>> - Properly add support for gfortran 4.9 ignore TKR pragma (it was
>>   erroneously only partially added in v1.7.5).  Thanks to Marcus
>>   Daniels for raising the issue.
>> - Update/improve help messages in the usnic BTL.
>> - Resolve a race condition in MPI_Abort.
>> - Fix obscure cases where static linking from wrapper compilers would
>>   fail.
>> - Clarify the configure --help message about when OpenSHMEM is
>>   enabled/disabled by default.  Thanks to Paul Hargrove for the
>>   suggestion.
>> - Align pages properly where relevant.  Thanks to Paul Hargrove for
>>   identifying the issue.
>> - Various compiler warning and minor fixes for OpenBSD, FreeBSD, and
>>   Solaris/SPARC.  Thanks to Paul Hargrove for the patches.
>>
>> --
>> Jeff Squyres
>> jsquy...@cisco.com
>> For corporate legal information go to:
>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/08/15640.php
>>
>
>
>
> --
> Paul H. Hargrove  phhargr...@lbl.gov
> Future Technologies Group
> Computer and Data Sciences Department Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15647.php
>


Re: [OMPI devel] [1.8.2rc4] build failure with --enable-osx-builtin-atomics

2014-08-14 Thread Ralph Castain
Done - thanks!



On Wed, Aug 13, 2014 at 9:06 PM, George Bosilca  wrote:

> The atomic.h file should also be trimmed of the SPARC relique.
>
>   George.
>
>
> Index: opal/include/opal/sys/atomic.h
> ===
> --- opal/include/opal/sys/atomic.h (revision 32531)
> +++ opal/include/opal/sys/atomic.h (working copy)
> @@ -162,8 +162,6 @@
>  #include "opal/sys/powerpc/atomic.h"
>  #elif OPAL_ASSEMBLY_ARCH == OMPI_POWERPC64
>  #include "opal/sys/powerpc/atomic.h"
> -#elif OPAL_ASSEMBLY_ARCH == OMPI_SPARC
> -#include "opal/sys/sparc/atomic.h"
>  #elif OPAL_ASSEMBLY_ARCH == OMPI_SPARCV9_32
>  #include "opal/sys/sparcv9/atomic.h"
>  #elif OPAL_ASSEMBLY_ARCH == OMPI_SPARCV9_64
>
>
> On Thu, Aug 14, 2014 at 12:01 AM, Paul Hargrove 
> wrote:
>
>> Fix confirmed using the nightly tarball (v1.8rc5r32531).
>>
>> -Paul
>>
>>
>> On Wed, Aug 13, 2014 at 6:16 PM, Ralph Castain  wrote:
>>
>>> Thanks Paul - fixed in r32530
>>>
>>>
>>>
>>> On Wed, Aug 13, 2014 at 2:42 PM, Paul Hargrove 
>>> wrote:
>>>
 When configured with --enable-osx-builtin-atomics

 Making all in asm
   CC   asm.lo
 In file included from
 /Users/Paul/OMPI/openmpi-1.8.2rc4-macos10.8-x86-clang-atomics/openmpi-1.8.2rc4/opal/asm/asm.c:21:
 /Users/Paul/OMPI/openmpi-1.8.2rc4-macos10.8-x86-clang-atomics/openmpi-1.8.2rc4/opal/include/opal/sys/atomic.h:145:10:
 fatal error: 'opal/sys/osx/atomic.h' file not found
 #include "opal/sys/osx/atomic.h"
  ^
 1 error generated.

 I reported the same issue to George in the trunk last week.
 So, I am 95% certain one just needs to cmr r32390 (commit msg == 'Dont
 miss the Os X atomics on "make dist"')


 -Paul

 --
 Paul H. Hargrove  phhargr...@lbl.gov
 Future Technologies Group
 Computer and Data Sciences Department Tel: +1-510-495-2352
 Lawrence Berkeley National Laboratory Fax: +1-510-486-6900

 ___
 devel mailing list
 de...@open-mpi.org
 Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
 Link to this post:
 http://www.open-mpi.org/community/lists/devel/2014/08/15642.php

>>>
>>>
>>> ___
>>> devel mailing list
>>> de...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2014/08/15644.php
>>>
>>
>>
>>
>> --
>> Paul H. Hargrove  phhargr...@lbl.gov
>> Future Technologies Group
>> Computer and Data Sciences Department Tel: +1-510-495-2352
>> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
>>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/08/15645.php
>>
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15646.php
>


Re: [OMPI devel] 1.8.4rc4 is out

2014-08-14 Thread Paul Hargrove
I have completed testing the majority of the platforms I have access to.
The only issue that is not already known to exist in earlier releases was
the missing osx/atomic.h, for which Ralph promptly merged George's fix.

If I include the re-tested osx-atomics (which passes w/ 1.8.2rc5r32531), I
have success on 75 distinct configurations which include x86, x86-64,
sparc-v8+, sparc64-v9, ppc32 and ppc64 ABIs with various releases of Linux,
Mac OS X, Solaris, FreeBSD, NetBSD and OpenBSD, with all sorts of
compilers, and static linking (w/o romio :-)) on at least one configuration
for each OS.

I will have results on ia64, ARMv5, ARMv7 and 3 MIPS ABIs in the next day
or two.

Looks good to me.
-Paul


On Wed, Aug 13, 2014 at 1:37 PM, Jeff Squyres (jsquyres)  wrote:

> Please test!  Ralph would like to release after the teleconf next Tuesday:
>
> http://www.open-mpi.org/software/ompi/v1.8/
>
> Changes since last rc:
>
> - Fix cascading/over-quoting in some cases with the rsh/ssh-based
>   launcher.  Thanks to multiple users for raising the issue.
> - Properly add support for gfortran 4.9 ignore TKR pragma (it was
>   erroneously only partially added in v1.7.5).  Thanks to Marcus
>   Daniels for raising the issue.
> - Update/improve help messages in the usnic BTL.
> - Resolve a race condition in MPI_Abort.
> - Fix obscure cases where static linking from wrapper compilers would
>   fail.
> - Clarify the configure --help message about when OpenSHMEM is
>   enabled/disabled by default.  Thanks to Paul Hargrove for the
>   suggestion.
> - Align pages properly where relevant.  Thanks to Paul Hargrove for
>   identifying the issue.
> - Various compiler warning and minor fixes for OpenBSD, FreeBSD, and
>   Solaris/SPARC.  Thanks to Paul Hargrove for the patches.
>
> --
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to:
> http://www.cisco.com/web/about/doing_business/legal/cri/
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15640.php
>



-- 
Paul H. Hargrove  phhargr...@lbl.gov
Future Technologies Group
Computer and Data Sciences Department Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory Fax: +1-510-486-6900


Re: [OMPI devel] [1.8.2rc4] build failure with --enable-osx-builtin-atomics

2014-08-14 Thread George Bosilca
The atomic.h file should also be trimmed of the SPARC relique.

  George.


Index: opal/include/opal/sys/atomic.h
===
--- opal/include/opal/sys/atomic.h (revision 32531)
+++ opal/include/opal/sys/atomic.h (working copy)
@@ -162,8 +162,6 @@
 #include "opal/sys/powerpc/atomic.h"
 #elif OPAL_ASSEMBLY_ARCH == OMPI_POWERPC64
 #include "opal/sys/powerpc/atomic.h"
-#elif OPAL_ASSEMBLY_ARCH == OMPI_SPARC
-#include "opal/sys/sparc/atomic.h"
 #elif OPAL_ASSEMBLY_ARCH == OMPI_SPARCV9_32
 #include "opal/sys/sparcv9/atomic.h"
 #elif OPAL_ASSEMBLY_ARCH == OMPI_SPARCV9_64


On Thu, Aug 14, 2014 at 12:01 AM, Paul Hargrove  wrote:

> Fix confirmed using the nightly tarball (v1.8rc5r32531).
>
> -Paul
>
>
> On Wed, Aug 13, 2014 at 6:16 PM, Ralph Castain  wrote:
>
>> Thanks Paul - fixed in r32530
>>
>>
>>
>> On Wed, Aug 13, 2014 at 2:42 PM, Paul Hargrove 
>> wrote:
>>
>>> When configured with --enable-osx-builtin-atomics
>>>
>>> Making all in asm
>>>   CC   asm.lo
>>> In file included from
>>> /Users/Paul/OMPI/openmpi-1.8.2rc4-macos10.8-x86-clang-atomics/openmpi-1.8.2rc4/opal/asm/asm.c:21:
>>> /Users/Paul/OMPI/openmpi-1.8.2rc4-macos10.8-x86-clang-atomics/openmpi-1.8.2rc4/opal/include/opal/sys/atomic.h:145:10:
>>> fatal error: 'opal/sys/osx/atomic.h' file not found
>>> #include "opal/sys/osx/atomic.h"
>>>  ^
>>> 1 error generated.
>>>
>>> I reported the same issue to George in the trunk last week.
>>> So, I am 95% certain one just needs to cmr r32390 (commit msg == 'Dont
>>> miss the Os X atomics on "make dist"')
>>>
>>>
>>> -Paul
>>>
>>> --
>>> Paul H. Hargrove  phhargr...@lbl.gov
>>> Future Technologies Group
>>> Computer and Data Sciences Department Tel: +1-510-495-2352
>>> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
>>>
>>> ___
>>> devel mailing list
>>> de...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/devel/2014/08/15642.php
>>>
>>
>>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/08/15644.php
>>
>
>
>
> --
> Paul H. Hargrove  phhargr...@lbl.gov
> Future Technologies Group
> Computer and Data Sciences Department Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15645.php
>


Re: [OMPI devel] [1.8.2rc4] build failure with --enable-osx-builtin-atomics

2014-08-14 Thread Paul Hargrove
Fix confirmed using the nightly tarball (v1.8rc5r32531).

-Paul


On Wed, Aug 13, 2014 at 6:16 PM, Ralph Castain  wrote:

> Thanks Paul - fixed in r32530
>
>
>
> On Wed, Aug 13, 2014 at 2:42 PM, Paul Hargrove  wrote:
>
>> When configured with --enable-osx-builtin-atomics
>>
>> Making all in asm
>>   CC   asm.lo
>> In file included from
>> /Users/Paul/OMPI/openmpi-1.8.2rc4-macos10.8-x86-clang-atomics/openmpi-1.8.2rc4/opal/asm/asm.c:21:
>> /Users/Paul/OMPI/openmpi-1.8.2rc4-macos10.8-x86-clang-atomics/openmpi-1.8.2rc4/opal/include/opal/sys/atomic.h:145:10:
>> fatal error: 'opal/sys/osx/atomic.h' file not found
>> #include "opal/sys/osx/atomic.h"
>>  ^
>> 1 error generated.
>>
>> I reported the same issue to George in the trunk last week.
>> So, I am 95% certain one just needs to cmr r32390 (commit msg == 'Dont
>> miss the Os X atomics on "make dist"')
>>
>>
>> -Paul
>>
>> --
>> Paul H. Hargrove  phhargr...@lbl.gov
>> Future Technologies Group
>> Computer and Data Sciences Department Tel: +1-510-495-2352
>> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900
>>
>> ___
>> devel mailing list
>> de...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> Link to this post:
>> http://www.open-mpi.org/community/lists/devel/2014/08/15642.php
>>
>
>
> ___
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post:
> http://www.open-mpi.org/community/lists/devel/2014/08/15644.php
>



-- 
Paul H. Hargrove  phhargr...@lbl.gov
Future Technologies Group
Computer and Data Sciences Department Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory Fax: +1-510-486-6900