Re: [OMPI devel] 1.8.4rc Status

2014-12-18 Thread Paul Hargrove
On Wed, Dec 17, 2014 at 7:17 PM, Paul Hargrove wrote: > > I am going to run the nightly on other configs on both my > Solaris-11/x86-64 and Solaris-10/SPARC systems. > I just want to be sure some other compile/abi/arch combination didn't get > broken by accident. > I will post

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Ralph Castain
Thanks Paul! Sorry I was out all day - stuck in meetings, I fear. On Wed, Dec 17, 2014 at 7:17 PM, Paul Hargrove wrote: > > Short version: > > v1.8 nightly (v1.8.3-313-g54c80c2) PASSED my testing. > > In full: > > I gave openmpi-v1.8.3-313-g54c80c2 a try. > In this test I

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Gilles Gouaillardet
Ralph, You get it right. The latest nightly tarball shoul work out of the box. (well, -m64 must be passed manually, but this is not related whatsoever to the issue discussed here) Cheers, Gilles "Jeff Squyres (jsquyres)" wrote: >Paul -- > >The __sun macro check is now in

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Jeff Squyres (jsquyres)
Paul -- The __sun macro check is now in the OMPI 1.8 tree, and is in the latest nightly tarball. If I'm following this thread right -- and I might not be! -- I think Gilles is saying that now that the __sun check is in, it should fix this -mt/-D_REENTRANT/whatever problem. Can you confirm?

Re: [OMPI devel] 1.8.4rc Status

2014-12-17 Thread Tom Wurgler
t;-shared-intel" \ --enable-mpi-ext Can you see anything that should or shouldn't be there? Thanks From: devel <devel-boun...@open-mpi.org> on behalf of Ralph Castain <r...@open-mpi.org> Sent: Monday, December 15, 2014 10:07 PM To: Open M

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
Results of tests described below: 1) SEGV in hwloc - will report later 2) PASS 3) PASS So, both -D_REENTRANT or -mt are working for me IF added both the CFLAGS and wrapper-cflags. -Paul On Tue, Dec 16, 2014 at 10:56 PM, Paul Hargrove wrote: > > I've queued 3 tests: > > 1)

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
I did run the nightly and it SEGVs in hwloc! I will provide more info when I am able. -Paul On Tue, Dec 16, 2014 at 10:59 PM, Gilles Gouaillardet < gilles.gouaillar...@iferc.org> wrote: > > Thanks Paul ! > > imho the first test is useless since it does not include the commit that > sets the

Re: [OMPI devel] OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Jeff Squyres (jsquyres)
Turns out that this problem was caused by not having a Fortran compiler. I fixed that in https://github.com/open-mpi/ompi-release/commit/b90c8142d343b12cbcc1023cb767801ea2d567a4. There's still 2 other minor problems (a cleanfile and a condition source include); working on those... On Dec

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Gilles Gouaillardet
Ralph, what goes wrong ? (e.g. which command ?) and which compiler (e.g. gcc < 4.9.1 ?) are you using ? Cheers, Gilles On 2014/12/17 17:30, Ralph Castain wrote: > I'm afraid I cannot generate a new rc, nor will there be a new 1.8 nightly > tarball as (ahem) Jeff's fortran commit broke the

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Ralph Castain
I'm afraid I cannot generate a new rc, nor will there be a new 1.8 nightly tarball as (ahem) Jeff's fortran commit broke the build system. I tried to figure out a fix, but am too tired to get it right. So I'm afraid we are stuck for the moment until Jeff returns in the morning and fixes the

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Gilles Gouaillardet
Thanks Paul ! imho the first test is useless since it does not include the commit that sets the -D_REENTRANT CFLAGS on solaris/solarisstudio https://github.com/open-mpi/ompi-release/commit/ac8b84ce674b958dbf8c9481b300beeef0548b83 Cheers, Gilles On 2014/12/17 15:56, Paul Hargrove wrote: > I've

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
I've queued 3 tests: 1) openmpi-v1.8.3-272-g4e4f997 2) openmpi-v1.8.4rc4 + adding -D_REENTRANT to CFLAGS and wrapper-cflags 3) openmpi-v1.8.4rc4 + adding -mt to CFLAGS and wrapper-cflags I hope to be able to login and collect the results around noon pacific time on Wed. -Paul On Tue, Dec 16,

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
Gilles, If I have done my testing correctly (not 100% sure) then adding "-D_REENTRANT" was NOT sufficient, where "-mt" was. I can at least test 1 tarball with one set of configure args each evening. Anything more than that I cannot commit to. My scripts are capable of grabbing the v1.8 nightly

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Gilles Gouaillardet
Ralph, i think that will not work. here is the full story : once upon a time, on solaris, we did not try to compile pthread'ed app without any special parameters. that was a minor annoyance on solaris 10 with old gcc : configure passed a flag (-pthread if i remember correctly) that was not

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
Ralph, No change with the patch you supplied. The test that uses the "pflags" set by your patch is guarded by the value of ompi_pthread_c_success. So, I think there must be some other patch needed to the body of OMPI_INTL_POSIX_THREADS_PLAIN_C to even reach the code changed by the patch you sent

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Ralph Castain
Hi Paul Can you try the attached patch? It would require running autogen, I fear. Otherwise, I can add it to the tarball. Ralph On Tue, Dec 16, 2014 at 9:59 PM, Paul Hargrove wrote: > > Gilles, > > The 1.8.3 test works where the 1.8.4rc4 one fails with identical configure

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-17 Thread Paul Hargrove
Gilles, The 1.8.3 test works where the 1.8.4rc4 one fails with identical configure arguments. While it may be overkill, I configured 1.8.4rc4 with CFLAGS="-m64 -mt" --with-wrapper-cflags="-m64 -mt" \ LDFLAGS="-mt" --with-wrapper-ldflags="-mt" The resulting run worked! So, I very

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-16 Thread Paul Hargrove
My 1.8.3 build has not completed. HOWEVER, I can already see a key difference in the configure step. In 1.8.3 "-mt" was added AUTOMATICALLY to CFLAGS by configure: checking if C compiler and POSIX threads work as is... no - Solaris, not checked checking if C++ compiler and POSIX threads work as

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Paul, i do not think -lpthread is passed automatically to LDFLAGS on Solaris, so you might have to do it manually as well i never used --with-wrapper-cflags before, so i'd rather invite you to mpicc -show to make sure the right flags are passed at the right place when the app is built Cheers,

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-16 Thread Paul Hargrove
Gilles, I am running the build of 1.8.3 first. As you suggest, I will only try without -m64 if 1.8.3 runs with it. Regarding "-mt" my understanding from "man cc" is that it has a DUAL function: 1) Passes -D_REENTRANT to the preprocess stage (if any) 2) Passes "the right flags" to the linker

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Thanks Paul, if 1.8.3 with -m64 and the same compilers runs fine, then please do not bother running 1.8.4rc4 without -m64. /* i understand you are busy and i hardly believe -m64 is the root cause */ a regression i can think of involves the flags we use for pthreads : for bad reasons, we

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-16 Thread Paul Hargrove
Gilles, First, please note that prior tests of 1.8.3 ran with no problems on these hosts. So, I *think* this problem is a regression. However, I am not 100% certain that this *exact* configuration was tested. So, I am RE-running a test of 1.8.3 now to be absolutely sure if this is a regression. I

Re: [OMPI devel] OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Thanks Paul, Are you invoking mpirun on pcp-j-20 ? If yes, what does getent hosts pcp-j-20 says ? BTW, did you try without -m64 ? Does the following work ping/ssh 172.18.0.120 Honestly, this output makes very little sense to me, so i am asking way too much info hoping i can reproduce this

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Paul, could you please send the output of ifconfig -a netstat -nr on the three hosts you are using (i assume you are still invoking mpirun from one node, and tasks are running on two other nodes) Cheers, Gilles On 2014/12/16 16:00, Paul Hargrove wrote: > Gilles, > > I looked again carefully

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Gilles Gouaillardet
Paul, the root cause -D_REENTRANT is not set automatically is we test the __sun__ macro and 12.4 compiler defines only __sun and sun I will make a fix for that ... Cheers, Gilles On 2014/12/16 16:00, Paul Hargrove wrote: > Gilles, > > I looked again carefully and I am *NOT* finding

Re: [OMPI devel] 1.8.4rc Status

2014-12-16 Thread Paul Hargrove
Gilles, I looked again carefully and I am *NOT* finding -D_REENTRANT passed to most compilations. It appears to be used for building libevent and vt, but nothing else. The output from configure contains checking if more special flags are required for pthreads... -D_REENTRANT only in the

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
Gilles, I am NOT seeing the problem with gcc. It is only occurring with the Studio compilers. As I've already reported, I have tried adding either "-mt" or "-mt=yes" to both LDFLAGS and --with-wrapper-ldflags. The "cc" manpage (on the Solaris-10 system I can get to right now) says: -mt

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Gilles Gouaillardet
Paul, did you manually set -mt ? if i remember correctly, solaris 11 (at least with gcc compilers) do not need any flags (except the -D_REENTRANT that is added automatically) Cheers, Gilles On 2014/12/16 12:10, Paul Hargrove wrote: > Gilles, > > I will try the patch when I can. > However, our

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
Gilles, I will try the patch when I can. However, our network is undergoing network maintenance right now, leaving me unable to reach the necessary hosts. As for -D_REENTRANT, I had already reported having verified in the "make" output that it had been added automatically. Additionally, the

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
les to indicate why. >> >> Note this is still rc2 >> >> More testing on-going >> >> >> -- >> *From:* devel <devel-boun...@open-mpi.org> on behalf of Tom Wurgler < >> twu...@goodyear.com> >> *Sent:* Monday, December 15, 2014

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Gilles Gouaillardet
Paul, could you please make sure configure added "-D_REENTRANT" to the CFLAGS ? /* otherwise, errno is a global variable instead of a per thread variable, which can explains some weird behaviour. note this should have been already fixed */ assuming -D_REENTRANT is set, could you please give the

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
rc2 > > More testing on-going > > > -- > *From:* devel <devel-boun...@open-mpi.org> on behalf of Tom Wurgler < > twu...@goodyear.com> > *Sent:* Monday, December 15, 2014 1:23 PM > > *To:* Open MPI Developers > *Subject:* Re: [OMPI devel] 1.8.4rc Statu

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
A little more reading finds that... Docs says that one needs "-mt" without the "=yes". That will work for both old and new compilers, where "-mt=yes" chokes older ones. Also, man pages say "-mt" must come before "-lpthread" in the link command. -Paul On Mon, Dec 15, 2014 at 12:52 PM, Paul

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Tom Wurgler
this is still rc2 More testing on-going From: devel <devel-boun...@open-mpi.org> on behalf of Tom Wurgler <twu...@goodyear.com> Sent: Monday, December 15, 2014 1:23 PM To: Open MPI Developers Subject: Re: [OMPI devel] 1.8.4rc Status It seems t

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Paul Hargrove
On Mon, Dec 15, 2014 at 5:35 AM, Ralph Castain wrote: > > 7. Linkage issue on Solaris-11 reported by Paul Hargrove. Missing the > multi-threaded C libraries, apparently need "-mt=yes" in both compile and > link. Need someone to investigate. The lack of multi-thread libraries

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Tom Wurgler
2014 8:45 AM To: Open MPI Developers Subject: Re: [OMPI devel] 1.8.4rc Status Should be there in rc4, and I thought it made it to rc2 for that matter. I'll take a gander. FWIW: I'm working off-list with IBM to tighten the LSF integration so we correctly read and follow their binding directives. Th

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
an tell, it isn't there as yet. > Is this correct? > > > thanks! > > > -- > *From:* devel <devel-boun...@open-mpi.org> on behalf of Ralph Castain < > r...@open-mpi.org> > *Sent:* Monday, December 15, 2014 8:35 AM > *To:* Open MPI De

Re: [OMPI devel] 1.8.4rc Status

2014-12-15 Thread Tom Wurgler
oun...@open-mpi.org> on behalf of Ralph Castain <r...@open-mpi.org> Sent: Monday, December 15, 2014 8:35 AM To: Open MPI Developers Subject: [OMPI devel] 1.8.4rc Status Hi folks Trying to summarize the current situation on releasing 1.8.4. Remaining identified issues: 1. TCP/BTL ha

[OMPI devel] 1.8.4rc Status

2014-12-15 Thread Ralph Castain
Hi folks Trying to summarize the current situation on releasing 1.8.4. Remaining identified issues: 1. TCP/BTL hang under mpi-thread-multiple. Asked George to look into it. 2. hwloc updates required. Brice committed them to the hwloc 1.7 repo. Gilles volunteered to create the PR from there. 3.