Re: [OMPI devel] Choosing an Open-MPI release for a new cluster

2013-05-02 Thread Paul Hargrove
On Thu, May 2, 2013 at 5:55 AM, Jeff Squyres (jsquyres) wrote: > Given that we know there are compile bugs with XRC in all the v1.7 > releases so far (which seems to show how few people are actually using > XRC...), there are definitely "growing pains", as Ralph mentioned. Jeff may be a little

Re: [OMPI devel] Any plans to support Intel MIC (Xeon Phi) in Open-MPI?

2013-05-02 Thread Paul Hargrove
Ralph, I am not an expert, by any means, but based on a presentation I heard 4 hours ago: The Xeon and Phi instruction sets have a large intersection, but neither is a subset of the other. In particular, Phi has its own SIMD instructions *instead* of Xeon's MMX, SSEn, etc. There is also on CMPXCH

Re: [OMPI devel] Any plans to support Intel MIC (Xeon Phi) in Open-MPI?

2013-05-02 Thread Paul Hargrove
know exactly what that means (I haven't read their docs about this stuff), > but I suspect that it's more than just launching MPI processes on them... > > > On May 2, 2013, at 8:54 PM, Paul Hargrove wrote: > > > Ralph, > > > > I am not an expert, by any means, b

Re: [OMPI devel] [EXTERNAL] Re: RFC: Python-generated Fortran wrappers

2013-05-22 Thread Paul Hargrove
Let me jump in here with a different perspective. First, for those who don't know me: + I am NOT an OMPI developer + I am NOT an MPI application author either + I am a developer of "competing" HPC communications software (GASNet) + I contribute to OMPI mainly by building release candidates

Re: [OMPI devel] Open MPI shirts and more

2013-10-18 Thread Paul Hargrove
I don't know it if just me, but the logo appears to have gotten cropped on the 2.25" button and the round magnet. -Paul On Fri, Oct 18, 2013 at 12:41 PM, Jeff Squyres (jsquyres) < jsquy...@cisco.com> wrote: > OMPI Developer Community -- > > Per the upcoming 10th anniversary of OMPI's SVN r1 (an

Re: [OMPI devel] Open MPI shirts and more

2013-10-18 Thread Paul Hargrove
Still looks the same to me, even after changing browsers. Screen shot attached. -Paul On Fri, Oct 18, 2013 at 1:59 PM, Jeff Squyres (jsquyres) wrote: > Fixed! > > On Oct 18, 2013, at 3:44 PM, Paul Hargrove > wrote: > > > I don't know it if just me, but the l

Re: [OMPI devel] shmem vs. oshmem

2013-10-25 Thread Paul Hargrove
On Fri, Oct 25, 2013 at 2:32 PM, Jeff Squyres (jsquyres) wrote: > We're shipping oshmem, not shmem, so why not call them oshmem examples > [that also happen to be shmem examples] -- rather than shmem examples [that > also happen to be oshmem examples]? My USD 0.02: If the examples were written

Re: [OMPI devel] RFC: usnic BTL MPI_T pvar scheme

2013-11-05 Thread Paul Hargrove
Jeff, If this approach is to be adopted by other components (and perhaps other MPIs), then it would be important for the enumeration variable name to be derived in a UNIFORM way: __SOMETHING Without a fixed value for "SOMETHING" somebody will need to read sources (or documentation) to make the

Re: [OMPI devel] RFC: usnic BTL MPI_T pvar scheme

2013-11-05 Thread Paul Hargrove
On Tue, Nov 5, 2013 at 6:00 PM, Jeff Squyres (jsquyres) wrote: > On Nov 5, 2013, at 2:54 PM, Paul Hargrove wrote: > > > If this approach is to be adopted by other components (and perhaps other > MPIs), then it would be important for the enumeration variable name to be > derive

Re: [OMPI devel] RFC: usnic BTL MPI_T pvar scheme

2013-11-05 Thread Paul Hargrove
gt; > How about using a * in the name, to represent where the match is? E.G., > btl_usnic_*_enum? > > It's a string, so it's not just limited to letters and underscores. > > Sent from my phone. No type good. > > On Nov 5, 2013, at 6:26 PM, "Paul Hargrove&quo

Re: [OMPI devel] [PATCH 4/4] Trying to get the C/R code to compile again. (last)

2013-12-04 Thread Paul Hargrove
On Dec 4, 2013 12:07 PM, "Jeff Squyres (jsquyres)" wrote: [...] > But in some ways, having uncompilable code is a *good* thing, because it tells you exactly where you need to work on the architecture. Just updating it to *compile* removes that safeguard -- will you remember/re-find all those plac

[OMPI devel] 1.7.4rc1 build failure: FreeBSD-9

2013-12-19 Thread Paul Hargrove
I see the failure below when building 1.7.4rc1 on FreeBSD-9 (amd64). It looks to be just a missing header, probably sys/stat.h. $ gcc --version gcc (GCC) 4.2.1 20070831 patched [FreeBSD] Only configure option passed was --prefix-... -Paul Making all in mca/sharedfp/sm CC sharedfp_sm.l

[OMPI devel] 1.7.4rc1 build failure: OpenBSD-5 and NetBSD-6

2013-12-19 Thread Paul Hargrove
When building 1.7.4rc1 on OpenBSD-5 and NetBSD-6 (both amd64) I see what appears to be the same three errors ("make" output at end of this email) on both platforms. All three syntax errors appears to be collisions on the symbol if_mtu: -bash-4.2$ cat -n openmpi-1.7.4rc1/opal/util/if.h | grep -w

[OMPI devel] 1.74rc1 build failure: Solaris 11 / x86_64 / Sun Studio 12.3

2013-12-19 Thread Paul Hargrove
In 1.7.4rc1's README support is still claimed for Solaris 11 on x86_64 with Sun Studio (12.2 and 12.3): - Oracle Solaris 10 and 11, 32 and 64 bit (SPARC, i386, x86_64), with Oracle Solaris Studio 12.2 and 12.3 However, I get a build failure when configured with: CC=cc CFLAGS=-m64 --w

Re: [OMPI devel] 1.7.4rc1 build failure: Solaris 11 / x86_64

2013-12-19 Thread Paul Hargrove
o] Error 1 make[2]: Leaving directory `/shared/OMPI/openmpi-1.7.4rc1-solaris11-x64-ib-gcc452/BLD/opal/mca/if/posix_ipv4 -Paul On Thu, Dec 19, 2013 at 3:51 PM, Paul Hargrove wrote: > In 1.7.4rc1's README support is still claimed for Solaris 11 on x86_64 > with Sun Studio (12.2 and 1

Re: [OMPI devel] 1.74rc1 build failure: Solaris 11 / x86_64 / Sun Studio 12.3

2013-12-19 Thread Paul Hargrove
ifr) < 0) { > -opal_output(0, "btl_usnic_opal_ifinit: ioctl(SIOCGIFMTU) > failed with errno=%d", errno); > -break; > -} > -intf->if_mtu = ifr->ifr_mtu; > +/* get the MTU */ > + if (ioctl(sd, SIOCGIFMTU, ifr) < 0)

Re: [OMPI devel] 1.7.4rc1 build failure: OpenBSD-5 and NetBSD-6

2013-12-19 Thread Paul Hargrove
recursive') The getrlimit manpage on this platform does not list RLIMIT_AS. Running "grep -rl RLIMIT_AS /usr/include" confirms that this constant does not exist. So, I think "#ifdef RLIMIT_AS" is required. -Paul On Thu, Dec 19, 2013 at 4:39 PM, Jeff Squyres (jsquyres) wrote:

Re: [OMPI devel] 1.74rc1 build failure: Solaris 11 / x86_64 / Sun Studio 12.3

2013-12-19 Thread Paul Hargrove
nic btl on Solaris at all. Perhaps just because the OFED stack is present? -Paul On Thu, Dec 19, 2013 at 4:39 PM, Jeff Squyres (jsquyres) wrote: > Try http://www.open-mpi.org/~jsquyres/unofficial/. > > Should have both "if" fixes in it. > > > On Dec 19, 2013, at 7

[OMPI devel] 1.7.4rc1 run failure on Solaris 10 / SPARC (not SIGBUS)

2013-12-19 Thread Paul Hargrove
Testing with Solaris 10 on SPARC, I was expecting to encounter the bus error reported previously by Siegman Gross. Instead I see the following hwloc-related abort: $ env PATH=/home/hargrove/OMPI/openmpi-1.7.4rc1-solaris10-sparcT2-ss12u3-v9/INST/bin:$PATH LD_LIBRARY_PATH_64=/home/hargrove/OMPI/o

Re: [OMPI devel] 1.74rc1 build failure: Solaris 11 / x86_64 / Sun Studio 12.3

2013-12-19 Thread Paul Hargrove
52/openmpi-1.7.4rc2forpaul/ompi/mca/btl/usnic/btl_usnic_util.h:19:45: error: static declaration of �fls� follows non-static declaration /usr/include/string.h:87:12: note: previous declaration of �fls� was here make[2]: *** [btl_usnic_module.lo] Error 1 -Paul On Thu, Dec 19, 2013 at 6:35 PM, Pau

[OMPI devel] 1.7.4rc1 install failure: NetBSD-6 amd64

2013-12-19 Thread Paul Hargrove
Attached is the output from "make install" of 1.7.4rc1 + Jeff's fix for the symbol conflict on "if_mtu". There appear to be at least 2 issues. 1) There are lots of (not fatal) messages about ldconfig not existing, but according to he NetBSD lists that utility went away with the conversion from a.

Re: [OMPI devel] 1.7.4rc1 run failure on Solaris 10 / SPARC (not SIGBUS)

2013-12-19 Thread Paul Hargrove
e: > I believe this one has already been fixed and is in the nightly (1.7.4rc2) > - for now, you can just set "--bind-to none" on the cmd line to get past it > > > On Dec 19, 2013, at 6:42 PM, Paul Hargrove wrote: > > Testing with Solaris 10 on SPARC, I was e

[OMPI devel] 1.7.4rc1 autogen error: NetBSD-6

2013-12-19 Thread Paul Hargrove
Probably nobody cares, but I'll report this for completeness. In trying to understand the "make install" failure on NetBSD-6 I run "autogen.sh". The versions detected: Searching for autoconf Found autoconf version 2.69; checking version... Found version component 2 -- need 2

Re: [OMPI devel] 1.74rc1 build failure: Solaris 11 / x86_64 / Sun Studio 12.3

2013-12-19 Thread Paul Hargrove
Dec 19, 2013 at 6:47 PM, Paul Hargrove wrote: > Jeff, > > I didn't actually get very far after fixing __always_inline. > In fact, the build still fails on the *same* line, but for a different > (valid) reason: > fls() is declared in /usr/include/string.h > >

[OMPI devel] 1.7.4rc1 build failure: another OpenBSD-5

2013-12-20 Thread Paul Hargrove
Manually #ifdef'ing out the RLIMIT_AS code which lead to my previous failure on OpenBSD-5 allows me to reach the (sigh) *next* problem: Making all in mpi/cxx CXX mpicxx.lo /home/phargrov/OMPI/openmpi-1.7.4rc2forpaul-openbsd5-amd64/openmpi-1.7.4rc2forpaul/ompi/mpi/cxx/mpicxx.cc:120:21: error

Re: [OMPI devel] 1.7.4rc1 run failure on Solaris 10 / SPARC (not SIGBUS)

2013-12-20 Thread Paul Hargrove
ems where I reported issues on Thu night. On Thu, Dec 19, 2013 at 7:19 PM, Ralph Castain wrote: > I believe this one has already been fixed and is in the nightly (1.7.4rc2) > - for now, you can just set "--bind-to none" on the cmd line to get past it > > > On Dec 19, 2013,

Re: [OMPI devel] 1.7.4rc1 run failure on Solaris 10 / SPARC (not SIGBUS)

2013-12-20 Thread Paul Hargrove
e just went in > today. Jeff has since regenerated the tarball for the web site, so the one > up there should have most (if not all) of these problems fixed > > Have a great holiday! > Ralph > > > On Dec 20, 2013, at 11:40 AM, Paul Hargrove wrote: > > Ralph, > > I

[OMPI devel] 1.7.4rc2r30031 testing summary

2013-12-20 Thread Paul Hargrove
This email is a summary of my results testing 1.7.4rc2r30031. I will send detailed follow-ups on the new issues. So, this is a heads-up to let you know this version still has significant problems for me. FreeBSD-9/amd64 + "mpirun -np 2 examples/ring_c" hangs! + "mpirun -np 1 examples/ring_c" runs

[OMPI devel] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
With plenty of help from Jeff and Ralph's bug fixes in the past 24 hours, I can now build OMPI for NetBSD. However, running even a simple example fails: Having set PATH and LD_LIBARY_PATH: $ mpirun -np 1 examples/ring_c just hangs Output from "top" shows idle procs: PID USERNAME PRI NICE SIZE

[OMPI devel] 1.7.4rc2r30031 - FreeBSD mpirun warning

2013-12-20 Thread Paul Hargrove
I have a build of OMPI 1.7.4rc2r30031 on FreeBSD-9 finally. I can (as I will detail in another email) run only singletons at the moment. However, when I do I get a warning that is, IMHO, unnecessary: $ mpirun -np 1 examples/ring_c ---

Re: [OMPI devel] [EXTERNAL] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
t; Paul - > > Any chance you could grab a stack trace from the mpi app? That's probably > the fastest next step > > Brian > > > > Sent with Good (www.good.com) > > > -Original Message- > *From: *Paul Hargrove [phhargr...@lbl.gov] > *Sent: *Fr

[OMPI devel] 1.7.4rc2r30031 - FreeBSD-9 mpirun hangs

2013-12-20 Thread Paul Hargrove
This case is not quite like my OpenBSD-5 report. On FreeBSD-9 I *can* run singletons, but "-np 2" hangs. The following hangs: $ mpirun -np 2 examples/ring_c The following complains about the "bogus" btl selection. So this is not the same as my problem with OpenBSD-5: $ mpirun -mca btl bogus -np 2

Re: [OMPI devel] [EXTERNAL] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
c000 On Fri, Dec 20, 2013 at 2:48 PM, Paul Hargrove wrote: > Brian, > > Of course, I should have thought of that myself. > See below for backtrace from a singleton run. > > I'm starting an --enable-debug build to maybe get some line number info > too. > > -Paul &

Re: [OMPI devel] 1.7.4rc2r30031 - FreeBSD mpirun warning

2013-12-20 Thread Paul Hargrove
On Fri, Dec 20, 2013 at 3:12 PM, Dave Goodell (dgoodell) wrote: > On Dec 20, 2013, at 4:43 PM, Paul Hargrove wrote: > > > The warning is correct that no such interface exists. > > However 127.0.0.1/24 DOES exist: > > > > $ ifconfig lo0 inet > >

Re: [OMPI devel] 1.7.4rc2r30031 - FreeBSD-9 mpirun hangs

2013-12-20 Thread Paul Hargrove
, 2013 at 2:59 PM, Paul Hargrove wrote: > This case is not quite like my OpenBSD-5 report. > On FreeBSD-9 I *can* run singletons, but "-np 2" hangs. > > The following hangs: > $ mpirun -np 2 examples/ring_c > > The following complains about the "bogus" btl s

Re: [OMPI devel] 1.7.4rc2r30031 - FreeBSD-9 mpirun hangs

2013-12-20 Thread Paul Hargrove
FWIW: I've confirmed that this is REGRESSION relative to 1.7.3, which works fine on FreeBSD-9 -Paul On Fri, Dec 20, 2013 at 3:30 PM, Paul Hargrove wrote: > And the FreeBSD backtraces again, this time configured with --enable-debug > and for all threads: > > The 100%-c

Re: [OMPI devel] [EXTERNAL] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
16 PM, Paul Hargrove wrote: > Below is the backtrace again, this time configured w/ --enable-debug and > for all threads. > -Paul > > Thread 2 (thread 1021110): > #0 0x1bc0ef6c5e3a in nanosleep () at :2 > #1 0x1bc0f317c2d4 in nanosleep (rqtp=0x7f7bc900, rmtp=

Re: [OMPI devel] [EXTERNAL] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
try back-porting the fix(es) to see if 1.7.3 works or not . > > -Paul > > > On Fri, Dec 20, 2013 at 3:16 PM, Paul Hargrove wrote: > >> Below is the backtrace again, this time configured w/ --enable-debug and >> for all threads. >> -Paul >> >> T

Re: [OMPI devel] [EXTERNAL] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
On Fri, Dec 20, 2013 at 4:02 PM, Paul Hargrove wrote: > FWIW: > I've confirmed that this is REGRESSION relative to 1.7.2, which works fine > on OpenBSD-5 > > I could not build 1.7.3 due to some of issues fixed for 1.7.4rc in the > past 24 hours. > I am going to try b

Re: [OMPI devel] [EXTERNAL] 1.7.4rc2r30031 - OpenBSD-5 mpirun hangs

2013-12-20 Thread Paul Hargrove
vestigating and think I > know where the issue might lie (a timer that is firing to indicate a failed > connection attempt and causing a race condition) > > > On Dec 20, 2013, at 4:02 PM, Paul Hargrove wrote: > > FWIW: > I've confirmed that this is REGRESSION relative t

Re: [OMPI devel] 1.7.4rc1 autogen error: NetBSD-6

2013-12-20 Thread Paul Hargrove
utogen.pl if I can find time. However, I am sending this email to document my finding in case I don't get back to this. -Paul On Fri, Dec 20, 2013 at 6:49 AM, Jeff Squyres (jsquyres) wrote: > I just submitted a CMR to Brian to fix this: > > https://svn.open-mpi.org/trac/

Re: [OMPI devel] 1.7.4rc1 autogen error: NetBSD-6

2013-12-20 Thread Paul Hargrove
; + open(OUT, ">configure.patched") || my_die "Can't open configure.patched"; print OUT $c; close(OUT); -Paul On Fri, Dec 20, 2013 at 6:04 PM, Paul Hargrove wrote: > As I indicated earlier today, the CMRed fix to push/pop "dir" in hwloc did

[OMPI devel] 1.7.4rc2r30076 Missing Mellanox vendor ID

2013-12-25 Thread Paul Hargrove
I get a warning from the current 1.7.4rc2r30076: -- WARNING: No preset parameters were found for the device that Open MPI detected: Local host:pcp-j-20 Device name: mthca0 Device vendor ID: 0

Re: [OMPI devel] RFC: OB1 optimizations

2014-01-08 Thread Paul Hargrove
Interestingly enough the 4MB latency actually improved significantly relative to the initial numbers. -Paul [Sent from my phone] On Jan 8, 2014 8:50 AM, "George Bosilca" wrote: > These results are way worst that the one you send on your previous email? > What is the reason? > > George. > > On

Re: [OMPI devel] RFC: OB1 optimizations

2014-01-08 Thread Paul Hargrove
Nevermind, since Nathan just clarified that the results are not comparable. -Paul [Sent from my phone] On Jan 8, 2014 8:58 AM, "Paul Hargrove" wrote: > Interestingly enough the 4MB latency actually improved significantly > relative to the initial numbers. > > -Paul [Sent f

[OMPI devel] 1.7.4rc2r30148 - static link failure on NetBSD

2014-01-08 Thread Paul Hargrove
When I compile the current 1.7.4rc on NetBSD with no configure arguments, I still get the "make install" failure that I have detailed in previous emails. HOWEVER, if I configure with "--enable-static --disable-shared" then I get an earlier failure at build time (partial "make V=1" output shown bel

[OMPI devel] 1.7.4rc2r30148 run failure NetBSD6-x86

2014-01-08 Thread Paul Hargrove
While I have yet to get a working build on NetBSD for x86-64 h/w, I *have* successfully built Open MPI's current 1.7.4rc tarball on NetBSD-6 for x86. However, I can't *run* anything: Attempting the ring_c example on 2 cores: -bash-4.2$ mpirun -mca btl sm,self -np 2 examples/ring_c ---

[OMPI devel] 1.7.4rc2r30148 - out-of-date mpirun usage output

2014-01-08 Thread Paul Hargrove
Note the following still indicates that "--bind-to none" is the default: -bash-4.2$ mpirun --help | grep -A2 'bind-to ' --bind-to Policy for binding processes [none (default) | hwthread | core | socket | numa | board] (supported qualifiers

Re: [OMPI devel] 1.7.4rc2r30031 - FreeBSD mpirun warning

2014-01-08 Thread Paul Hargrove
s given for btl_tcp_if_exclude. This value will be ignored. Local host: pcp-j-17 Value: 127.0.0.1/8 Message:Did not find interface matching this subnet -- -Paul > On Dec 20, 2013, at 3:20 PM, Paul Hargrove wrote: >

[OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-08 Thread Paul Hargrove
This partial make output shows a build failure of the current trunk tarball on FreeBSD-9/x86-64: CC path.lo /home/phargrov/OMPI/openmpi-trunk-freebsd9-amd64/openmpi-1.9a1r30146/opal/util/path.c: In function 'opal_path_df': /home/phargrov/OMPI/openmpi-trunk-freebsd9-amd64/openmpi-1.9a1r30146

[OMPI devel] trunk - ibverbs configure error on Solaris-11

2014-01-08 Thread Paul Hargrove
When trying to configure the OMPI trunk on a Solaris-11/x86-64 with --enable-openib, I see the following error not seen with the 1.7 branch: *** Compiler flags checking which of CFLAGS are ok for debugger modules... -DNDEBUG -m64 -mt checking for debugger extra CFLAGS... -g checking for the C com

[OMPI devel] 1.7.4rc2r30148 - implicit decl of bzero on Solaris

2014-01-08 Thread Paul Hargrove
As of 1.7.4rc2r30148 there appears to be a missing "#include " in bcol_basesmuma_smcm.c. Both the Solaris Studio compiler (output below) and gcc agree on this point. CC bcol_basesmuma_smcm.lo "/shared/OMPI/openmpi-1.7-latest-solaris11-x64-ib-ss12u3/openmpi-1.7.4rc2r30148/ompi/mca/bcol/bas

[OMPI devel] 1.7.4rc2r30148 - crash in MPI_Init on Linux/x86

2014-01-08 Thread Paul Hargrove
I am still testing the current 1.7.4rc tarball on my various systems. The latest failure (shown below) is a SEGV somewhere below MPI_Init on a old, but otherwise fairly normal, Linux/x86 (32-bit) system. $ /home/pcp1/phargrov/OMPI/openmpi-1.7-latest-linux-x86/INST/bin/mpirun -np 1 examples/ring_c

Re: [OMPI devel] 1.7.4rc2r30148 - crash in MPI_Init on Linux/x86

2014-01-08 Thread Paul Hargrove
bad backing store site - any chance you could > give me a line number from this? There are a lot of calls to register > params in that code and I'd need some help in figuring out which one wasn't > right. > > > On Jan 8, 2014, at 6:59 PM, Paul Hargrove wrote: >

Re: [OMPI devel] 1.7.4rc2r30148 - crash in MPI_Init on Linux/x86

2014-01-08 Thread Paul Hargrove
r, I'm > mindful of all the things you need to do, so please only if you have the > time. > > Thanks > Ralph > > On Jan 8, 2014, at 8:23 PM, Paul Hargrove wrote: > > Ralph, > > Building with gcc-4.1.2 fixed the problem for me. I also removed an old > install o

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
t;. > > So I just changed it to an "else" statement in the trunk and scheduled it > for 1.7.4 if it passes muster - see how this works for you. > > Ralph > > > On Jan 8, 2014, at 4:52 PM, Paul Hargrove wrote: > > This partial make output shows a build failure

Re: [OMPI devel] 1.7.4rc2r30148 run failure NetBSD6-x86

2014-01-09 Thread Paul Hargrove
gt; Hmmm...I see the problem. Looks like binding isn't supported on that > system for some reason, so we need to turn "off" our auto-binding when we > hit that condition. I'll check to see why that isn't happening (was > supposed to do so) > > > On Jan 8,

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
No joy running autogen.pl on my FreeBSD system. So, I'll try to remember to run on Thu night's trunk tarball. -Paul On Wed, Jan 8, 2014 at 9:01 PM, Paul Hargrove wrote: > Ralph, > > *IF* I have new enough autotools to autogen on my FreeBSD platform, I'll > take

Re: [OMPI devel] 1.7.4rc2r30148 - crash in MPI_Init on Linux/x86

2014-01-09 Thread Paul Hargrove
, argv=0xbfe04114) at ring_c.c:19 On Wed, Jan 8, 2014 at 8:45 PM, Paul Hargrove wrote: > Only takes <30 seconds of typing to start the test and I get email when it > is done. > Typing these emails takes more of my time than the actual testing does. > > -Paul > > > On W

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
all the systems for the latter actually is intended to amount to "anything > else". > > So I just changed it to an "else" statement in the trunk and scheduled it > for 1.7.4 if it passes muster - see how this works for you. > > Ralph > > > On Jan 8, 2014

[OMPI devel] trunk - non-portable examples/Makefie

2014-01-09 Thread Paul Hargrove
With Ralph's fix for opal/util/path.c, I can build the trunk on FreeBSD-9/x86-64. However, building the examples fails with: $ cp -r $SRCDIR/examples . $ cd examples $ make mpicc -g hello_c.c -o hello_c mpicc -g ring_c.c -o ring_c mpicc -g connectivity_c.c -o connectivity_c shmemcc -g -o he

[OMPI devel] trunk - broken logic for oshmem:bindings:fort

2014-01-09 Thread Paul Hargrove
Building the trunk on FreeBSD-9/x86-64, and using gmake to work around the non-portable examples/Makefile, I *still* encounter issues with shmemfort when running "gmake" in the examples subdirectory: $ gmake mpicc -ghello_c.c -o hello_c mpicc -gring_c.c -o ring_c mpicc -gconnectivi

Re: [OMPI devel] 1.7.4rc2r30031 - FreeBSD mpirun warning

2014-01-09 Thread Paul Hargrove
Yup, the 1.7.4rc2r30168 tarball appears to have resolved this. -Paul On Wed, Jan 8, 2014 at 4:08 PM, Ralph Castain wrote: > Yeah - it only today was approved for move into 1.7.4 :-) > > Hopefully will make it into tonight's tarball > > > On Jan 8, 2014, at 3:58 P

[OMPI devel] 1.7.4rc2r30168 - misc harmless *BSD warnings

2014-01-09 Thread Paul Hargrove
Some minor misc warnings from the current 1.7.4rc tarball: On both FreeBSD and NetBSD, the symbol CACHE_LINE_SIZE used in ompi/mca/bcol/basesmuma/ appears to clash with a system-defined symbol. FreeBSD-9/x86-64: CC bcol_basesmuma_component.lo In file included from /home/phargrov/OMPI/openm

[OMPI devel] 1.7.4rc2r30168 - warnings for pointer/integer of different size

2014-01-09 Thread Paul Hargrove
The following three warnings occur on 32-bit targets, and can each be avoided by adding an intermediate cast to uintptr_t or intptr_t: -bash-4.2$ grep -B2 'different size' make.log CC fcoll_dynamic_file_read_all.lo /home/phargrov/OMPI/openmpi-1.7-latest-netbsd6-i386/openmpi-1.7.4rc2r30168/

Re: [OMPI devel] 1.7.4rc2r30168 - warnings for pointer/integer of different size

2014-01-09 Thread Paul Hargrove
size -Paul On Thu, Jan 9, 2014 at 1:07 AM, Paul Hargrove wrote: > The following three warnings occur on 32-bit targets, and can each be > avoided by adding an intermediate cast to uintptr_t or intptr_t: > > -bash-4.2$ grep -B2 'different size

Re: [OMPI devel] 1.7.4rc2r30148 run failure NetBSD6-x86

2014-01-09 Thread Paul Hargrove
is topology, only "pu's", so > "bind-to core" is going to fail even though binding is supported. Will > adjust. > > Thanks! > > On Jan 8, 2014, at 9:06 PM, Paul Hargrove wrote: > > Requested verbose output below. > -Paul > > -bash-4.2$ mpir

Re: [OMPI devel] Paul H build on Solaris

2014-01-09 Thread Paul Hargrove
Jeff, I actually noted this in the unofficial tarball you rolled for me in December: On Thu, Dec 19, 2013 at 6:35 PM, Paul Hargrove wrote: [...] > I'm not sure why one is trying to build the usnic btl on Solaris at all. > Perhaps just because the OFED stack is present? > [...]

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
Jeff, The changes as described in the commit message make good sense to me except for one thing: In the 1.7 branch there is still a defined(__WINDOWS__) case for which opal_path_nfs() is currently a no-op . So, I fear that if CMR'ed blindly both the configure-time and build-time checks to ensure

Re: [OMPI devel] trunk - ibverbs configure error on Solaris-11

2014-01-09 Thread Paul Hargrove
as to do something with our handling of the legacy > --with-openib switch...? (it's been deprecated on the trunk in favor of > --with-verbs) > > > On Jan 8, 2014, at 8:38 PM, Paul Hargrove wrote: > > > When trying to configure the OMPI trunk on a Solaris-11/x86-64

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
On Thu, Jan 9, 2014 at 2:08 PM, Paul Hargrove wrote: > However, only my Solaris (10/SPARC and 11/x86-64) systems have NFS-mounted > filesystems. So, I don't have any means to ensure that the "newly active" > code performs correctly on the BSD systems. In other words,

[OMPI devel] 1.7.4 support Mac OSX 10.8 or 10.9?

2014-01-09 Thread Paul Hargrove
The README in the latest 1.7.4rc lists support for: - OS X (10.5, 10.6, 10.7), 32 and 64 bit (x86_64), with gcc and Absoft compilers (*) Should 10.8 (Mountain Lion) be listed? What about 10.9 (Mavericks)? I can test 10.5 through 10.8 but haven't been doing so assuming that is covered by th

[OMPI devel] 1.7.4rc2r30168 - configure failure on Mac OSX 10.5

2014-01-09 Thread Paul Hargrove
As I noted in another email, 1.7.4's README claims support for Mac OSX versions 10.5 through 10.7. So, I just now tried (but failed) to build on 10.5 (Leopard): *** Assembler checking dependency style of gcc -std=gnu99... gcc3 checking for BSD- or MS-compatible name lister (nm)... /usr/bin/nm -p

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
On Thu, Jan 9, 2014 at 4:15 PM, Jeff Squyres (jsquyres) wrote: > For example, if you build and run the test like this: > > make check > ./opal_path_nfs . Makefile ~ > > It'll report if ., Makefile, and ~ are on network filesystems (i.e., the > result of sending each of ".", "Makefile", and "your_h

[OMPI devel] 1.7.4rc2r30168 - PGI F08 failure

2014-01-09 Thread Paul Hargrove
My attempts to build the current 1.7.4rc tarball with versions 8.0 and 9.0 of the Portland Group compilers fails miserably on compilation of mpi-f08-types.F90. I am sort of surprised by the attempt to build Fortran 2008 support w/ such old compilers. I think that fact itself is the real bug here,

Re: [OMPI devel] 1.7.4rc2r30168 - configure failure on Mac OSX 10.5

2014-01-09 Thread Paul Hargrove
nno if we really go back that far, Paul - I doubt anyone has tested on > anything less than 10.8, frankly. Might be better if we update to not make > claims that far back. > > Were you able to build/run on 10.7? > > On Jan 9, 2014, at 3:25 PM, Paul Hargrove wrote: > > As I no

[OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-09 Thread Paul Hargrove
Trying to run on the front-end of one of our production Linux systems I see the following: $ mpirun -mca btl sm,self -np 2 examples/ring_c' [cvrsvc01:17692] [[42051,1],0] ORTE_ERROR_LOG: Data for specified key not found in file /global/homes/h/hargrove/GSCRATCH/OMPI/openmpi-1.7-latest-linux-x86_64

Re: [OMPI devel] 1.7.4rc2r30168 - PGI F08 failure

2014-01-09 Thread Paul Hargrove
V11.x, V12.x, ..., all being parsed as V1. > My recollection is that some C++ code was affected. > > Larry Baker > US Geological Survey > 650-329-5608 > ba...@usgs.gov > > > > On 9 Jan 2014, at 4:35 PM, Paul Hargrove wrote: > > My attempts to build the current 1

[OMPI devel] 1.7.4rc2r30168 - usnic warning w/ icc

2014-01-09 Thread Paul Hargrove
I believe the following means that the compiler has determined that the two named variables DO NOT actually get initialized to NULL as written. However, it looks like their initialization is not required, as each is set before it is read. CC btl_usnic_component.lo /scratch/scratchdirs/har

[OMPI devel] uDAPL and elan in 1.7.4?

2014-01-09 Thread Paul Hargrove
The README in the current 1.7.4rc tarball still claims support for uDAPL and Quadrics Elan. Unless I am mistaken, those were both removed. -- Paul H. Hargrove phhargr...@lbl.gov Future Technologies Group Computer and Data Sciences Department Tel: +1-510-495-2352 Lawr

[OMPI devel] Portals vs Portals4 in 1.7.4

2014-01-09 Thread Paul Hargrove
The README in the current 1.7.4rc tarball lists support for "Portals" and documents --with-portals{,-config,-libs} configure arguments. However, unless I am mistaken mtl:portals is gone and mtl:portals4 has different configure arguments. -Paul -- Paul H. Hargrove phharg

[OMPI devel] MX and PSM in 1.7.4

2014-01-09 Thread Paul Hargrove
Is anybody still testing MX and PSM? They are both still present in ompi/mca/mtl/ I have access to a system w/ QLogic HCAs w/ PSM and have verified that I can: $ mpirun -mca btl sm,self -np 2 -host n15,n16 -mca mtl psm examples/ring_c I have an x86 (32-bit) system w/ MX headers and libs that I ha

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-09 Thread Paul Hargrove
ation because of the batch system detected at configure time. However, I would have expected a more informative error message for that case. -Paul On Thu, Jan 9, 2014 at 5:03 PM, Paul Hargrove wrote: > Trying to run on the front-end of one of our production Linux systems I > see th

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
Jeff, Sorry, but the new opal/util/pth.c in the trunk tarball (1.9a1r30215) fails to build on NetBSD: CC path.lo /home/phargrov/OMPI/openmpi-trunk-netbsd6-i386/openmpi-1.9a1r30215/opal/util/path.c: In function 'opal_path_nfs': /home/phargrov/OMPI/openmpi-trunk-netbsd6-i386/openmpi-1.9a1r3

[OMPI devel] trunk - build failure on OpenBSD

2014-01-09 Thread Paul Hargrove
With the new opal/util/path.c I get farther building the trunk on OpenBSD but hit a new failure: Making all in mca/memheap CC base/memheap_base_frame.lo CC base/memheap_base_select.lo CC base/memheap_base_alloc.lo /home/phargrov/OMPI/openmpi-trunk-openbsd5-i386/openmpi-1.9a

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
Jeff, No joy on Solaris-10 either: CC path.lo "/home/hargrove/OMPI/openmpi-trunk-solaris10-sparcT2-ss12u3-v8plus/openmpi-1.9a1r30215/opal/util/path.c", line 478: prototype mismatch: 2 args passed, 4 expected "/home/hargrove/OMPI/openmpi-trunk-solaris10-sparcT2-ss12u3-v8plus/openmpi-1.9a1

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
On Thu, Jan 9, 2014 at 7:15 PM, Paul Hargrove wrote: > My Solaris-11 build stopped again on the failure to find ibv_open_device(). > I am re-running w/o --enable-openib now. > It finished while I was typing the previous message. The Solaris-11 build failed in the same way as Solaris-1

Re: [OMPI devel] trunk build failure on {Free,Net,Open}BSD

2014-01-09 Thread Paul Hargrove
, if one does want to keep the current logic (or at least something similar) it looks like configure should not assume statfs() is available without *also* confirming that "struct statfs" is available. -Paul On Thu, Jan 9, 2014 at 7:18 PM, Paul Hargrove wrote: > > On Thu, Jan

Re: [OMPI devel] trunk - build failure on OpenBSD

2014-01-09 Thread Paul Hargrove
Same issue for NetBSD, too. -Paul On Thu, Jan 9, 2014 at 7:09 PM, Paul Hargrove wrote: > With the new opal/util/path.c I get farther building the trunk on OpenBSD > but hit a new failure: > > Making all in mca/memheap > CC base/memheap_base_frame.lo &g

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-09 Thread Paul Hargrove
; of a slurm cluster. > > Only possibly relevant thing I see is that this was built with PGI - any > chance you could try a gcc based build? All my tests are done with gcc, so > I'm wondering if PGI is the source of the trouble here. > > > On Jan 9, 2014, at 6:17 PM,

Re: [OMPI devel] 1.7.4rc2r30148 run failure NetBSD6-x86

2014-01-10 Thread Paul Hargrove
compiler. I *was* able to finally confirm that I can now run ring_c. -Paul On Thu, Jan 9, 2014 at 12:07 PM, Paul Hargrove wrote: > Ralph, > > Thanks for fielding all these issues I've been finding. > I will plan to run tonight's trunk tarball through all of the systems

Re: [OMPI devel] 1.7.4rc2r30148 run failure NetBSD6-x86

2014-01-10 Thread Paul Hargrove
On Thu, Jan 9, 2014 at 9:05 PM, Ralph Castain wrote: > Not sure why the shmem fortran examples would try to build - will pass > that off to Jeff as well (sorry Jeff!) This is the issue I described in http://www.open-mpi.org/community/lists/devel/2014/01/13616.php It seems that oshmem_info alwa

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-10 Thread Paul Hargrove
On Thu, Jan 9, 2014 at 8:56 PM, Paul Hargrove wrote: > I'll try a gcc-based build on one of the systems ASAP. Sorry, Ralph: the failure remains when built w/ gcc. Let me know what to try next and I'll give it a shot. -Paul -- Paul H. Hargrove phhar

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-10 Thread Paul Hargrove
The problem is seen with both the trunk and the 1.7.4rc tarball. -Paul On Thu, Jan 9, 2014 at 9:23 PM, Paul Hargrove wrote: > > On Thu, Jan 9, 2014 at 8:56 PM, Paul Hargrove wrote: > >> I'll try a gcc-based build on one of the systems ASAP. > > > Sorry, Ralph: t

Re: [OMPI devel] trunk - build failure on OpenBSD

2014-01-10 Thread Paul Hargrove
ROT_READ | PROT_WRITE, MAP_SHARED | -#if defined (__APPLE__) -MAP_ANON | -#elif defined (__GNUC__) -MAP_ANONYMOUS | +#ifdef MAP_ANONYMOUS +MAP_ANONYMOUS | #endif MAP_FIXED, 0, On Thu, Jan 9, 2014 at 8:35 PM, Paul Hargrov

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-10 Thread Paul Hargrove
libs from old OMPI builds. -Paul > > On Jan 9, 2014, at 9:57 PM, Paul Hargrove wrote: > > The problem is seen with both the trunk and the 1.7.4rc tarball. > > -Paul > > > On Thu, Jan 9, 2014 at 9:23 PM, Paul Hargrove wrote: > >> >> On Thu, Jan 9, 20

Re: [OMPI devel] 1.7.4rc2r30168 - PGI F08 failure

2014-01-10 Thread Paul Hargrove
On Fri, Jan 10, 2014 at 7:49 AM, Jeff Squyres (jsquyres) wrote: > Paul -- > > The output from configure looks ok to me. We're testing for the various > capabilities of the fortran compiler that we need, most of which have been > around for quite a while. One Big New Thing that isn't around yet

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-10 Thread Paul Hargrove
On Fri, Jan 10, 2014 at 10:08 AM, Ralph Castain wrote: > ??? that was it? Was this built with --enable-debug? Nope, I missed --enable-debug. Will try again. -Paul -- Paul H. Hargrove phhargr...@lbl.gov Future Technologies Group Computer and Data Sciences Department

Re: [OMPI devel] 1.7.4rc2r30168 - odd run failure

2014-01-10 Thread Paul Hargrove
On Fri, Jan 10, 2014 at 10:41 AM, Paul Hargrove wrote: > > On Fri, Jan 10, 2014 at 10:08 AM, Ralph Castain wrote: > >> ??? that was it? Was this built with --enable-debug? > > > Nope, I missed --enable-debug. Will try again. > > OK, Take-2 below. There is an ob

Re: [OMPI devel] Paul H build on Solaris

2014-01-10 Thread Paul Hargrove
On Thu, Jan 9, 2014 at 12:35 PM, Jeff Squyres (jsquyres) wrote: > Thanks. We're just going to change the test in the usnic BTL to be > explicit about only building on 64 bit Linux. > Last night's trunk did NOT try to build btl:usnic on Solaris. So, this issue looks to be resolved in trunk. -Pa

<    1   2   3   4   5   6   7   8   9   10   >