Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread Paul Hargrove
Well, the contents of opal/asm/asm-data.txt and the arch-specific subdirs below opal/include/opal/sys have served me as a list of the atomics implementations. If those include architectures no longer officially supported, then some cleanup may be in order (as SPARC_v8 was recently removed from

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread Jeff Squyres (jsquyres)
I think the closest thing we have to a supported architecture list is in the README. On Aug 11, 2014, at 2:42 PM, Nathan Hjelm wrote: > > Which brings us back to Dave's question. Is there a list of supported > architectures? I don't want to bother with DEC Alpha if we no

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread Nathan Hjelm
Which brings us back to Dave's question. Is there a list of supported architectures? I don't want to bother with DEC Alpha if we no longer support it. BTW, so far I have converted: AMD64, IA32, ARM. Working on IA64 now. -Nathan On Mon, Aug 11, 2014 at 01:57:21PM -0400, George Bosilca wrote: >

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread George Bosilca
Dave, We all understand your concerns. However, the current issue has nothing to do with Nathan; the code for supporting ARMv5 is already in the patch I submitted and that Paul validated. What Nathan said he might take a look at is a different method for generating assembly code, one that only

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread Dave Goodell (dgoodell)
On Aug 11, 2014, at 11:54 AM, Paul Hargrove wrote: > I am on the same page with George here - if it's on the list then support it > until it's been removed. > > I happen to have systems to test, I believe, every supported atomics > implementation except for DEC Alpha, and

[OMPI devel] btl thread safety question

2014-08-11 Thread Pritchard Jr., Howard
Hi Folks, Has anyone checked about ompi thread safety support since the BTL move? I can only get the osu latency mt test to work using sm/shmem/vader. With TCP I see it hang after 32KB messages. Howard - Howard Pritchard HPC-5 Los Alamos

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread Paul Hargrove
I am on the same page with George here - if it's on the list then support it until it's been removed. I happen to have systems to test, I believe, every supported atomics implementation except for DEC Alpha, and so I did test them all. AFAIK ARMv5 is even outdated as a smartphone platform.

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread George Bosilca
It is not that I care, but it was one of our supported platforms and we don't usually drop support for anything without a proper RFC. George. On Mon, Aug 11, 2014 at 12:09 PM, Dave Goodell (dgoodell) < dgood...@cisco.com> wrote: > On Aug 7, 2014, at 11:37 PM, George Bosilca

Re: [OMPI devel] RFC: add atomic compare-and-swap that returns old value

2014-08-11 Thread Dave Goodell (dgoodell)
On Aug 7, 2014, at 11:37 PM, George Bosilca wrote: > Paul's tests identified a small issue with the previous patch (a real > corner-case for ARM v5). The patch below fixes all known issues. Wait, why do we care about ARMv5? It's certainly not a serious HPC platform,

Re: [OMPI devel] errors and warnings with show_help() usage

2014-08-11 Thread Jeff Squyres (jsquyres)
Sweet -- thanks! On Aug 11, 2014, at 2:07 AM, Gilles Gouaillardet wrote: > Jeff and all, > > i fixed the trivial errors in the trunk, there are now 11 non trivial > errors. > (commits r32490 to r32497) > > i ran the script vs the v1.8 branch and found 54

Re: [OMPI devel] errors and warnings with show_help() usage

2014-08-11 Thread Ralph Castain
I'm not worrying about 1.8.2, but we can take a look at this for 1.8.3 or beyond. Thanks for working on the trunk! On Aug 10, 2014, at 11:07 PM, Gilles Gouaillardet wrote: > Jeff and all, > > i fixed the trivial errors in the trunk, there are now 11 non

Re: [OMPI devel] ORTE headers in OPAL source

2014-08-11 Thread Adrian Reber
I have seen it. I am still waiting for things to settle down before I start fixing the FT code ( again ;-) Adrian On Mon, Aug 11, 2014 at 01:40:33PM +, Jeff Squyres (jsquyres) wrote: > Ah, I see. > > Ok -- add it to the list of >

Re: [OMPI devel] cosmetic configure nit

2014-08-11 Thread Jeff Squyres (jsquyres)
On Aug 9, 2014, at 4:24 PM, Paul Hargrove wrote: > One too many 's' characters in the following: > > checking for asssembly architecture... Fixed; thanks. > -Paul > > -- > Paul H. Hargrove phhargr...@lbl.gov > Future Technologies Group >

Re: [OMPI devel] bus error with openmpi-1.8.2rc4r32485 and gcc-4.9.0

2014-08-11 Thread Kawashima, Takahiro
Hi Ralph, Your commit r32459 fixed the bus error by correcting opal/dss/dss_copy.c. It's OK for trunk because mca_dstore_hash calls dss to copy data. But it's insufficient for v1.8 because mca_db_hash doesn't call dss and copies data itself. The attached patch is the minimum patch to fix it in

Re: [OMPI devel] [vt] --with-openmpi-inside configure argument

2014-08-11 Thread Matthias Jurenz
Hello Paul, the only possible values for --with-openmpi-inside are "yes" and "1.7", where the latter value is interpreted as *since*. Prior to version 1.7, the Open MPI configure provided both F77 and FC for specifying Fortran compilers. The VT configure only provides FC, so it sets FC (if not

Re: [OMPI devel] [OMPI users] bus error with openmpi-1.8.2rc4r32485 and gcc-4.9.0

2014-08-11 Thread Kawashima, Takahiro
Siegmar, Ralph, I'm sorry for the late response since last week. Ralph fixed the problem in r32459 and it was merged to v1.8 in r32474. But in v1.8 an additional custom patch is needed because the db/dstore source code differs between trunk and v1.8. I'm preparing and testing the custom

Re: [OMPI devel] errors and warnings with show_help() usage

2014-08-11 Thread Gilles Gouaillardet
Jeff and all, i fixed the trivial errors in the trunk, there are now 11 non trivial errors. (commits r32490 to r32497) i ran the script vs the v1.8 branch and found 54 errors (first, you need to touch Makefile.ompi-rules in the top-level Open MPI directory in order to make the script happy)

Re: [OMPI devel] ibm abort test hangs on one node

2014-08-11 Thread Gilles Gouaillardet
Thanks Ralph! This was necessary but not sufficient: orte_errmgr_base_abort calls orte_session_dir_finalize at errmgr_base_fns.c:219, which removes the proc session dir; then orte_errmgr_base_abort (indirectly) calls orte_ess_base_app_abort at line 227. First, the proc session dir is removed