Issue filed -- please follow up here:

    https://github.com/open-mpi/ompi/issues/1654



> On May 6, 2016, at 12:22 PM, Paul Hargrove <phhargr...@lbl.gov> wrote:
> 
> BIG-endian PPC64 w/ xlc V13.1 experiences a nearly identical failure.
> However, this time gdb appears to have been able to resolve frame #0 to a PLT 
> slot (instead of "??").
> 
> -Paul
> 
> #0  0x00000fff8904ef88 in 00000010.plt_call.opal_mem_hooks_release_hook+0 ()
>    from 
> /gpfs-biou/phh1/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64-xlc-13.1/INST/lib/libopen-pal.so.20
> #1  0x00000fff8910b630 in intercept_munmap (start=0xfff88d20000, 
> length=2097152)
>     at 
> /gpfs-biou/phh1/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64-xlc-13.1/openmpi-gitclone/opal/mca/memory/patcher/memory_patcher_component.c:155
> #2  0x000000800cc5ca80 in ._IO_setb () from /lib64/libc.so.6
> #3  0x000000800cc5b16c in ._IO_file_close_it () from /lib64/libc.so.6
> #4  0x000000800cc4a758 in .fclose () from /lib64/libc.so.6
> #5  0x0000000010000f88 in do_test ()
>     at 
> /gpfs-biou/phh1/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64-xlc-13.1/openmpi-gitclone/ompi/debuggers/dlopen_test.c:97
> #6  0x00000000100010d8 in main (argc=1, argv=0xffff462f398)
>     at 
> /gpfs-biou/phh1/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64-xlc-13.1/openmpi-gitclone/ompi/debuggers/dlopen_test.c:135
> 
> 
> 
> On Fri, May 6, 2016 at 9:14 AM, Paul Hargrove <phhargr...@lbl.gov> wrote:
> I am testing a tarball built from v2.x-dev-1410-g81e0924
> This includes pull request #1128 in which Nathan addressed multiple "patcher" 
> issues.
> 
> However, I see the crash below in dlopen_test on a LITTLE-ENDIAN Power8 
> system.
> This is happening when built with "V13.1.2 (5725-C73, 5765-J08)", but not 
> with gcc on the same system.
> So, I cannot conclusively assign blame to OpenMPI.
> 
> -Paul
> 
> Program terminated with signal SIGSEGV, Segmentation fault.
> 
> (gdb) where
> #0  0x0000000000000000 in ?? ()
> #1  0x00003fff897adb38 in intercept_munmap (start=0x3fff89670000, 
> length=65536)
>     at 
> /home/phargrov/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64el-xlc/openmpi-gitclone/opal/mca/memory/patcher/memory_patcher_component.c:155
> #2  0x00003fff8933bc80 in __GI__IO_setb () from /lib64/libc.so.6
> #3  0x00003fff89339528 in __GI__IO_file_close_it () from /lib64/libc.so.6
> #4  0x00003fff89327f74 in fclose@@GLIBC_2.17 () from /lib64/libc.so.6
> #5  0x0000000010000f7c in do_test ()
>     at 
> /home/phargrov/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64el-xlc/openmpi-gitclone/ompi/debuggers/dlopen_test.c:97
> #6  0x00000000100010e0 in main (argc=1, argv=0x3fffff332888)
>     at 
> /home/phargrov/OMPI/openmpi-v2.x-dev-1410-g81e0924-linux-ppc64el-xlc/openmpi-gitclone/ompi/debuggers/dlopen_test.c:135
> 
> "start" is valid:
> (gdb) print *(char*)0x3fff89670000
> $1 = 35 '#'
> 
> Frame 1:
> 155         opal_mem_hooks_release_hook (start, length, true);
> 
> -- 
> Paul H. Hargrove                          phhargr...@lbl.gov
> Computer Languages & Systems Software (CLaSS) Group
> Computer Science Department               Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900
> 
> 
> 
> -- 
> Paul H. Hargrove                          phhargr...@lbl.gov
> Computer Languages & Systems Software (CLaSS) Group
> Computer Science Department               Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2016/05/18930.php


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/

Reply via email to