[ 
https://issues.apache.org/jira/browse/MESOS-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14335465#comment-14335465
 ] 

Vinod Kone commented on MESOS-2388:
-----------------------------------

Segfault in another Group test.

{code}
[ RUN      ] GroupTest.GroupJoinWithDisconnect
[New Thread 0x7fffa4272940 (LWP 56808)]
[New Thread 0x7fffa2d65940 (LWP 56809)]
[New Thread 0x7fffa4373940 (LWP 56810)]
[New Thread 0x7fffa4171940 (LWP 56811)]
I0224 21:08:12.288172 56719 zookeeper_test_server.cpp:158] Started 
ZooKeeperTestServer on port 50142
I0224 21:08:12.288238 56719 zookeeper_test_server.cpp:118] Shutting down 
ZooKeeperTestServer on port 50142
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@712: Client 
environment:zookeeper.version=zookeeper C client 3.4.5
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@716: Client 
environment:host.name=smfd-atr-11-sr1.devel.twitter.com
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@723: Client 
environment:os.name=Linux
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@724: Client 
environment:os.arch=2.6.50-t17.el5
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@725: Client 
environment:os.version=#1 SMP Sat Oct 18 13:18:32 PDT 2014
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@733: Client 
environment:user.name=vinod
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@741: Client 
environment:user.home=/root
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@753: Client 
environment:user.dir=/home/vinod/mesos/build
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@zookeeper_init@786: 
Initiating client connection, host=127.0.0.1:50142 sessionTimeout=10000 
watcher=0x7ffff6b75b90 sessionId=0 sessionPasswd=<null> context=0x7fffd4043560 
flags=0
[New Thread 0x7fffa3e68940 (LWP 56812)]
[Thread 0x7fffa4272940 (LWP 56808) exited]
[Thread 0x7fffa4171940 (LWP 56811) exited]
[New Thread 0x7fffa3566940 (LWP 56813)]
2015-02-24 21:08:12,305:56719(0x7fffa3e68940):ZOO_INFO@check_events@1703: 
initiated connection to server [127.0.0.1:50142]
[Thread 0x7fffa4373940 (LWP 56810) exited]
[New Thread 0x7fffa4171940 (LWP 56814)]
2015-02-24 
21:08:12,310:56719(0x7fffa3e68940):ZOO_ERROR@handle_socket_error_msg@1721: 
Socket [127.0.0.1:50142] zk retcode=-4, errno=112(Host is down): failed while 
receiving a server response
[New Thread 0x7fffa4373940 (LWP 56815)]
[New Thread 0x7fffa4272940 (LWP 56816)]
[New Thread 0x7fffa4474940 (LWP 56817)]
I0224 21:08:12.323103 56719 zookeeper_test_server.cpp:158] Started 
ZooKeeperTestServer on port 50142
[Thread 0x7fffa2d65940 (LWP 56809) exited]
[Thread 0x7fffa3667940 (LWP 56805) exited]
2015-02-24 21:08:15,644:56719(0x7fffa3e68940):ZOO_INFO@check_events@1703: 
initiated connection to server [127.0.0.1:50142]
2015-02-24 21:08:15,699:56719(0x7fffa3e68940):ZOO_INFO@check_events@1750: 
session establishment complete on server [127.0.0.1:50142], 
sessionId=0x14bbd6b2b960000, negotiated timeout=10000
I0224 21:08:15.699853 56745 group.cpp:313] Group process 
(group(3)@10.35.255.108:43895) connected to ZooKeeper
I0224 21:08:15.699910 56745 group.cpp:790] Syncing group operations: queue size 
(joins, cancels, datas) = (1, 0, 0)
I0224 21:08:15.699937 56745 group.cpp:385] Trying to create path '/test' in 
ZooKeeper
[Thread 0x7fffa3e68940 (LWP 56812) exited]
2015-02-24 21:08:15,735:56719(0x7fffed41f940):ZOO_INFO@zookeeper_close@2505: 
Closing zookeeper sessionId=0x14bbd6b2b960000 to [127.0.0.1:50142]

I0224 21:08:15.735313 56719 zookeeper_test_server.cpp:118] Shutting down 
ZooKeeperTestServer on port 50142
[Thread 0x7fffa3566940 (LWP 56813) exited]

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7fffa4474940 (LWP 56817)]
0x00007fffe39aa792 in write_memory_serialize_page (this=0x7fffa4473500, 
from=<value optimized out>, to=_thread_in_vm)
    at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
378     
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp: 
No such file or directory.
        in 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp
(gdb) bt
#0  0x00007fffe39aa792 in write_memory_serialize_page (this=0x7fffa4473500, 
from=<value optimized out>, to=_thread_in_vm)
    at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
#1  serialize_memory (this=0x7fffa4473500, from=<value optimized out>, 
to=_thread_in_vm) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/os/linux/vm/interfaceSupport_linux.hpp:31
#2  transition_and_fence (this=0x7fffa4473500, from=<value optimized out>, 
to=_thread_in_vm) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:187
#3  ThreadStateTransition::trans_and_fence (this=0x7fffa4473500, from=<value 
optimized out>, to=_thread_in_vm)
    at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:240
#4  0x00007fffe39a569b in ~ThreadBlockInVM (this=<value optimized out>, 
isAbsolute=false, time=<value optimized out>)
    at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:322
#5  Parker::park (this=<value optimized out>, isAbsolute=false, time=<value 
optimized out>) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/os/linux/vm/os_linux.cpp:6025
#6  0x00007fffe3b0cfd5 in Unsafe_Park (env=<value optimized out>, unsafe=<value 
optimized out>, isAbsolute=<value optimized out>, time=0)
    at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/prims/unsafe.cpp:1205
#7  0x00007fffd90127f8 in ?? ()
#8  0x00007fffd9006058 in ?? ()
#9  0x0000000000000000 in ?? ()
{code}

Yet another test
{code}
[ RUN      ] GroupTest.GroupDataWithDisconnect
[New Thread 0x7fffa2e66940 (LWP 57617)]
[New Thread 0x7fffa4171940 (LWP 57618)]
[New Thread 0x7fffa4373940 (LWP 57619)]
[New Thread 0x7fffa4272940 (LWP 57620)]
I0224 21:10:56.844243 57510 zookeeper_test_server.cpp:158] Started 
ZooKeeperTestServer on port 50328
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@712: Client 
environment:zookeeper.version=zookeeper C client 3.4.5
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@716: Client 
environment:host.name=smfd-atr-11-sr1.devel.twitter.com
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@723: Client 
environment:os.name=Linux
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@724: Client 
environment:os.arch=2.6.50-t17.el5
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@725: Client 
environment:os.version=#1 SMP Sat Oct 18 13:18:32 PDT 2014
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@733: Client 
environment:user.name=vinod
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@741: Client 
environment:user.home=/root
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@753: Client 
environment:user.dir=/home/vinod/mesos/build
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@zookeeper_init@786: 
Initiating client connection, host=127.0.0.1:50328 sessionTimeout=10000 
watcher=0x7ffff6b75b90 sessionId=0 sessionPasswd=<null> context=0x299f930 
flags=0
[New Thread 0x7fffa3667940 (LWP 57621)]
[New Thread 0x7fffa3e68940 (LWP 57622)]
2015-02-24 21:10:56,852:57510(0x7fffa3667940):ZOO_INFO@check_events@1703: 
initiated connection to server [127.0.0.1:50328]
2015-02-24 21:10:56,937:57510(0x7fffa3667940):ZOO_INFO@check_events@1750: 
session establishment complete on server [127.0.0.1:50328], 
sessionId=0x14bbd6dae3e0000, negotiated timeout=10000
I0224 21:10:56.937506 57541 group.cpp:313] Group process 
(group(4)@10.35.255.108:41878) connected to ZooKeeper
I0224 21:10:56.937546 57541 group.cpp:790] Syncing group operations: queue size 
(joins, cancels, datas) = (1, 0, 0)
I0224 21:10:56.937568 57541 group.cpp:385] Trying to create path '/test' in 
ZooKeeper
I0224 21:10:56.968747 57510 zookeeper_test_server.cpp:118] Shutting down 
ZooKeeperTestServer on port 50328
2015-02-24 
21:10:56,970:57510(0x7fffa3667940):ZOO_ERROR@handle_socket_error_msg@1721: 
Socket [127.0.0.1:50328] zk retcode=-4, errno=112(Host is down): failed while 
receiving a server response
I0224 21:10:56.970808 57540 group.cpp:418] Lost connection to ZooKeeper, 
attempting to reconnect ...

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7fffa2e66940 (LWP 57617)]
0x00007fffe399485a in write_memory_serialize_page (this=0x2919028, 
__the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
378     
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp: 
No such file or directory.
        in 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp
(gdb) r
The program being debugged has been started already.
Start it from the beginning? (y or n) n
Program not restarted.
(gdb) bt
#0  0x00007fffe399485a in write_memory_serialize_page (this=0x2919028, 
__the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
#1  serialize_memory (this=0x2919028, __the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/os/linux/vm/interfaceSupport_linux.hpp:31
#2  transition_and_fence (this=0x2919028, __the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:187
#3  trans_and_fence (this=0x2919028, __the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:240
#4  ~ThreadBlockInVM (this=0x2919028, __the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:322
#5  ObjectMonitor::enter (this=0x2919028, __the_thread__=0x7fffdc348800) at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/objectMonitor.cpp:417
#6  0x00007fffe37a4999 in InterpreterRuntime::monitorenter 
(thread=0x7fffdc348800, elem=0x7fffa2e657c8)
    at 
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/interpreter/interpreterRuntime.cpp:596
#7  0x00007fffd901ebae in ?? ()
#8  0x00007fffd901eb6c in ?? ()
#9  0x0000000000000003 in ?? ()
#10 0x00000006876f2f78 in ?? ()
#11 0x00007fffa2e657c8 in ?? ()
#12 0x000000039099861c in ?? ()
#13 0x00007fffa2e65870 in ?? ()
#14 0x00000003909b1dc8 in ?? ()
#15 0x0000000000000000 in ?? ()
{code}

This looks like something is wrong with how we do ZooKeeper tests because the 
SEGFAULTS are happening during teardown of a test AFAICT.

> GroupTest.LabelledGroup segfaults
> ---------------------------------
>
>                 Key: MESOS-2388
>                 URL: https://issues.apache.org/jira/browse/MESOS-2388
>             Project: Mesos
>          Issue Type: Bug
>          Components: test
>         Environment: Ubuntu 14.10 gcc
>            Reporter: Vinod Kone
>            Assignee: Vinod Kone
>
> Observed this on internal CI. Not sure if it is due to 
> "GroupTest.LabelledGroup" or an earlier test.
> {code}
> I0219 01:04:17.980598 27766 zookeeper_test_server.cpp:117] Shutting down 
> ZooKeeperTestServer on port 39597
> [       OK ] GroupTest.RetryableErrors (30150 ms)
> [ RUN      ] GroupTest.LabelledGroup
> Makefile:6656: recipe for target 'check-local' failed
> make[3]: *** [check-local] Segmentation fault (core dumped)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to