[
https://issues.apache.org/jira/browse/MESOS-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14335465#comment-14335465
]
Vinod Kone commented on MESOS-2388:
-----------------------------------
Segfault in another Group test.
{code}
[ RUN ] GroupTest.GroupJoinWithDisconnect
[New Thread 0x7fffa4272940 (LWP 56808)]
[New Thread 0x7fffa2d65940 (LWP 56809)]
[New Thread 0x7fffa4373940 (LWP 56810)]
[New Thread 0x7fffa4171940 (LWP 56811)]
I0224 21:08:12.288172 56719 zookeeper_test_server.cpp:158] Started
ZooKeeperTestServer on port 50142
I0224 21:08:12.288238 56719 zookeeper_test_server.cpp:118] Shutting down
ZooKeeperTestServer on port 50142
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@712: Client
environment:zookeeper.version=zookeeper C client 3.4.5
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@716: Client
environment:host.name=smfd-atr-11-sr1.devel.twitter.com
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@723: Client
environment:os.name=Linux
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@724: Client
environment:os.arch=2.6.50-t17.el5
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@725: Client
environment:os.version=#1 SMP Sat Oct 18 13:18:32 PDT 2014
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@733: Client
environment:user.name=vinod
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@741: Client
environment:user.home=/root
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@log_env@753: Client
environment:user.dir=/home/vinod/mesos/build
2015-02-24 21:08:12,288:56719(0x7fffe5c10940):ZOO_INFO@zookeeper_init@786:
Initiating client connection, host=127.0.0.1:50142 sessionTimeout=10000
watcher=0x7ffff6b75b90 sessionId=0 sessionPasswd=<null> context=0x7fffd4043560
flags=0
[New Thread 0x7fffa3e68940 (LWP 56812)]
[Thread 0x7fffa4272940 (LWP 56808) exited]
[Thread 0x7fffa4171940 (LWP 56811) exited]
[New Thread 0x7fffa3566940 (LWP 56813)]
2015-02-24 21:08:12,305:56719(0x7fffa3e68940):ZOO_INFO@check_events@1703:
initiated connection to server [127.0.0.1:50142]
[Thread 0x7fffa4373940 (LWP 56810) exited]
[New Thread 0x7fffa4171940 (LWP 56814)]
2015-02-24
21:08:12,310:56719(0x7fffa3e68940):ZOO_ERROR@handle_socket_error_msg@1721:
Socket [127.0.0.1:50142] zk retcode=-4, errno=112(Host is down): failed while
receiving a server response
[New Thread 0x7fffa4373940 (LWP 56815)]
[New Thread 0x7fffa4272940 (LWP 56816)]
[New Thread 0x7fffa4474940 (LWP 56817)]
I0224 21:08:12.323103 56719 zookeeper_test_server.cpp:158] Started
ZooKeeperTestServer on port 50142
[Thread 0x7fffa2d65940 (LWP 56809) exited]
[Thread 0x7fffa3667940 (LWP 56805) exited]
2015-02-24 21:08:15,644:56719(0x7fffa3e68940):ZOO_INFO@check_events@1703:
initiated connection to server [127.0.0.1:50142]
2015-02-24 21:08:15,699:56719(0x7fffa3e68940):ZOO_INFO@check_events@1750:
session establishment complete on server [127.0.0.1:50142],
sessionId=0x14bbd6b2b960000, negotiated timeout=10000
I0224 21:08:15.699853 56745 group.cpp:313] Group process
(group(3)@10.35.255.108:43895) connected to ZooKeeper
I0224 21:08:15.699910 56745 group.cpp:790] Syncing group operations: queue size
(joins, cancels, datas) = (1, 0, 0)
I0224 21:08:15.699937 56745 group.cpp:385] Trying to create path '/test' in
ZooKeeper
[Thread 0x7fffa3e68940 (LWP 56812) exited]
2015-02-24 21:08:15,735:56719(0x7fffed41f940):ZOO_INFO@zookeeper_close@2505:
Closing zookeeper sessionId=0x14bbd6b2b960000 to [127.0.0.1:50142]
I0224 21:08:15.735313 56719 zookeeper_test_server.cpp:118] Shutting down
ZooKeeperTestServer on port 50142
[Thread 0x7fffa3566940 (LWP 56813) exited]
Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7fffa4474940 (LWP 56817)]
0x00007fffe39aa792 in write_memory_serialize_page (this=0x7fffa4473500,
from=<value optimized out>, to=_thread_in_vm)
at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
378
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:
No such file or directory.
in
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp
(gdb) bt
#0 0x00007fffe39aa792 in write_memory_serialize_page (this=0x7fffa4473500,
from=<value optimized out>, to=_thread_in_vm)
at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
#1 serialize_memory (this=0x7fffa4473500, from=<value optimized out>,
to=_thread_in_vm) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/os/linux/vm/interfaceSupport_linux.hpp:31
#2 transition_and_fence (this=0x7fffa4473500, from=<value optimized out>,
to=_thread_in_vm) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:187
#3 ThreadStateTransition::trans_and_fence (this=0x7fffa4473500, from=<value
optimized out>, to=_thread_in_vm)
at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:240
#4 0x00007fffe39a569b in ~ThreadBlockInVM (this=<value optimized out>,
isAbsolute=false, time=<value optimized out>)
at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:322
#5 Parker::park (this=<value optimized out>, isAbsolute=false, time=<value
optimized out>) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/os/linux/vm/os_linux.cpp:6025
#6 0x00007fffe3b0cfd5 in Unsafe_Park (env=<value optimized out>, unsafe=<value
optimized out>, isAbsolute=<value optimized out>, time=0)
at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/prims/unsafe.cpp:1205
#7 0x00007fffd90127f8 in ?? ()
#8 0x00007fffd9006058 in ?? ()
#9 0x0000000000000000 in ?? ()
{code}
Yet another test
{code}
[ RUN ] GroupTest.GroupDataWithDisconnect
[New Thread 0x7fffa2e66940 (LWP 57617)]
[New Thread 0x7fffa4171940 (LWP 57618)]
[New Thread 0x7fffa4373940 (LWP 57619)]
[New Thread 0x7fffa4272940 (LWP 57620)]
I0224 21:10:56.844243 57510 zookeeper_test_server.cpp:158] Started
ZooKeeperTestServer on port 50328
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@712: Client
environment:zookeeper.version=zookeeper C client 3.4.5
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@716: Client
environment:host.name=smfd-atr-11-sr1.devel.twitter.com
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@723: Client
environment:os.name=Linux
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@724: Client
environment:os.arch=2.6.50-t17.el5
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@725: Client
environment:os.version=#1 SMP Sat Oct 18 13:18:32 PDT 2014
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@733: Client
environment:user.name=vinod
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@741: Client
environment:user.home=/root
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@log_env@753: Client
environment:user.dir=/home/vinod/mesos/build
2015-02-24 21:10:56,844:57510(0x7fffe6411940):ZOO_INFO@zookeeper_init@786:
Initiating client connection, host=127.0.0.1:50328 sessionTimeout=10000
watcher=0x7ffff6b75b90 sessionId=0 sessionPasswd=<null> context=0x299f930
flags=0
[New Thread 0x7fffa3667940 (LWP 57621)]
[New Thread 0x7fffa3e68940 (LWP 57622)]
2015-02-24 21:10:56,852:57510(0x7fffa3667940):ZOO_INFO@check_events@1703:
initiated connection to server [127.0.0.1:50328]
2015-02-24 21:10:56,937:57510(0x7fffa3667940):ZOO_INFO@check_events@1750:
session establishment complete on server [127.0.0.1:50328],
sessionId=0x14bbd6dae3e0000, negotiated timeout=10000
I0224 21:10:56.937506 57541 group.cpp:313] Group process
(group(4)@10.35.255.108:41878) connected to ZooKeeper
I0224 21:10:56.937546 57541 group.cpp:790] Syncing group operations: queue size
(joins, cancels, datas) = (1, 0, 0)
I0224 21:10:56.937568 57541 group.cpp:385] Trying to create path '/test' in
ZooKeeper
I0224 21:10:56.968747 57510 zookeeper_test_server.cpp:118] Shutting down
ZooKeeperTestServer on port 50328
2015-02-24
21:10:56,970:57510(0x7fffa3667940):ZOO_ERROR@handle_socket_error_msg@1721:
Socket [127.0.0.1:50328] zk retcode=-4, errno=112(Host is down): failed while
receiving a server response
I0224 21:10:56.970808 57540 group.cpp:418] Lost connection to ZooKeeper,
attempting to reconnect ...
Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 0x7fffa2e66940 (LWP 57617)]
0x00007fffe399485a in write_memory_serialize_page (this=0x2919028,
__the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
378
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:
No such file or directory.
in
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp
(gdb) r
The program being debugged has been started already.
Start it from the beginning? (y or n) n
Program not restarted.
(gdb) bt
#0 0x00007fffe399485a in write_memory_serialize_page (this=0x2919028,
__the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/os.hpp:378
#1 serialize_memory (this=0x2919028, __the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/os/linux/vm/interfaceSupport_linux.hpp:31
#2 transition_and_fence (this=0x2919028, __the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:187
#3 trans_and_fence (this=0x2919028, __the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:240
#4 ~ThreadBlockInVM (this=0x2919028, __the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/interfaceSupport.hpp:322
#5 ObjectMonitor::enter (this=0x2919028, __the_thread__=0x7fffdc348800) at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/runtime/objectMonitor.cpp:417
#6 0x00007fffe37a4999 in InterpreterRuntime::monitorenter
(thread=0x7fffdc348800, elem=0x7fffa2e657c8)
at
/builddir/build/BUILD/java-1.7.0-openjdk7/hotspot/src/share/vm/interpreter/interpreterRuntime.cpp:596
#7 0x00007fffd901ebae in ?? ()
#8 0x00007fffd901eb6c in ?? ()
#9 0x0000000000000003 in ?? ()
#10 0x00000006876f2f78 in ?? ()
#11 0x00007fffa2e657c8 in ?? ()
#12 0x000000039099861c in ?? ()
#13 0x00007fffa2e65870 in ?? ()
#14 0x00000003909b1dc8 in ?? ()
#15 0x0000000000000000 in ?? ()
{code}
This looks like something is wrong with how we do ZooKeeper tests because the
SEGFAULTS are happening during teardown of a test AFAICT.
> GroupTest.LabelledGroup segfaults
> ---------------------------------
>
> Key: MESOS-2388
> URL: https://issues.apache.org/jira/browse/MESOS-2388
> Project: Mesos
> Issue Type: Bug
> Components: test
> Environment: Ubuntu 14.10 gcc
> Reporter: Vinod Kone
> Assignee: Vinod Kone
>
> Observed this on internal CI. Not sure if it is due to
> "GroupTest.LabelledGroup" or an earlier test.
> {code}
> I0219 01:04:17.980598 27766 zookeeper_test_server.cpp:117] Shutting down
> ZooKeeperTestServer on port 39597
> [ OK ] GroupTest.RetryableErrors (30150 ms)
> [ RUN ] GroupTest.LabelledGroup
> Makefile:6656: recipe for target 'check-local' failed
> make[3]: *** [check-local] Segmentation fault (core dumped)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)