Jie Yu created MESOS-979:
----------------------------
Summary: Master segfault when tasks.json endpoint is hit
Key: MESOS-979
URL: https://issues.apache.org/jira/browse/MESOS-979
Project: Mesos
Issue Type: Bug
Components: master
Affects Versions: 0.18.0
Reporter: Jie Yu
Priority: Critical
Fix For: 0.18.0
curl 'http://<host>:5050/master/tasks.json
{noformat}
I0210 17:45:30.496232 10818 master.hpp:421] Removing task
system-gc-ea18e4d4-3549-4159-b9a4-7587f87af2f3 with resources cpus(*):0.01;
disk(*):1; mem(*):1 on slave 201309231905-2215191306-5050-53933-22 (<host>)
I0210 17:45:30.496374 10818 hierarchical_allocator_process.hpp:637] Recovered
cpus(*):0.01; disk(*):1; mem(*):1 (total allocatable: cpus(*):0.01; disk(*):1;
mem(*):1) on slave 201309231905-2215191306-5050-53933-22 from framework
201108030032-0000000011-0000
I0210 17:45:30.562484 10818 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142078 ] on slave
201309231905-2215191306-5050-53933-27 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:30.562623 10818 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309231905-2215191306-5050-53933-27 for 5secs
I0210 17:45:30.624539 10828 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142075 ] on slave
201309231905-2215191306-5050-53933-28 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:30.624811 10830 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309231905-2215191306-5050-53933-28 for 5secs
I0210 17:45:30.986958 10828 http.cpp:319] HTTP request for '/master/stats.json'
I0210 17:45:35.099058 10831 master.cpp:1998] Status update TASK_FINISHED (UUID:
dca364d9-4b93-4a46-b5cc-3237beaf2ef6) for task
system-gc-8024ec43-07e8-4d78-96b4-5c491ecea710 of framework
201108030032-0000000011-0000 from slave(1)@10.35.12.106:5051
I0210 17:45:35.099158 10831 master.hpp:421] Removing task
system-gc-8024ec43-07e8-4d78-96b4-5c491ecea710 with resources cpus(*):0.01;
disk(*):1; mem(*):1 on slave 201309231905-2215191306-5050-53933-15 (<host>)
I0210 17:45:35.099278 10831 hierarchical_allocator_process.hpp:637] Recovered
cpus(*):0.01; disk(*):1; mem(*):1 (total allocatable: cpus(*):0.01; disk(*):1;
mem(*):1) on slave 201309231905-2215191306-5050-53933-15 from framework
201108030032-0000000011-0000
I0210 17:45:35.966225 10825 master.cpp:2250] Sending 2 offers to framework
201108030032-0000000011-0000
I0210 17:45:36.858616 10819 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142082 ] on slave
201309131923-1829643018-5050-22101-4 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:36.858881 10819 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309131923-1829643018-5050-22101-4 for 5secs
I0210 17:45:41.004396 10821 http.cpp:319] HTTP request for '/master/stats.json'
I0210 17:45:41.914156 10823 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142112 ] on slave
201309231905-2215191306-5050-53933-26 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:41.914438 10823 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309231905-2215191306-5050-53933-26 for 5secs
I0210 17:45:41.972837 10818 master.cpp:2250] Sending 1 offers to framework
201108030032-0000000011-0000
I0210 17:45:41.976341 10819 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142264 ] on slave
201309131923-1829643018-5050-22101-4 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:41.976824 10819 master.hpp:403] Adding task
system-gc-6e21f223-e6e8-4790-b9fc-8df7b4195a70 with resources cpus(*):0.01;
disk(*):1; mem(*):1 on slave 201309131923-1829643018-5050-22101-4 (<host>)
I0210 17:45:41.976881 10819 master.cpp:2419] Launching task
system-gc-6e21f223-e6e8-4790-b9fc-8df7b4195a70 of framework
201108030032-0000000011-0000 with resources cpus(*):0.01; disk(*):1; mem(*):1
on slave 201309131923-1829643018-5050-22101-4 (<host>)
I0210 17:45:41.977452 10819 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309131923-1829643018-5050-22101-4 for 5secs
I0210 17:45:43.807765 10827 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142084 ] on slave
201309231901-1762272010-5050-5544-17 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:43.807962 10827 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309231901-1762272010-5050-5544-17 for 5secs
I0210 17:45:43.974618 10823 master.cpp:2250] Sending 1 offers to framework
201108030032-0000000011-0000
I0210 17:45:46.977696 10825 master.cpp:2250] Sending 1 offers to framework
201108030032-0000000011-0000
I0210 17:45:47.978766 10817 master.cpp:2250] Sending 1 offers to framework
201108030032-0000000011-0000
I0210 17:45:49.204412 10820 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142101 ] on slave
201309231905-2215191306-5050-53933-32 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:49.204604 10820 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309231905-2215191306-5050-53933-32 for 5secs
I0210 17:45:49.538697 10830 master.cpp:1546] Processing reply for offers: [
201402061850-2215191306-5050-10816-142106 ] on slave
201309231905-2215191306-5050-53933-34 (<host>) for framework
201108030032-0000000011-0000
I0210 17:45:49.538883 10830 hierarchical_allocator_process.hpp:590] Framework
201108030032-0000000011-0000 filtered slave
201309231905-2215191306-5050-53933-34 for 5secs
I0210 17:45:51.022019 10829 http.cpp:319] HTTP request for '/master/stats.json'
I0210 17:45:51.773195 10830 http.cpp:540] HTTP request for '/master/tasks.json'
*** Aborted at 1392054351 (unix time) try "date -d @1392054351" if you are
using GNU date ***
PC: @ 0x7f3fb0a0d9ca mesos::internal::master::TaskComparator::descending()
*** SIGSEGV (@0x4069) received by PID 10816 (TID 0x4781a940) from PID 16489;
stack trace: ***
@ 0x3bdac0eb10 (unknown)
@ 0x7f3fb0a0d9ca mesos::internal::master::TaskComparator::descending()
@ 0x7f3fb0a083e2 std::__unguarded_partition<>()
@ 0x7f3fb0a0a8e8 std::__introsort_loop<>()
@ 0x7f3fb09ff0dc mesos::internal::master::Master::Http::tasks()
@ 0x7f3fb0a3fc2e std::tr1::_Function_handler<>::_M_invoke()
@ 0x7f3fb0e40dd1 std::tr1::function<>::operator()()
@ 0x7f3fb0e3b521 process::ProcessBase::visit()
@ 0x7f3fb0e2d1ab process::ProcessManager::resume()
@ 0x7f3fb0e2db7f process::schedule()
@ 0x3bdac0673d (unknown)
@ 0x3bd9cd3f6d (unknown)
/usr/local/bin/mesos-master.sh: line 83: 10816 Segmentation fault (core
dumped) $debug /usr/local/sbin/mesos-master --root_submissions
--zk=${master_zoo_url} --log_dir=${log_dir} "$@"
Master Exit Status: 139
Abnormal exit detected, sending mail.
BACKTRACE:
[New Thread 10832]
[New Thread 10833]
[New Thread 10831]
[New Thread 10837]
[New Thread 10816]
[New Thread 10834]
[New Thread 10836]
[New Thread 10823]
[New Thread 10818]
[New Thread 10825]
[New Thread 10824]
[New Thread 10835]
[New Thread 10826]
[New Thread 10821]
[New Thread 10819]
[New Thread 10829]
[New Thread 10817]
[New Thread 10822]
[New Thread 10820]
[New Thread 10827]
[New Thread 10828]
Core was generated by `/usr/local/sbin/mesos-master --root_submissions
--zk=zk://mesos:mesos@zookeeper'.
Program terminated with signal 11, Segmentation fault.
#0 ascending (lhs=0xc559c0, rhs=0x4011) at master/http.cpp:524
in master/http.cpp
#0 ascending (lhs=0xc559c0, rhs=0x4011) at master/http.cpp:524
#1 mesos::internal::master::TaskComparator::descending (lhs=0xc559c0,
rhs=0x4011) at master/http.cpp:533
#2 0x00007f3fb0a083e2 in
std::__unguarded_partition<__gnu_cxx::__normal_iterator<mesos::internal::Task
const**, std::vector<mesos::internal::Task const*,
std::allocator<mesos::internal::Task const*> > >, mesos::internal::Task const*,
bool (*)(mesos::internal::Task const*, mesos::internal::Task const*)> (
__first=<value optimized out>, __last=<value optimized out>,
__pivot=0xc559c0,
__comp=0x7f3fb0a0d9a0
<mesos::internal::master::TaskComparator::descending(mesos::internal::Task
const*, mesos::internal::Task const*)>)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/bits/stl_algo.h:2202
#3 0x00007f3fb0a0a8e8 in
std::__introsort_loop<__gnu_cxx::__normal_iterator<mesos::internal::Task
const**, std::vector<mesos::internal::Task const*,
std::allocator<mesos::internal::Task const*> > >, long, bool
(*)(mesos::internal::Task const*, mesos::internal::Task const*)> (__first=...,
__last=<value optimized out>, __depth_limit=19,
__comp=0x7f3fb0a0d9a0
<mesos::internal::master::TaskComparator::descending(mesos::internal::Task
const*, mesos::internal::Task const*)>)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/bits/stl_algo.h:2679
#4 0x00007f3fb09ff0dc in
sort<__gnu_cxx::__normal_iterator<mesos::internal::Task const**,
std::vector<mesos::internal::Task const*, std::allocator<mesos::internal::Task
const*> > >, bool (*)(mesos::internal::Task const*, mesos::internal::Task
const*)> (this=0x47819700, request=<value optimized out>)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/bits/stl_algo.h:2749
#5 mesos::internal::master::Master::Http::tasks (this=0x47819700,
request=<value optimized out>) at master/http.cpp:580
#6 0x00007f3fb0a3fc2e in operator() (__functor=<value optimized out>,
__a1=<value optimized out>)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/tr1/functional_iterate.h:208
#7 operator()<process::http::Request const> (
__functor=<value optimized out>, __a1=<value optimized out>)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/tr1/bind_iterate.h:45
#8 std::tr1::_Function_handler<process::Future<process::http::Response>
()(const
process::http::Request&),std::tr1::_Bind<std::tr1::_Mem_fn<process::Future<process::http::Response>
(mesos::internal::master::Master::Http::*)(const process::http::Request&)>
()(mesos::internal::master::Master::Http, std::tr1::_Placeholder<1>)>
>::_M_invoke(const std::tr1::_Any_data &, const process::http::Request &)
(__functor=<value optimized out>, __a1=<value optimized out>)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/tr1/functional_iterate.h:488
#9 0x00007f3fb0e40dd1 in
std::tr1::function<process::Future<process::http::Response> ()(const
process::http::Request&)>::operator()(const process::http::Request &) const
(this=0x1, __a1=...)
at
/usr/lib/gcc/x86_64-redhat-linux/4.1.2/../../../../include/c++/4.1.2/tr1/functional_iterate.h:868
#10 0x00007f3fb0e3b521 in process::ProcessBase::visit (
this=<value optimized out>, event=<value optimized out>)
at src/process.cpp:3194
#11 0x00007f3fb0e2d1ab in process::ProcessManager::resume (this=0xb1dec0,
process=0x7f3fa8010d10) at src/process.cpp:2594
#12 0x00007f3fb0e2db7f in process::schedule (arg=<value optimized out>)
at src/process.cpp:1290
#13 0x0000003bdac0673d in start_thread () from /lib64/libpthread.so.0
#14 0x0000003bd9cd3f6d in clone () from /lib64/libc.so.6
{noformat}
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)