I've committed a fix in r1336417. Please let me know if this fixes the
problem or if more needs to be done. Thank you!


On Wed, May 9, 2012 at 1:46 PM, Benjamin Hindman <[email protected]>wrote:

> Yes, this looks like it should be the case. :(
>
> I'll fix this bug ASAP. Thanks for reporting!
>
>
>
> On Wed, May 9, 2012 at 8:56 AM, Scott Smith <[email protected]> wrote:
>
>> I've had numerous other segfaults in libprocess, mostly in
>> std::map/rbtree code.  Is it possible that SocketManager::accepted is
>> missing a synchronized(this) {} block?
>>
>> from process.cpp:
>>
>> Socket SocketManager::accepted(int s)
>> {
>>  return sockets[s] = Socket(s);
>> }
>>
>> On Mon, May 7, 2012 at 11:40 PM, Scott Smith <[email protected]>
>> wrote:
>> > I've encountered another segfault in the slave.  This time, nothing
>> > unusual was happening.  Single framework / single user.  Four slaves,
>> > one master, framework run from master.
>> >
>> > version:
>> > svn Revision: 1334534 + proposed fix for MESOS-190:
>> > https://reviews.apache.org/r/5057/diff/2/#index_header
>> >
>> > log messages:
>> > I0508 06:35:21.458798   828 slave.cpp:447] Got assigned task 8:864:0
>> > for framework 201205080535222558218-5050-29475-0004
>> > I0508 06:35:21.459225   829 slave.cpp:689] Got acknowledgement of
>> > status update for task 8:863:0 of framework
>> > 201205080535222558218-5050-29475-0004
>> > F0508 06:35:21.459432   832 process.cpp:1772] Check failed:
>> > sockets.count(s) > 0
>> >
>> > stack trace:
>> > #0  0x00007f0aecdf0445 in raise () from /lib/x86_64-linux-gnu/libc.so.6
>> > #1  0x00007f0aecdf3bab in abort () from /lib/x86_64-linux-gnu/libc.so.6
>> > #2  0x00007f0aedd65dd9 in google::DumpStackTraceAndExit () at
>> > src/utilities.cc:145
>> > #3  0x00007f0aedd5ed9d in google::LogMessage::Fail () at
>> src/logging.cc:1256
>> > #4  0x00007f0aedd6152f in google::LogMessage::SendToLog
>> (this=0x7f0ae8a71c60)
>> >    at src/logging.cc:1216
>> > #5  0x00007f0aedd5e99b in google::LogMessage::Flush
>> (this=0x7f0ae8a71c60)
>> >    at src/logging.cc:1088
>> > #6  0x00007f0aedd61dbd in google::LogMessageFatal::~LogMessageFatal (
>> >    this=0x7f0ae8a71c60, __in_chrg=<optimized out>) at
>> src/logging.cc:1777
>> > #7  0x00007f0aedc93a55 in process::SocketManager::next(int) ()
>> >   from /home/ubuntu/cr/lib/libmesos-0.9.0.so
>> > #8  0x00007f0aedc8e119 in process::send_data(ev_loop*, ev_io*, int) ()
>> >   from /home/ubuntu/cr/lib/libmesos-0.9.0.so
>> > #9  0x00007f0aedd9e6ef in ev_invoke_pending (loop=0x7f0aee119240) at
>> ev.c:1971
>> > #10 0x00007f0aedda2a24 in ev_loop (loop=0x7f0aee119240,
>> flags=<optimized out>)
>> >    at ev.c:2333
>> > #11 0x00007f0aedc8f30d in process::serve(void*) ()
>> >   from /home/ubuntu/cr/lib/libmesos-0.9.0.so
>> > #12 0x00007f0aed17ee9a in start_thread () from
>> > /lib/x86_64-linux-gnu/libpthread.so.0
>> > #13 0x00007f0aeceac4bd in clone () from /lib/x86_64-linux-gnu/libc.so.6
>> > #14 0x0000000000000000 in ?? ()
>> >
>> > --
>> >         Scott
>>
>>
>>
>> --
>>         Scott
>>
>
>

Reply via email to