[ 
https://issues.apache.org/jira/browse/TS-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13720381#comment-13720381
 ] 

Leif Hedstrom commented on TS-1330:
-----------------------------------

>From the dupe bug (Yunkai's comment)

{code}

(gdb) bt
#0  0x00000000005f0384 in LogBuffer::checkout_write (this=0x2aeb30002ff0, 
write_offset=0x2aeb09f5f4e8, write_size=152)
    at LogBuffer.cc:287
#1  0x000000000060bc3f in LogObject::_checkout_write (this=0x1ed8800, 
write_offset=0x2aeb09f5f4e8, bytes_needed=152)
    at LogObject.cc:424
#2  0x000000000060c43e in LogObject::log (this=0x1ed8800, lad=0x2aeb09f5f7a0, 
text_entry=0x0) at LogObject.cc:579
#3  0x00000000005f761a in LogObjectManager::log (this=0x1eaf400, 
lad=0x2aeb09f5f7a0) at LogObject.h:395
#4  0x00000000005f6379 in Log::access (lad=0x2aeb09f5f7a0) at Log.cc:1117
#5  0x000000000058506e in HttpSM::update_stats (this=0x2aeb41379f10) at 
HttpSM.cc:6312
#6  0x0000000000584d21 in HttpSM::kill_this (this=0x2aeb41379f10) at 
HttpSM.cc:6273
#7  0x00000000005798ea in HttpSM::main_handler (this=0x2aeb41379f10, 
event=2301, data=0x2aeb4137ba70) at HttpSM.cc:2593
#8  0x00000000004f1f1e in Continuation::handleEvent (this=0x2aeb41379f10, 
event=2301, data=0x2aeb4137ba70)
    at ../iocore/eventsystem/I_Continuation.h:146
#9  0x00000000005bc819 in HttpTunnel::main_handler (this=0x2aeb4137ba70, 
event=103, data=0x2aebb000f050)
    at HttpTunnel.cc:1476
#10 0x00000000004f1f1e in Continuation::handleEvent (this=0x2aeb4137ba70, 
event=103, data=0x2aebb000f050)
    at ../iocore/eventsystem/I_Continuation.h:146
#11 0x00000000006ee000 in write_signal_and_update (event=103, 
vc=0x2aebb000eee0) at UnixNetVConnection.cc:153
#12 0x00000000006ee0f9 in write_signal_done (event=103, nh=0x2aeb08d542a8, 
vc=0x2aebb000eee0) at UnixNetVConnection.cc:180
#13 0x00000000006eef0a in write_to_net_io (nh=0x2aeb08d542a8, 
vc=0x2aebb000eee0, thread=0x2aeb08d51010)
    at UnixNetVConnection.cc:479
#14 0x00000000006ee8c9 in write_to_net (nh=0x2aeb08d542a8, vc=0x2aebb000eee0, 
pd=0x2aeb1466b010, thread=0x2aeb08d51010)
    at UnixNetVConnection.cc:352
#15 0x00000000006eae36 in NetHandler::mainNetEvent (this=0x2aeb08d542a8, 
event=5, e=0x2aeb0a6746e0) at UnixNet.cc:394
#16 0x00000000004f1f1e in Continuation::handleEvent (this=0x2aeb08d542a8, 
event=5, data=0x2aeb0a6746e0)
    at ../iocore/eventsystem/I_Continuation.h:146
#17 0x000000000070f3d2 in EThread::process_event (this=0x2aeb08d51010, 
e=0x2aeb0a6746e0, calling_code=5)
    at UnixEThread.cc:142
#18 0x000000000070f8ef in EThread::execute (this=0x2aeb08d51010) at 
UnixEThread.cc:266
#19 0x000000000070e62f in spawn_thread_internal (a=0x1b3f520) at Thread.cc:88
#20 0x0000003c756077f1 in start_thread () from /lib64/libpthread.so.0
#21 0x0000003c752e570d in clone () from /lib64/libc.so.6
(gdb) f 0
#0  0x00000000005f0384 in LogBuffer::checkout_write (this=0x2aeb30002ff0, 
write_offset=0x2aeb09f5f4e8, write_size=152)
    at LogBuffer.cc:287
287         entry_header->timestamp = tp.tv_sec;
(gdb) p entry_header 
$7 = (LogEntryHeader *) 0x0
(gdb) p old_s
$8 = {ival = 0, s = {offset = 0, num_entries = 0, byte_count = 0, full = 0, 
num_writers = 0}}
(gdb) p new_s
$9 = {ival = 563671507992744, s = {offset = 168, num_entries = 1, byte_count = 
168, full = 0, num_writers = 1}}
(gdb) p m_buffer
$10 = 0x2aeb30005000 "\316\372\316\n\002"
(gdb) p *this
$11 = {write_link = {next = 0x0}, link = {<SLink<LogBuffer>> = {next = 0x0}, 
prev = 0x0}, static M_ID = 57301638, 
  m_unaligned_buffer = 0x2aeb30004fa0 "\350\a", m_buffer = 0x2aeb30005000 
"\316\372\316\n\002", m_size = 18432, 
  m_buf_align = 512, m_write_align = 8, m_state = {ival = 923143098288024, s = 
{offset = 18328, num_entries = 115, 
      byte_count = 18328, full = 1, num_writers = 1}}, m_max_entries = 200, 
m_expiration_time = 1374692424, 
  m_owner = 0x1ed8800, m_header = 0x2aeb30005000, m_id = 57301630, m_references 
= 1}
{code}
                
> Logging related segfault in 3.2.0
> ---------------------------------
>
>                 Key: TS-1330
>                 URL: https://issues.apache.org/jira/browse/TS-1330
>             Project: Traffic Server
>          Issue Type: Bug
>          Components: Logging
>    Affects Versions: 3.2.0
>         Environment: ATS 3.2.0 on RHEL 6.2 64-bit
>            Reporter: David Carlin
>            Assignee: Bryan Call
>            Priority: Critical
>              Labels: A, crash
>             Fix For: 3.5.0
>
>
> I observed the following crash once on one of our ATS boxes - possibly 
> related to TS-1240
> Jul  2 13:56:56 l2 traffic_server[25853]: {0x2b0a391e1700} ERROR: 
> [SSL_NetVConnection::ssl_read_from_net] SSL_ERROR_SYSCALL, underlying IO 
> error: Connection reset by peer
> Jul  2 13:59:56 l2 kernel: [ET_NET 1][25855]: segfault at c ip 
> 000000000058e083 sp 00002b0a2982b740 error 6
> Jul  2 13:59:56 l2 kernel: [ET_NET 3][25857]: segfault at 84 ip 
> 000000000058e083 sp 00002b0a29a31740 error 6 in traffic_server[400000+340000]
> Jul  2 13:59:56 l2 kernel: in traffic_server[400000+340000]
> Jul  2 14:02:59 l2 traffic_cop[25901]: (test) read timeout [180000 ]
> Jul  2 14:02:59 l2 traffic_cop[25901]: server heartbeat failed [1]
> Jul  2 14:03:08 l2 traffic_manager[25826]: {0x7f3f088607e0} FATAL: 
> [LocalManager::pollMgmtProcessServer] Error in read (errno: 104)
> Jul  2 14:03:09 l2 traffic_manager[25826]: {0x7f3f088607e0} FATAL:  (last 
> system error 104: Connection reset by peer)
> Jul  2 14:03:09 l2 traffic_cop[25901]: cannot find traffic_server [1]
> Jul  2 14:03:09 l2 traffic_manager[25826]: {0x7f3f088607e0} ERROR: 
> [LocalManager::sendMgmtMsgToProcesses] Error writing message
> Jul  2 14:03:09 l2 traffic_manager[25826]: {0x7f3f088607e0} ERROR:  (last 
> system error 32: Broken pipe)
> Jul  2 14:03:12 l2 traffic_cop[25901]: cop received child status signal 
> [25826 35584]
> Jul  2 14:03:12 l2 traffic_cop[25901]: traffic_manager not running, making 
> sure traffic_server is dead
> Jul  2 14:03:12 l2 traffic_cop[25901]: spawning traffic_manager
> Jul  2 14:03:13 l2 traffic_manager[18267]: NOTE: --- Manager Starting ---
> Jul  2 14:03:13 l2 traffic_manager[18267]: NOTE: Manager Version: Apache 
> Traffic Server - traffic_manager - 3.2.0 - (build # 52518 on Jun 25 2012 at 
> 18:22:12)
> Jul  2 14:03:13 l2 traffic_manager[18267]: {0x7fe63de3f7e0} STATUS: opened 
> /home/y/logs/trafficserver/manager.log
> Jul  2 14:03:15 l2 traffic_server[18322]: NOTE: --- Server Starting ---
> Jul  2 14:03:15 l2 traffic_server[18322]: NOTE: Server Version: Apache 
> Traffic Server - traffic_server - 3.2.0 - (build # 52518 on Jun 25 2012 at 
> 18:22:31)
> Jul  2 14:03:15 l2 traffic_server[18322]: {0x2b77573ab860} STATUS: opened 
> /home/y/logs/trafficserver/diags.log
> Jul  2 14:03:15 l2 traffic_server[18322]: {0x2b77573ab860} ERROR: Cannot 
> insert duplicate!
> Jul  2 14:03:22 l2 traffic_cop[25901]: server heartbeat succeeded
> [Jul  2 13:56:56.304] Server {0x2b0a391e1700} ERROR: 
> [SSL_NetVConnection::ssl_read_from_net] SSL_ERROR_SYSCALL, underlying IO 
> error: Connection reset by peer
> NOTE: Traffic Server received Sig 11: Segmentation fault
> NOTE: Traffic Server received Sig 11: Segmentation fault
> /home/y/bin/traffic_server - STACK TRACE: 
> /home/y/bin/traffic_server - STACK TRACE: 
> /lib64/libpthread.so.0[0x3b54e0f4a0]
> /lib64/libpthread.so.0[0x3b54e0f4a0]
> /home/y/bin/traffic_server(_ZN9LogBuffer14checkout_writeEPmm+0x153)[0x58e083]
> /home/y/bin/traffic_server(_ZN9LogObject15_checkout_writeEPmm+0xa8)[0x5a64c8]
> /home/y/bin/traffic_server(_ZN9LogObject3logEP9LogAccessPc+0x2f0)[0x5a7e30]
> /home/y/bin/traffic_server(_ZN9LogBuffer14checkout_writeEPmm+0x153)[0x58e083]
> /home/y/bin/traffic_server(_ZN9LogObject15_checkout_writeEPmm+0xa8)[0x5a64c8]
> /home/y/bin/traffic_server(_ZN9LogObject3logEP9LogAccessPc+0x2f0)[0x5a7e30]
> /home/y/bin/traffic_server(_ZN3Log6accessEP9LogAccess+0x146)[0x58f506]
> /home/y/bin/traffic_server(_ZN6HttpSM12update_statsEv+0x630)[0x526c50]
> /home/y/bin/traffic_server(_ZN3Log6accessEP9LogAccess+0x146)[0x58f506]
> /home/y/bin/traffic_server(_ZN6HttpSM9kill_thisEv+0x928)[0x52b548]
> /home/y/bin/traffic_server(_ZN6HttpSM12update_statsEv+0x630)[0x526c50]
> /home/y/bin/traffic_server(_ZN6HttpSM9kill_thisEv+0x928)[0x52b548]
> /home/y/bin/traffic_server(_ZN6HttpSM12main_handlerEiPv+0x198)[0x52b868]
> /home/y/bin/traffic_server(_ZN10HttpTunnel12main_handlerEiPv+0xde)[0x56c3ee]
> /home/y/bin/traffic_server[0x6736a1]
> /home/y/bin/traffic_server(_ZN6HttpSM12main_handlerEiPv+0x198)[0x52b868]
> /home/y/bin/traffic_server(_Z15write_to_net_ioP10NetHandlerP18UnixNetVConnectionP7EThread+0x847)[0x675517]
> /home/y/bin/traffic_server[0x672f81]
> /home/y/bin/traffic_server(_ZN10NetHandler12mainNetEventEiP5Event+0x286)[0x66df96]
> /home/y/bin/traffic_server[0x676ad5]
> /home/y/bin/traffic_server(_ZN7EThread13process_eventEP5Eventi+0xb4)[0x696b14]
> /home/y/bin/traffic_server(_ZN10NetHandler12mainNetEventEiP5Event+0x1f2)[0x66df02]
> /home/y/bin/traffic_server(_ZN7EThread13process_eventEP5Eventi+0xb4)[0x696b14]
> /home/y/bin/traffic_server(_ZN7EThread7executeEv+0x4c3)[0x6974a3]
> /home/y/bin/traffic_server[0x695ae2]
> /lib64/libpthread.so.0[0x3b54e077f1]
> /lib64/libc.so.6(clone+0x6d)[0x3b54ae5ccd]
> /home/y/bin/traffic_server(_ZN7EThread7executeEv+0x4c3)[0x6974a3]
> /home/y/bin/traffic_server[0x695ae2]
> /lib64/libpthread.so.0[0x3b54e077f1]
> /lib64/libc.so.6(clone+0x6d)[0x3b54ae5ccd]
> [Jul  2 14:03:08.684] Manager {0x7f3f088607e0} FATAL: 
> [LocalManager::pollMgmtProcessServer] Error in read (errno: 104)
> [Jul  2 14:03:09.213] Manager {0x7f3f088607e0} FATAL:  (last system error 
> 104: Connection reset by peer)
> [Jul  2 14:03:09.213] Manager {0x7f3f088607e0} NOTE: 
> [LocalManager::mgmtShutdown] Executing shutdown request.
> [Jul  2 14:03:09.213] Manager {0x7f3f088607e0} NOTE: 
> [LocalManager::processShutdown] Executing process shutdown request.
> [Jul  2 14:03:09.496] Manager {0x7f3f088607e0} ERROR: 
> [LocalManager::sendMgmtMsgToProcesses] Error writing message
> [Jul  2 14:03:09.496] Manager {0x7f3f088607e0} ERROR:  (last system error 32: 
> Broken pipe)
> [example_alarm_bin.sh] sent alarm: l2.ycs.uls.yahoo.com [Mon Jul  2 14:03:09 
> 2012] The TS-TM connection is broken for some reason. Either restart TS and 
> TM or correct this error for TM to display TS statistics correctly

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to