Sudheer Vinukonda created TS-3714:
-------------------------------------
Summary: TS seems to stall between ua_begin and ua_first_read on some transactions, resulting in high response times.
Key: TS-3714
URL: https://issues.apache.org/jira/browse/TS-3714
Project: Traffic Server
Issue Type: Bug
Components: Core
Reporter: Sudheer Vinukonda
An example slow log entry showing a very high *ua_first_read*:
{code}
ERROR: [8624075] Slow Request: client_ip: xx.xx.xx.xxx url: http://xxxxxxxxxxxxxxxxxx....
status: 200 unique id: bytes: 86 fd: 0 client state: 0 server state: 9
ua_begin: 0.000 ua_first_read: 42.224 ua_read_header_done: 42.224
cache_open_read_begin: 42.224 cache_open_read_end: 42.224
dns_lookup_begin: 42.224 dns_lookup_end: 42.224
server_connect: 42.224 server_first_read: 42.229 server_read_header_done: 42.229
server_close: 42.229 ua_begin_write: 42.229 ua_close: 42.229 sm_finish: 42.229
{code}
Initially, I suspected it might be caused by browsers connecting early, before
sending any bytes to TS. However, this is happening too frequently, and with an
unrealistically high delay between *ua_begin* and *ua_first_read*, for that to
be the explanation.
I suspect it's caused by the code that temporarily disables the read before
calling out to the *TXN_START* hook and re-enables it after the API callout
returns. The read disable is done via a 0-byte *do_io_read* on the client vc,
but the problem is that a valid *mbuf* is still passed. Based on what I am
seeing, it's possible this leaves the *read_vio* effectively enabled all the
way down to *ssl_read_from_net*, for instance (if there's a race and bytes had
already arrived from the client, producing an epoll read event). That path
would finally disable the read, since *ntodo* (nbytes) is 0, but the epoll
event may be lost until new i/o arrives from the client.
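To make the suspected sequence easier to follow, here's a rough, self-contained
sketch. This is hypothetical code: *NetVC*, *read_from_net*, and the other
names below are simplified stand-ins for the real ATS classes and functions
(NetVConnection, VIO, MIOBuffer, ssl_read_from_net), not actual Traffic Server
source.
{code}
// Hypothetical, simplified sketch of the suspected sequence.
#include <cstdint>
#include <cstdio>

struct MIOBuffer {};            // stand-in for the real mbuf

struct VIO {
  int64_t nbytes = 0;           // total bytes requested
  int64_t ndone  = 0;           // bytes completed so far
  MIOBuffer *buffer = nullptr;  // a non-null buffer makes the vio look "live"
  int64_t ntodo() const { return nbytes - ndone; }
};

struct NetVC {
  VIO read_vio;
  bool read_enabled = false;

  // 1. The state machine "disables" the read before the TXN_START callout
  //    with a 0-byte do_io_read, but still hands over a valid mbuf.
  VIO *do_io_read(int64_t nbytes, MIOBuffer *buf) {
    read_vio.nbytes = nbytes;
    read_vio.buffer = buf;
    // Suspicion: with a non-null buffer, the vio can end up re-enabled.
    read_enabled = (buf != nullptr);
    return &read_vio;
  }
};

// 3. Simplified net read path (think ssl_read_from_net): if the vio has
//    nothing left to do, it disables the read and consumes the event.
void read_from_net(NetVC &vc) {
  if (vc.read_vio.ntodo() <= 0) {
    vc.read_enabled = false;    // read disabled here...
    return;                     // ...and the epoll event is effectively lost
  }
  // normal read path would run here
}

int main() {
  NetVC vc;
  MIOBuffer mbuf;

  // "Disable" the read before TXN_START, but with a valid mbuf.
  vc.do_io_read(0, &mbuf);

  // 2. Race: client bytes were already pending, so an epoll read event fires.
  if (vc.read_enabled) {
    read_from_net(vc);          // ntodo == 0 -> read disabled, event consumed
  }

  // 4. After the hook re-enables the read, no epoll event arrives until the
  //    client sends more bytes, so the transaction stalls between ua_begin
  //    and ua_first_read.
  std::puts(vc.read_enabled ? "read enabled"
                            : "read disabled; waiting on new client i/o");
}
{code}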
I'm trying out further experiments to confirm the theory.