[Bug 63666] New: Should take the OS buffers into account when timing lingering

bugzilla Thu, 15 Aug 2019 00:51:47 -0700

https://bz.apache.org/bugzilla/show_bug.cgi?id=63666


            Bug ID: 63666
           Summary: Should take the OS buffers into account when timing
                    lingering
           Product: Apache httpd-2
           Version: 2.4-HEAD
          Hardware: PC
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Platform
          Assignee: [email protected]
          Reporter: [email protected]
  Target Milestone: ---

Created attachment 36718
  --> https://bz.apache.org/bugzilla/attachment.cgi?id=36718&action=edit
python test case

Note version tested is 2.4.41, however the version field doesn't seem to have
that one.

For context; we're using bmaptool (https://github.com/intel/bmap-tools) to
flasy embedded boards over the network; bmap can on the fly download an image,
uncompress it and write to storage (e.g. SD card). As the input image is
compressed the amount work bmaptool needs to do fluctutes heavily (e.g. towards
the end of an image the content will mosty be zeros, which means for a very
amount of small compressed data transfer you get a big amount of compressed
data).

What we saw practically happening is on some specific boards/images apache ends
up resetting the connection when the data transfer was nearly finished.


Tracing this down what happens is that the connection ens up in FIN-WAIT-1
(iotw. apache has shutdown its write side of the connection already) with quite
some amount of data left in the send queue as the connection was stalled at
that time, after 30 seconds the connection gets reset.

On the apache site what happens is that it simply finishing writing all its
data to the socket, shuts down the write side, lingers for maximally 30 seconds
and then closes, which
https://svn.apache.org/viewvc?view=revision&revision=1802875 forces a
connection reset (on older versions it would "linger"/be "orphaned" on the OS
side).


On the network side what happens is that download is stalled (bmaptool is busy)
as the recevier window is full, which means that even though apache is already
lingering not all data has been transferred and FIN hasn't been sent yet. This
is then followed by RST packet as Apache causes the connection to be dropped,
with the receiver never having a chance to see all data (or the FIN).


What should probably happen is that when apache does it's lingering it should
check the send queue size on the OS side before hard terminating the connection
(or leave it up to the OS which is what happened previously) as the connection
simply might have slowed down enough to not be able to drain the send queues
within 30 seconds...


I've attached a minimal python test case that shows the issue; The key there is
to tweak the code a bit the setup such that apache is lingering with a good
amount of data left in the send queue when the 40 seconds sleep happens.

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[Bug 63666] New: Should take the OS buffers into account when timing lingering

Reply via email to