Thanks for the clarification.
A small correction: I meant writev() calls instead of sendfile() when
working with small-size buckets.
The filter I'm developing provisionally splits the supplied buckets
into relatively small buckets during content parsing. It then removes
some of them and inserts some other buckets. Before passing the
resulting brigade further down the filter chain, it merges all buckets
that have their data in contiguous memory regions back together. So I
guess I'm doing my bit in preventing excessive chunking.
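Roughly, the merge step looks something like this; a simplified sketch
rather than the actual filter code, and it only covers the common case of
adjacent heap buckets that still share the storage they were split from
(the function name is made up):

    #include "apr_buckets.h"

    /* Sketch only: re-merge adjacent heap buckets that reference
     * back-to-back ranges of the same shared heap block, e.g. pieces
     * produced earlier by apr_bucket_split(). */
    static void remerge_contiguous(apr_bucket_brigade *bb)
    {
        apr_bucket *b = APR_BRIGADE_FIRST(bb);

        while (b != APR_BRIGADE_SENTINEL(bb)
               && APR_BUCKET_NEXT(b) != APR_BRIGADE_SENTINEL(bb)) {
            apr_bucket *next = APR_BUCKET_NEXT(b);

            if (APR_BUCKET_IS_HEAP(b) && APR_BUCKET_IS_HEAP(next)
                && b->data == next->data                /* same heap block */
                && b->start + b->length == next->start  /* adjacent ranges */) {
                b->length += next->length;  /* widen the first bucket */
                apr_bucket_delete(next);    /* and drop the second    */
            }
            else {
                b = next;
            }
        }
    }

That keeps the brigade short without copying any data, since both buckets
already reference the same refcounted heap storage.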
I've done some research on the source files of httpd-2.2.6. The CORE
output filter seems to do this kind of de-chunking when 16 or more
buckets are passed to it (more precisely, the brigade is split at flush
buckets and each resulting part is checked for 16 buckets) AND the total
number of bytes in those buckets does not exceed 8000. The filter then
buffers the buckets together. Very clever.
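In outline, the decision it seems to make per (flush-delimited) part of
the brigade is something like the following; the constants and the helper
name here are illustrative, not the identifiers actually used in the core
output filter:

    #include "apr_buckets.h"

    #define SMALL_WRITE_BUCKETS 16    /* "16 or more buckets"          */
    #define SMALL_WRITE_BYTES   8000  /* "...not exceeding 8000 bytes" */

    /* Sketch: decide whether a brigade of many small buckets should be
     * buffered into one block before being written to the network. */
    static int should_buffer(apr_bucket_brigade *bb)
    {
        apr_bucket *b;
        int nbuckets = 0;
        apr_off_t nbytes = 0;

        for (b = APR_BRIGADE_FIRST(bb);
             b != APR_BRIGADE_SENTINEL(bb);
             b = APR_BUCKET_NEXT(b)) {
            nbuckets++;
            if (b->length != (apr_size_t)-1) {  /* skip unknown lengths */
                nbytes += b->length;
            }
        }
        return nbuckets >= SMALL_WRITE_BUCKETS && nbytes <= SMALL_WRITE_BYTES;
    }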
KC
On 26 Mar 2008, at 15:22, Dirk-Willem van Gulik wrote:
On Mar 26, 2008, at 4:15 PM, Konstantin Chuguev wrote:
Can you please clarify your mention of the bucket-brigade footprint?
Are they so slow that they make a memory-based cache no more efficient
than a disk-based one? Or is it the opposite: sendfile() works so well
that serving content from memory is not any faster?
No - they are very fast (in an absolute sense) - and your approach
is almost certainly the right one.
However, all in all there is a lot of logic surrounding them; and if
you are trying to squeeze out the very last drop (e.g. the 1x1 gif
example) you run into all sorts of artificial limits, particularly on
Linux and 2x2 core machines, as the memory which needs to be accessed
is just a little more scattered than one would prefer, and there is all
manner of competition around the IRQ handling in the kernel and so on.
Or in other words - in a pure static case where you are serving very
small files which rarely if ever change, have no variance to any
inbound headers, etc - things are not ideal.
But that is a small price to pay - i.e. Apache is more of a Swiss Army
knife, which saws OK, but a proper hacksaw is 'better'.
I'm developing an Apache output filter for highly loaded servers
and proxies that juggles small-size buckets and brigades
extensively. I'm not at the stage yet where I can do performance
tests but if I knew this would definitely impact performance, I
would perhaps switch to fixed-size buffers straight away...
I'd bet you are on the right track. However there is -one- small
concern; sometimes when you have looooots of buckets and very chunked
output, you get lots and lots of 1-5 byte chunks, each prefixed by its
own chunk-size line. And this can get really inefficient.
Perhaps we need a de-bucketer to 'dechunk' when outputting chunked.
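With chunked encoding every fragment is framed by a hex chunk-size line
and CRLFs, so a 3-byte fragment can cost more framing than payload. Such
a de-bucketer could be little more than a pass that copies runs of tiny
data buckets into one heap bucket before the chunk filter sees them; a
rough sketch (function name and threshold made up):

    #include "apr_buckets.h"

    #define TINY_BUCKET_LIMIT 64  /* coalesce anything smaller than this */

    /* Sketch: pass large and metadata buckets through untouched, copy
     * runs of tiny data buckets into a heap bucket at the tail of 'out'. */
    static apr_status_t coalesce_tiny(apr_bucket_brigade *in,
                                      apr_bucket_brigade *out)
    {
        while (!APR_BRIGADE_EMPTY(in)) {
            apr_bucket *b = APR_BRIGADE_FIRST(in);
            const char *data;
            apr_size_t len;
            apr_status_t rv;

            if (APR_BUCKET_IS_METADATA(b) || b->length > TINY_BUCKET_LIMIT) {
                APR_BUCKET_REMOVE(b);
                APR_BRIGADE_INSERT_TAIL(out, b);
                continue;
            }
            rv = apr_bucket_read(b, &data, &len, APR_BLOCK_READ);
            if (rv != APR_SUCCESS) {
                return rv;
            }
            /* apr_brigade_write() appends to (or extends) a heap bucket
             * at the tail of 'out', so consecutive tiny buckets end up
             * in one buffer and, later, one chunk. */
            rv = apr_brigade_write(out, NULL, NULL, data, len);
            if (rv != APR_SUCCESS) {
                return rv;
            }
            apr_bucket_delete(b);
        }
        return APR_SUCCESS;
    }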
Dw
Konstantin Chuguev
Software Developer
Mobile: +44 7734 955973
Fax: +44 20 7509 9600
Clickstream Technologies PLC, 58 Davies Street, London, W1K 5JF,
Registered in England No. 3774129