taoyunxing created TS-4279:
------------------------------
Summary: ATS falls into a dead loop on cache directory overflow
Key: TS-4279
URL: https://issues.apache.org/jira/browse/TS-4279
Project: Traffic Server
Issue Type: Bug
Components: Cache
Reporter: taoyunxing
CPU: 40 cores, Mem: 120GB, Disk: 1 * 300 (sys) + 11 * 899GB
records.config:
CONFIG proxy.config.cache.min_average_object_size INT 1048576
CONFIG proxy.config.cache.ram_cache.algorithm INT 1
CONFIG proxy.config.cache.ram_cache_cutoff INT 4194304
CONFIG proxy.config.cache.ram_cache.size INT 64424509440
storage.config:
/dev/sdc id=cache.disk.1
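For orientation, here is a rough sketch of how the settings above bound the number of objects each disk can index. It assumes the commonly described sizing rule that the cache directory gets about one entry per `min_average_object_size` bytes of storage, and it assumes "899GB" means decimal gigabytes. The function name `estimated_dir_entries` is made up for illustration and is not part of ATS.

```python
# Values copied from the report above.
MIN_AVERAGE_OBJECT_SIZE = 1048576   # proxy.config.cache.min_average_object_size
DISK_SIZE_BYTES = 899 * 10**9       # one 899GB cache disk (assumption: decimal GB)


def estimated_dir_entries(disk_bytes: int, avg_object_size: int) -> int:
    """Rough estimate of directory entries for one disk.

    Assumes roughly one directory entry per avg_object_size bytes of
    storage; the real directory is rounded to whole buckets and segments.
    """
    return disk_bytes // avg_object_size


entries = estimated_dir_entries(DISK_SIZE_BYTES, MIN_AVERAGE_OBJECT_SIZE)
print(f"estimated directory entries per disk: {entries}")
```

Under these assumptions the estimate comes out at roughly 857 thousand entries per disk. If the objects actually stored are on average much smaller than 1 MiB, the directory could fill up well before the disk does.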
I encountered what looks like a dead loop in ATS 5.3.1 on two production
hosts. A burst of warnings like the following appears in diags.log:
{code}
[Mar 16 13:04:32.730] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.732] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.733] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.735] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.737] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.739] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.742] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.744] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.747] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.750] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.753] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
[Mar 16 13:04:32.756] Server {0x2b8ffc544700} WARNING: <CacheDir.cc:502
(freelist_clean)> cache directory overflow on '/dev/sde' segment 4, purging...
{code}
ATS restarts every few hours, and the number of TCP connections in TIME_WAIT
is far higher than the number in ESTABLISHED.
The following is the current cache directory snapshot from those hosts:
{code}
Directory for [cache.disk.1 172032:109741163]
Bytes: 8573600
Segments: 14
Buckets: 15310
Entries: 857360
Full: 852904
Empty: 4085
Stale: 0
Free: 371
Bucket Fullness: 4085 15800 32044 41621
42175 33137 22232 12605
Segment Fullness: 60903 60918 60914 60947 60956
60947 60872 60943 60918 60927
60858 60917 60927 60957
Freelist Fullness: 45 30 27 13 0
7 89 5 32 12
83 0 20 8
{code}
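As a quick sanity check on the snapshot, the sketch below recomputes the totals from the per-segment lists and reports how full the directory is. All numbers are copied from the snapshot. The variable names, and the assumption that the per-segment lists are in segment order starting at index 0, are mine.

```python
# Numbers copied verbatim from the directory snapshot above.
segments, entries = 14, 857360
full, empty, stale, free = 852904, 4085, 0, 371
segment_fullness = [60903, 60918, 60914, 60947, 60956,
                    60947, 60872, 60943, 60918, 60927,
                    60858, 60917, 60927, 60957]
freelist_fullness = [45, 30, 27, 13, 0,
                     7, 89, 5, 32, 12,
                     83, 0, 20, 8]

# The per-segment lists should add up to the reported totals.
assert len(segment_fullness) == len(freelist_fullness) == segments
assert full + empty + stale + free == entries
assert sum(segment_fullness) == full
assert sum(freelist_fullness) == free

# Overall fill level, and segments whose freelist is completely used up
# (assumption: list position i corresponds to segment i, counted from 0).
fill_ratio = full / entries
exhausted = [i for i, n in enumerate(freelist_fullness) if n == 0]

print(f"directory fill: {fill_ratio:.2%}")
print(f"segments with an empty freelist: {exhausted}")
```

With these numbers the directory is about 99.5% full, and the freelists at positions 4 and 11 are empty. If the zero-based ordering assumption holds, position 4 matches the "segment 4" named in the warnings.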
I wonder why this happens. Can anyone help me? Thanks a lot.