Re: GC memory fragmentation

tchaloupka via Digitalmars-d-learn Tue, 13 Apr 2021 05:35:47 -0700

On Monday, 12 April 2021 at 07:03:02 UTC, Sebastiaan Koppe wrote:

We have similar problems, we see memory usage alternate betweenplateauing and then slowly growing. Until it hits theconfigured maximum memory for that job and the orchestratorkills it (we run multiple instances and have good failover).
I have reduced the problem by refactoring some of our gc usage,but the problem still persists.
On side-note, it would also be good if the GC can be aware ofthe max memory it is allotted so that it knows it needs to domore aggressive collections when nearing it.


I knew this must be a more common problem :)

What I've found in the meantime:

* nice writeup of how GC actually works by Vladimir Panteleev -https://thecybershadow.net/d/Memory_Management_in_the_D_Programming_Language.pdf* described tool (https://github.com/CyberShadow/Diamond) wouldbe very helpfull, but I assume it's for D1 and based on some olddruntime fork :(* we've implemented log rotation using `std.zlib` (by just`foreach (chunk; fin.byChunk(4096).map!(x => c.compress(x)))fout.rawWrite(chunk);`)* oh boy, don't use `std.zlib.Compress` that way, it allocateseach chunk and for a large files it creates large GC memory peaksthat sometimes doesn't go down

  * rewritten using direct `etc.c.zlib` completely out of GC

* currently testing with `--DRT-gcopt=incPoolSize:0` as otherwiseallocated page size multiplies with number of allocated pools *3MB by default* `profile-gc` is not much helpfull in this case as it onlyprints total allocated memory for each allocation on theapplication exit and as it's a long running service using manyvarious libraries it's just hundreds of lines :)* I've considered to fork the process periodically, terminateit and rename the created profile statistics to at least see thedifferences between the states, but still not sure if it wouldhelp much* as I understand the GC it uses different algorithm for smallallocations and for large objects

  * small (`<=2048`)

* it categorizes objects to fixed set of used sizes and foreach uses whole memory page as bucket with free list from whichit reserves memory on request* when the bucket is full, new page is allocated andallocations are provided from that

  * big - similar, but it allocates N pages as a pool

So If I understand it correctly when for example vibe-dinitializes new fiber on some request, it's handled and fiber canbe discarded it can easily lead to a scenario when fiber itselfis allocated in one page, it's filled up during the requestprocessing so new page is allocated and when cleaning, bucketwith fiber cannot be cleaned up as it's added to a `TaskFiber`pool (with a fixed size). This way fiber's bucket would never befreed and easily never be used anymore during the applicationlifetime.

I'm not so sure if pages of small objects (or large) that are notcompletely empty can be reused as a new bucket or only free pagescan be reused.


Does anyone has some insight of this?

Some kind of GC memory dump and analyzer tool as mentioned`Diamond` would be of tremendous help to diagnose this..

Re: GC memory fragmentation

Reply via email to