On Sunday, 16 December 2012 at 07:47:48 UTC, Rob T wrote:
On Sunday, 16 December 2012 at 05:37:57 UTC, SomeDude wrote:
Isn't the memory management completely negligible when
compared to the database access here?
Here are the details ...
My test run selects and returns 206,085 records with 14 fields
per record.
With all dynamic memory allocations disabled that are used to
create the data structure containing the returned rows, a run
takes 5 seconds. This run does not return any data; it steps through
all the records in exactly the same way, but reads each field into a
temporary stack-allocated variable of the appropriate type.
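Roughly, that no-allocation pass looks like this (a sketch only; rs,
next and get are placeholder names standing in for the actual driver
calls, not its real API):

// Each of the 14 fields is read into a stack temporary and
// immediately discarded, so no heap allocation happens for the data.
void scanWithoutAllocating(RS)(RS rs)
{
    while (rs.next())                   // walk all 206,085 rows
    {
        foreach (col; 0 .. 14)
        {
            auto tmp = rs.get(col);     // stack temporary, value dropped
        }
    }
}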
If I disable the GC before the run and re-enable it immediately
after, it takes 7 seconds. I presume the full 2 extra seconds are
spent disabling and re-enabling the GC, which seems like a lot of time.
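The disable/re-enable around the run is just the usual core.memory
pattern, something like this (run stands in for the actual query loop):

import core.memory : GC;

void runWithGcPaused(scope void delegate() run)
{
    GC.disable();              // no collections while rows are fetched
    scope (exit) GC.enable();  // restore normal collection afterwards
    run();
}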
With all dynamic memory allocations enabled that are used to
create the data structure containing the returned rows, a run
takes 28 seconds. In this case, all 206K records are returned
in a dynamically generated list.
If I disable the GC before the run and re-enable it immediately
after, it takes 11 seconds. Subtracting the 2 seconds spent disabling
and re-enabling the GC leaves 9 seconds, and since the run takes 5
seconds without memory allocations, the allocations themselves account
for about 4 seconds; then again, I'm doing a lot of allocations.
In my case, the structure is dynamically generated by
allocating each individual field of each record returned, so
there are 206,085 records x 14 fields = 2,885,190 allocations
being performed. I can cut the individual allocations down to
about 206,000 by allocating the full record in one shot; however,
this is a stress test designed to work D as hard as possible and
compare it with an identically stressed C++ version.
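To make the two strategies concrete, they look roughly like this
(Field and Record are made-up types for illustration; the real code
uses the driver's own types):

enum rows = 206_085;
enum cols = 14;

// Strategy A: every field is its own heap object,
// rows x cols = 2,885,190 allocations.
class Field { string value; }

Field[][] perField()
{
    auto data = new Field[][](rows, cols);
    foreach (row; data)
        foreach (ref f; row)
            f = new Field;      // one allocation per field
    return data;
}

// Strategy B: the full record is allocated in one shot,
// about 206,000 allocations.
struct Record { string[cols] fields; }

Record*[] perRecord()
{
    auto data = new Record*[](rows);
    foreach (ref r; data)
        r = new Record;         // one allocation per record
    return data;
}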
You cannot expect the GC to perform like manual memory
management. It's a completely unrealistic microbenchmark to
allocate each individual field, even for manual MM. The least you
can do to be a little bit realistic is indeed to allocate one row
at a time. I hope that's what you intend to do. But usually,
database drivers allow the user to tweak the queries and decide
how many rows can be fetched at a time, and it's pretty common to
fetch 50 or 100 rows at a time, meaning only one allocation each
time. It would be interesting to compare the performance of the two
languages in these situations, i.e. one row at a time and 50 rows
at a time.
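For the batched case, a rough sketch of what I mean (fetchInto is an
invented stand-in for whatever the driver offers, and Row just mirrors
the 14-field record):

enum batchSize = 50;

struct Row { string[14] fields; }

// fetchInto is assumed to fill up to batch.length rows and report
// how many it actually wrote.
void fetchBatched(RS)(RS rs, scope void delegate(Row[]) process)
{
    for (;;)
    {
        auto batch = new Row[](batchSize);   // one allocation per 50 rows
        immutable n = rs.fetchInto(batch);
        if (n == 0)
            break;
        process(batch[0 .. n]);
    }
}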