On 27.12.2013 17:49, Gordon wrote:

> Benjamin,
> Can you point me to your Hashmap implementation? I could perhaps use it
> to improve the timings even more.

https://github.com/Ingrater/druntime/blob/merge64/src/core/hashmap.d

This implementation depends on my own allocator design, but it should be possible to remove that dependency quite easily by replacing all allocations/frees with malloc/free or GC.malloc (with no free). Just make sure that the memory is not initialized beforehand (as new ubyte[sizeInBytes] would do), because that also has a considerable performance impact. Also, when allocating with GC.malloc you should specify the GC.BlkAttr.NO_SCAN flag, because you know that your data does not contain any pointers. That way the GC will not scan that huge memory block, which should speed up collection a lot.
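A minimal sketch of such an allocation (the function name and block size are just illustrative assumptions):

    import core.memory : GC;

    void allocateBuffer()
    {
        enum sizeInBytes = 64 * 1024 * 1024; // illustrative size only

        // GC.malloc does not pre-initialize the block, and NO_SCAN tells
        // the GC that it contains no pointers, so it never scans it.
        void* p = GC.malloc(sizeInBytes, GC.BlkAttr.NO_SCAN);
        ubyte[] buf = (cast(ubyte*) p)[0 .. sizeInBytes];

        // ... fill and use buf; no explicit free is needed with the GC ...
    }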

To improve the performance of the hashmap you can try:
- Specifying different hashing functions (via the hash policy). See https://github.com/Ingrater/thBase/blob/master/src/thBase/policies/hashing.d for more examples of hashing policies (a hedged sketch of such a policy follows after this list).
- Changing the amount of always-free entries in the hashmap (currently 25%); for that, change lines 233, 207 and 210 (not factored into a constant yet). Reducing the free entries might result in fewer cache misses but more linear search, as this is a linear-probing hashmap. Increasing the free entries might reduce the amount of linear search, but it increases cache misses and memory usage.
- Computing a series of prime numbers in which each prime is at least twice as big as the previous one, and using those as the possible sizes for the hashmap (see the sketch after this list). Prime-number sizes give a better distribution of the items within the hashmap, which reduces the amount of linear searching necessary and thus improves the hashmap performance.
- Changing how the next free spot is found. Currently the code just adds 1 to the previous index (line 301). It is also possible to rehash the previous index (just put it into the hashing function again), which gives a new index that is more distant in the hashmap from the current one. This again improves the distribution of the items in the hashmap and thus reduces linear search time, but it may increase the number of cache misses, as the accesses are no longer linear in memory.
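A hypothetical sketch of such a hash policy (the actual interface expected by the hashmap is defined in the thBase file linked above and may differ; this only illustrates swapping the hash function via a policy type):

    // Hypothetical FNV-1a hash policy; the real policy interface
    // in thBase/policies/hashing.d may look different.
    struct Fnv1aHashPolicy
    {
        static uint hash(const(char)[] key)
        {
            uint h = 2166136261u;       // FNV-1a offset basis
            foreach (c; key)
            {
                h ^= cast(ubyte) c;
                h *= 16777619u;         // FNV prime
            }
            return h;
        }
    }

And a small sketch of computing the prime-number size series, with each prime at least twice as big as the previous one:

    // Build a growth sequence of primes, each at least twice the previous
    // one, up to maxSize; these become the possible hashmap sizes.
    size_t[] primeSizes(size_t first, size_t maxSize)
    {
        static bool isPrime(size_t n)
        {
            if (n < 2) return false;
            for (size_t d = 2; d * d <= n; ++d)
                if (n % d == 0) return false;
            return true;
        }

        size_t[] sizes;
        size_t candidate = first;
        while (candidate <= maxSize)
        {
            while (!isPrime(candidate)) ++candidate;
            if (candidate > maxSize) break;
            sizes ~= candidate;
            candidate *= 2; // next prime will be at least twice this one
        }
        return sizes;
    }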

In case you are interested in my implementation of your problem, see here:
http://dpaste.dzfl.pl/8d0d174e

You won't be able to compile the code unless you set up my custom D environment, which I would not recommend unless you really hate the D garbage collector ;-)

Kind Regards
Benjamin Thaut
