Re: Policy for exposing range structs

Johan Engelen via Digitalmars-d Tue, 19 Apr 2016 07:51:12 -0700

On Friday, 1 April 2016 at 14:46:42 UTC, Johan Engelen wrote:

Meanwhile, I've implemented hashing of function names and othersymbols *for the backend*, giving an object file size reductionof ~25% (hashing everything larger than 100 chars) for mycurrent testcase (251MB -> 189MB).Hashing symbols in the FE is not possible with my testcasebecause of std.traits.ParameterStorageClassTuple... :/


See my PR for LDC:
https://github.com/ldc-developers/ldc/pull/1445

"This adds MD5 hashing of symbol names that are larger thanthreshold set by -hashthres.

What is very unfortunate is that std.traits depends on themangled name, doing string parsing of the mangled name of symbolsto obtain symbol traits. This means that mangling cannot bechanged (dramatically, like hashing) at a high level, and thehashing has to be done on a lower level.


Hashed symbols look like this:
_D3one3two5three3L3433_46a82aac733d8a4b3588d7fa8937aad66Result3fooZ
ddemangle gives:
one.two.three.L34._46a82aac733d8a4b3588d7fa8937aad6.Result.foo

Meaning: this symbol is defined in module one.two.three on line34. The identifier is foo and is contained in the struct or classResult.


Symbols that may be hashed:
- functions
- struct/class initializer
- vtable
- typeinfo (needed surgery inside FE code)

The feature is experimental, and has been tested on Weka.io'scodebase. Compilation with -hashthres=1000 results in a binarythat is half the size of the original (201MB vs. 461MB). I didnot observe a significant difference in total build times. Hashthreshold of 8000 gives 229MB, 800 gives 195MB binary size: thereis not much gain after a certain hash threshold.Linking Weka's code fails with a threshold of 500: phoboscontains a few large symbols (one larger than 8kb!) and this PRcurrently does not disable hashing of symbols that are insidephobos, hence "experimental". Future work could try to figure outwhether a symbol is inside phobos or not."

Re: Policy for exposing range structs

Reply via email to