On Saturday, 3 November 2018 at 14:26:02 UTC, dwdv wrote:
Hi there,

the task is simple: count word occurrences from stdin (around 150 MB in this case) and print sorted results to stdout in a somewhat idiomatic fashion.

Now, D is quite elegant while maintaining high performance compared to both C and C++, but I, as a complete beginner, can't identify where the 10x memory usage (~300 MB, see results below) is coming from.

Unicode overhead? Internal buffer? Is something slurping the whole file? Assoc array allocations? Couldn't find huge allocs with dmd -vgc and -profile=gc either. What did I do wrong?

Not exactly the same problem, but there is relevant discussion in the blog post I wrote a while ago: https://dlang.org/blog/2017/05/24/faster-command-line-tools-in-d/

See in particular the section on Associative Array lookup optimization. This takes advantage of the fact that it's only necessary to create the immutable string the first time a key is entered into the hash. Subsequent occurrences do not need to take this step. As creating the immutable string allocates new memory, even if it is only used temporarily, skipping it is a meaningful savings.
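
As a rough sketch of that idiom (names and loop structure are mine, illustrative rather than the exact code from the post):

    import std.algorithm : splitter;
    import std.stdio : stdin;

    uint[string] counts;
    foreach (line; stdin.byLine)                // 'line' is a reused char[] buffer
    {
        foreach (word; line.splitter(' '))
        {
            if (auto count = word in counts)
                ++(*count);                     // existing key: no allocation
            else
                counts[word.idup] = 1;          // new key: one immutable copy
        }
    }

Looking up with the mutable 'word' slice avoids the copy entirely for keys that are already present.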

There have been additional APIs added to the AA interface since I wrote the blog post; I believe it is now possible to accomplish the same thing with more succinct code.

Other optimization possibilities:
* Avoid auto-decode: Not sure if your code is hitting this, but if so it's a significant performance hit. Unfortunately, it's not always obvious when this is happening. The task you are performing doesn't need auto-decode because it is splitting on single-byte UTF-8 char boundaries (newline and space). A short sketch follows this list.

* LTO on druntime/phobos: This is easy and will give a material speedup. Simply add
        '-defaultlib=phobos2-ldc-lto,druntime-ldc-lto'
to the 'ldc2' build line, after the '-flto=full' entry. This will be a win because it enables a number of optimizations in the inner loop. An example build line follows this list.

* Reading the whole file vs line by line - 'byLine' is really fast. It's also nice and general, as it allows reading arbitrarily sized files or standard input without changes to the code. However, it's not as fast as reading the file in a single shot. See the sketch after this list.

* std.algorithm.joiner - Has improved dramatically, but is still slower than a foreach loop. See: https://github.com/dlang/phobos/pull/6492
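
On the auto-decode point, one way to be certain no decoding happens is to operate on the raw bytes; a minimal sketch (assuming space-separated words, names mine):

    import std.algorithm : splitter;
    import std.stdio : stdin;
    import std.string : assumeUTF, representation;

    foreach (line; stdin.byLine)
    {
        // ubyte[] is never auto-decoded by Phobos range algorithms
        foreach (word; line.representation.splitter(cast(ubyte) ' '))
        {
            auto w = word.assumeUTF;            // view the ubyte[] slice as char[] again
        }
    }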
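
For the LTO item, an illustrative build line (the source file name is a placeholder):

    ldc2 -O3 -release -flto=full -defaultlib=phobos2-ldc-lto,druntime-ldc-lto wordcount.d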
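
For reading in a single shot, a rough sketch using a file path rather than stdin (the path is hypothetical):

    import std.algorithm : splitter;
    import std.file : readText;

    auto text = readText("input.txt");          // slurp the whole file at once
    foreach (line; text.splitter('\n'))
        foreach (word; line.splitter(' '))
        {
            // 'word' is an immutable slice into 'text', so it can be used
            // directly as an AA key without an .idup copy
        }

A nice side effect: because the slices are immutable and outlive the loop, the key copy from the earlier AA sketch is no longer needed.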

--Jon

