On Friday, 21 August 2015 at 09:17:28 UTC, Iain Buclaw wrote:
There's a paper somewhere about optimisations on Intel processors that says that -O2 produces overall better results than -O3 (I'll have to dig it out).
That being said, I recently compared the performance of the datetime library using different algorithms. One function of interest computes the year from a raw time value: D1 had a loop-based implementation that iterated over years until it reached the source raw time, while current Phobos has a loop-free implementation that carefully reduces the day count to a year (roughly in the style of the sketch below). I wrote two tests that iterate over days and call the date-time conversion functions. The test that invoked yearFromDays directly showed the loop-free implementation to be faster, but the bigger test, which performed the full conversion between date and time, showed the loop version to be faster by 5%. Quite unintuitive. Could it be a cache effect? The loop function is smaller, but the whole executable is only 15 KB, so it should fit entirely in the processor cache.
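
To make the comparison concrete, here is a minimal sketch of the two strategies. This is not the actual D1 or Phobos code: the names yearFromDaysLoop and yearFromDaysDirect are hypothetical, and I assume day 0 is 1 January of year 1 in the proleptic Gregorian calendar.

bool isLeapYear(int year)
{
    return (year % 4 == 0 && year % 100 != 0) || year % 400 == 0;
}

// D1-style: walk forward one year at a time until the remaining
// day count fits inside the current year.
int yearFromDaysLoop(int days)
{
    int year = 1;
    while (true)
    {
        immutable len = isLeapYear(year) ? 366 : 365;
        if (days < len)
            return year;
        days -= len;
        ++year;
    }
}

// Loop-free: peel off 400-year, 100-year and 4-year Gregorian
// cycles, clamping where the leap day falls at the end of a cycle.
int yearFromDaysDirect(int days)
{
    int year = 1 + (days / 146_097) * 400;   // 400 years = 146_097 days
    days %= 146_097;

    int centuries = days / 36_524;           // 100 years = 36_524 days
    if (centuries == 4) centuries = 3;       // year 400 of the era is leap
    year += centuries * 100;
    days -= centuries * 36_524;

    year += (days / 1_461) * 4;              // 4 years = 1_461 days
    days %= 1_461;

    int years = days / 365;
    if (years == 4) years = 3;               // leap day ends the 4-year cycle
    return year + years;
}

unittest
{
    // The two strategies must agree over a long span of days.
    foreach (day; 0 .. 200_000)
        assert(yearFromDaysLoop(day) == yearFromDaysDirect(day));
}

The loop version is a handful of instructions with a data-dependent trip count; the direct version is longer, branchier code with several divisions, which is one plausible reason the smaller loop can win once it is inlined into a larger conversion routine.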
