Hi,

As you may know I am currently optimizing template-related code inside of DMD. Inside DMD code quality is quite high, there is little low hanging fruit.

However there is one thing that suspiciously often shows up the on profilers display. Those are indirect class which have a high number of L2(!) i-cache misses.

These calls don't do a lot of work. They are needed either for downcasts or to verify the dynamic type of an AST-Node. Even without the i-cache related stalls the call overhead alone is something to think about.

For template-heavy code matching template parameters is one of the most frequently operations. Since Template Parameters can be types expressions or symbols the dynamic-types are heavily queried.

First experiments suggest that a speedup of around 12% is possible if the types where accessible directly.

Since dmd uses visitors for many things now the benefit of virtual methods is highly reduced.

Please share your thoughts.

Cheers,
Stefan

Reply via email to