LDC with Profile-Guided Optimization (PGO)

Johan Engelen via Digitalmars-d Tue, 15 Dec 2015 15:11:26 -0800

Hi all,

I have been working on adding profile-guided optimization (PGO) to LDC [1][2][3]. At this point, I'd like to hear your input and hope you can help with testing!

Unfortunately, to try it out, you will need to build LDC with LLVM 3.7 yourself. PGO should work on OS X, Linux, and Windows.

A first implementation is mostly complete now: it can generate an executable that will output profile data, and it can use profile data during a second compilation pass (and it will tell LLVM about branch frequencies). LDC itself does not perform any profile-guided optimizations (yet); it leaves that to LLVM.

It works like PGO with Clang, using the -fprofile-instr-generate and -fprofile-instr-use command-line options [4]:

ldc2 -fprofile-instr-generate=test.profraw -run test.d
llvm-profdata merge test.profraw -output test.profdata
ldc2 -fprofile-instr-use=test.profdata test.d -of=test

You should now have the executable "test" with an amazing performance boost ;-)

You can inspect the generated code using LDC's -output-ll switch. Functions should be annotated with call frequencies, and most branches should be annotated with branch_weights metadata. For example:

define void @for_loop() #0 !prof !12
...
!12 = !{!"function_entry_count", i64 234}

for "void for_loop()" that is called 234 times, and

br i1 %3, label %if, label %else, !prof !17
...
!17 = !{!"branch_weights", i32 5, i32 3}

for "if (condition) {...} else {...}"
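For a larger program it can be easier to list the annotations with a script than to read the whole .ll file. Here is a minimal Python sketch, assuming the metadata lines look exactly like the two examples above. The function name summarize_prof_metadata and the regular expressions are illustrative assumptions, not part of LDC or LLVM:

```python
import re

# Matches e.g.  !12 = !{!"function_entry_count", i64 234}
ENTRY_RE = re.compile(r'^(!\d+) = !\{!"function_entry_count", i64 (\d+)\}')
# Matches e.g.  !17 = !{!"branch_weights", i32 5, i32 3}
WEIGHTS_RE = re.compile(r'^(!\d+) = !\{!"branch_weights"((?:, i32 \d+)+)\}')


def summarize_prof_metadata(ll_text):
    """Return (entry_counts, branch_weights), both keyed by metadata id.

    Hypothetical helper: it only recognizes metadata written in the
    exact textual form shown in the examples above.
    """
    entry_counts = {}
    branch_weights = {}
    for line in ll_text.splitlines():
        line = line.strip()
        m = ENTRY_RE.match(line)
        if m:
            entry_counts[m.group(1)] = int(m.group(2))
            continue
        m = WEIGHTS_RE.match(line)
        if m:
            weights = re.findall(r'i32 (\d+)', m.group(2))
            branch_weights[m.group(1)] = [int(w) for w in weights]
    return entry_counts, branch_weights
```

To use it, pass in the text of the file written by -output-ll, for instance summarize_prof_metadata(open("test.ll").read()), where the file name test.ll is an assumption.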

The branch_weights have an offset of 1, so the above means that the condition was true 4 times, and false 2 times. If a certain piece of code is never executed, no metadata is added (i.e. you won't see {!"branch_weights", i32 1, i32 1}). Some branches are intentionally not instrumented/annotated if they lead to terminating code (e.g. array bounds checks and auto-generated nullptr checks on this at class method entry).
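The offset arithmetic can be written out as a small Python sketch. It assumes only the offset of 1 described above. The helper names weights_to_counts and taken_fraction are hypothetical. The fraction is the measured share of executions that took the first branch, which is not necessarily the probability LLVM derives from the raw weights:

```python
def weights_to_counts(weights, offset=1):
    """Undo the offset to recover how often each branch target was taken."""
    return [w - offset for w in weights]


def taken_fraction(weights, offset=1):
    """Measured fraction of executions that took the first (true) target.

    Returns None when the recovered counts sum to zero.
    """
    counts = weights_to_counts(weights, offset)
    total = sum(counts)
    return counts[0] / total if total else None


print(weights_to_counts([5, 3]))  # [4, 2]
```

For the example above, taken_fraction([5, 3]) is 4/6, so the condition was true in about two thirds of the profiled executions.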

I hope you will be able to test and comment on the work. I am very interested in hearing about performance gains (or losses, or no change) for your programs. I am curious to learn for what kinds of code it makes a difference in practice.


Thanks!
  Johan

(future work will probably include coverage analysis (llvm-cov) and support for sampling-based profiles, which should fit naturally with the current implementation)


[1] http://wiki.dlang.org/LDC_LLVM_profiling_instrumentation

[2] https://github.com/JohanEngelen/ldc/tree/pgo (warning: I will rebase soon)

[3] https://github.com/ldc-developers/ldc/pull/1219

[4] http://clang.llvm.org/docs/UsersManual.html#profiling-with-instrumentation
