Jian-Xin Lai <laij...@gmail.com> wrote: > I tried the Open64 PGO on these benchmarks. Basically, the training > executable runs about 20 times slower. I guess the overhead of open64 > PGO is comparable as ICC.
What are the exact options you used when trying PGO? I found that the C-tran code was about 20 times slower, but the expression template code was much worse than that. > But there is not much performance gain from Open64 PGO. Since all > test cases are single file, "-O3 -OPT:Ofast" may works better. That what I would have thought, but the FTensor results for the Intel compiler were much, much improved with PGO. Thanks, Walter Landry ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ Open64-devel mailing list Open64-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/open64-devel