Jian-Xin Lai <laij...@gmail.com> wrote:
> I tried the Open64 PGO on these benchmarks. Basically, the training
> executable runs about 20 times slower. I guess the overhead of Open64
> PGO is comparable to that of ICC.

What are the exact options you used when trying PGO?  I found that the
C-tran code was about 20 times slower, but the expression template
code was much worse than that.

> But there is not much performance gain from Open64 PGO. Since all
> test cases are single file, "-O3 -OPT:Ofast" may work better.

That is what I would have thought, but the FTensor results for the Intel
compiler were much, much improved with PGO.

Thanks,
Walter Landry

_______________________________________________
Open64-devel mailing list
Open64-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/open64-devel
