Re: [Haifux] GCC Conundrum

2009-06-19 Thread Ilya A. Volynets-Evenbakh
Were results different (as in the way one would expect) with earlier
versions of
gcc?

Shlomi Fish wrote:
 I'm getting strange results (and counter-intuitive ones) when trying to 
 compile a program using gcc-4.4.0-3mnb2 on Mandriva Linux Cooker.

 The program in question is:

 https://svn.berlios.de/svnroot/repos/fc-solve/trunk/fc-solve/source

 (you can checkout it using svn co).

 And I'm getting the following benchmark timings: (in seconds)

 {{
 dump:126.027262926102
 dump-2:124.866523981094
 dump-3:125.149952888489
 dump-4:125.010930776596
 dump-aft-reboot:127.221956968307
 dump-flip:123.10272693634
 dump-flip-2:121.934360027313
 dump-flip-3:121.98522400856
 dump-old:130.334769010544
 dump-old-aft-reboot:122.411857128143
 }}

 dump-old is of an old revision before I rebooted. dump-old-aft-reboot is the 
 same version after I reboot the computer.

 The dump-flip's are after doing ./Tatzer -l p4b and then make (and make 
 install) and then running:

  ./freecell-solver-range-parallel-solve 1 32000 500 -l gi

 The dump-[234] are after doing ./Tatzer -l p4b --without-flip. What this 
 flag does is define the FCS_WITHOUT_CARD_FLIPPING macro, which in turn 
 toggles 
 some #ifndef's in the code that eliminate a lot of code. I verified that with 
 --without-flip binaries are smaller.

 My question is: why is the --without-flip binaries slower than the ones with 
 the flipping logic? So far my best bet is that with the flipping logic 
 enabled 
 (but underused) the code aligns to have fewer cache misses and that it is 
 just 
 a co-incidence that this is the case.

 I'll appreciate any insights.

 Regards,

   Shlomi Fish

   


-- 
Ilya A. Volynets-Evenbakh
http://www.total-knowledge.com

___
Haifux mailing list
Haifux@haifux.org
http://hamakor.org.il/cgi-bin/mailman/listinfo/haifux


Re: [Haifux] GCC Conundrum

2009-06-19 Thread Shlomi Fish
On Sunday 07 June 2009 07:20:00 Ilya A. Volynets-Evenbakh wrote:
 Were results different (as in the way one would expect) with earlier
 versions of
 gcc?


I didn't try. I added this compilation option recently.

Regards,

Shlomi Fish

 Shlomi Fish wrote:
  I'm getting strange results (and counter-intuitive ones) when trying to
  compile a program using gcc-4.4.0-3mnb2 on Mandriva Linux Cooker.
 
  The program in question is:
 
  https://svn.berlios.de/svnroot/repos/fc-solve/trunk/fc-solve/source
 
  (you can checkout it using svn co).
 
  And I'm getting the following benchmark timings: (in seconds)
 
  {{
  dump:126.027262926102
  dump-2:124.866523981094
  dump-3:125.149952888489
  dump-4:125.010930776596
  dump-aft-reboot:127.221956968307
  dump-flip:123.10272693634
  dump-flip-2:121.934360027313
  dump-flip-3:121.98522400856
  dump-old:130.334769010544
  dump-old-aft-reboot:122.411857128143
  }}
 
  dump-old is of an old revision before I rebooted. dump-old-aft-reboot is
  the same version after I reboot the computer.
 
  The dump-flip's are after doing ./Tatzer -l p4b and then make (and
  make install) and then running:
 
   ./freecell-solver-range-parallel-solve 1 32000 500 -l gi
 
  The dump-[234] are after doing ./Tatzer -l p4b --without-flip. What
  this flag does is define the FCS_WITHOUT_CARD_FLIPPING macro, which in
  turn toggles some #ifndef's in the code that eliminate a lot of code. I
  verified that with --without-flip binaries are smaller.
 
  My question is: why is the --without-flip binaries slower than the ones
  with the flipping logic? So far my best bet is that with the flipping
  logic enabled (but underused) the code aligns to have fewer cache misses
  and that it is just a co-incidence that this is the case.
 
  I'll appreciate any insights.
 
  Regards,
 
  Shlomi Fish

-- 
-
Shlomi Fish   http://www.shlomifish.org/
http://www.shlomifish.org/humour/ways_to_do_it.html

God gave us two eyes and ten fingers so we will type five times as much as we
read.
___
Haifux mailing list
Haifux@haifux.org
http://hamakor.org.il/cgi-bin/mailman/listinfo/haifux