Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 9:21 PM, Ian Smith smi...@nimnet.asn.au wrote: On Wed, 14 Apr 2010, Garrett Cooper wrote:   On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper yanef...@gmail.com wrote:   On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA cha...@mac.com wrote:   Hi Andry and Adam     My

Re: Linux static linked ver doesn't work on FBSD (Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 10:26 PM, Maho NAKATA cha...@mac.com wrote: From: Pieter de Goeje pie...@degoeje.nl Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 16:05:18 +0200 I think the best test would be to run

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-15 Thread Adrian Chadd
May I make a suggestion? Would you mind creating a shared google spreadsheet with your testing results and a shared google document with the test setup? I think having the data in an easily represented, easily shared medium would be beneficial to everyone. Adrian On 15 April 2010 08:46, Maho

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Andriy Gapon
on 14/04/2010 20:47 Adam Vande More said the following: I'm no expert Andriy, but it seems like if gotoblas implemented some of the FreeBSD optimizations then we'd be in the same ballpark. This is a good point. But on the other hand, it means that our scheduler doesn't do a perfect job here.

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Adam Vande More
On Thu, Apr 15, 2010 at 3:54 AM, Andriy Gapon a...@freebsd.org wrote: This is a good point. But on the other hand, it means that our scheduler doesn't do a perfect job here. BTW, I use ULE. My observation is that when a number of CPU-intensive long running processes is less than or equal

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-15 Thread Andriy Gapon
on 15/04/2010 16:23 Adam Vande More said the following: Is is possible to add a tunable to the scheduler for it's aggressiveness in switching cores? No idea; not a scheduler person. -- Andriy Gapon ___ freebsd-stable@freebsd.org mailing list

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 14/04/2010 02:21 Maho NAKATA said the following: 2. install ports/math/gotoblas (manual download required) make install Do you know how gotoblas on Linux was obtained? Was it built from source? Has it come pre-packaged? If so, can you find out details of its build configuration? Thanks!

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Pieter de Goeje
On Wednesday 14 April 2010 15:19:13 Andriy Gapon wrote: on 14/04/2010 02:21 Maho NAKATA said the following: 2. install ports/math/gotoblas (manual download required) make install Do you know how gotoblas on Linux was obtained? Was it built from source? Has it come pre-packaged? If so,

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 14/04/2010 02:21 Maho NAKATA said the following: 4. run dgemm. % ./dgemm n: 3000 time : 134.648208 or 16.910525 Mflops : 31943.419695 n: 3100 time : 148.122279 or 18.615284 Mflops : 32017.357408 n: 3200 time : 162.45 or 20.430651 Mflops : 32087.318295 n: 3300 time :

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Adam Vande More
On Wed, Apr 14, 2010 at 10:26 AM, Andriy Gapon a...@freebsd.org wrote: on 14/04/2010 02:21 Maho NAKATA said the following: 4. run dgemm. % ./dgemm n: 3000 time : 134.648208 or 16.910525 Mflops : 31943.419695 n: 3100 time : 148.122279 or 18.615284 Mflops : 32017.357408 n: 3200

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Adam Vande More
On Wed, Apr 14, 2010 at 11:34 AM, Adam Vande More amvandem...@gmail.comwrote: That's about 67% utilization, turning off HTT drops it more. HTT on the newer cores is good, not bad. Well that was completely contrarty to some tests I'd run when I first got the cpu. With HTT off: n: 3000

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 14/04/2010 19:45 Adam Vande More said the following: also if I run cpuset on the dgemm then the utilization is basically at the theoretical max for one core so at least that part is working. You can also try procstat -t pid to find out thread IDs and cpuset -t to pin the threads to the

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Adam Vande More
On Wed, Apr 14, 2010 at 11:51 AM, Andriy Gapon a...@freebsd.org wrote: on 14/04/2010 19:45 Adam Vande More said the following: also if I run cpuset on the dgemm then the utilization is basically at the theoretical max for one core so at least that part is working. You can also try

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
From: Andriy Gapon a...@freebsd.org Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 16:19:13 +0300 on 14/04/2010 02:21 Maho NAKATA said the following: 2. install ports/math/gotoblas (manual download required)

HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Maho NAKATA
Hi Andry and Adam My test again. No desktop, etc. I just run dgemm. Contrary to Adam's result, Hyper Threading makes the performance worse. all tests are done on Core i7 920 @ 2.67GHz. (TurboBoost @2.8GHz) Turbo Boost off, Hyper threading off: 82% (35GFlops)[1] Turbo Boost off, Hyper

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
opps I missed this e-mail... From: Adam Vande More amvandem...@gmail.com Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 11:45:04 -0500 On Wed, Apr 14, 2010 at 11:34 AM, Adam Vande More

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
Hi Andriy and Adam, I did also the same thing as suggested. my conclusion: on Core i7 920, 2.66GHz, TurboBoost on, HyperThreading off, My result of dgemm GotoBLAS performance was following. *summary of result 36-39GFlops 81-87% of peak performance without pinning 35-40GFlops 78-89% of peak

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Maho NAKATA
Hi Adam, From: Adam Vande More amvandem...@gmail.com Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 12:47:31 -0500 Since this is a full fledged desktop environment, 90% utilization seems pretty good. No, I

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA cha...@mac.com wrote: Hi Andry and Adam My test again. No desktop, etc. I just run dgemm. Contrary to Adam's result, Hyper Threading makes the performance worse. all tests are done on Core i7 920 @ 2.67GHz. (TurboBoost @2.8GHz) Turbo Boost off,

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Garrett Cooper
On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper yanef...@gmail.com wrote: On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA cha...@mac.com wrote: Hi Andry and Adam My test again. No desktop, etc. I just run dgemm. Contrary to Adam's result, Hyper Threading makes the performance worse. all tests

Re: HyperThreading makes worse to me (was Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Ian Smith
On Wed, 14 Apr 2010, Garrett Cooper wrote: On Wed, Apr 14, 2010 at 7:49 PM, Garrett Cooper yanef...@gmail.com wrote: On Wed, Apr 14, 2010 at 5:46 PM, Maho NAKATA cha...@mac.com wrote: Hi Andry and Adam My test again. No desktop, etc. I just run dgemm. Contrary to Adam's result,

Linux static linked ver doesn't work on FBSD (Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920)

2010-04-14 Thread Maho NAKATA
From: Pieter de Goeje pie...@degoeje.nl Subject: Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920 Date: Wed, 14 Apr 2010 16:05:18 +0200 I think the best test would be to run a statically compiled linux binary on FreeBSD. That way the compiler

Re: How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-14 Thread Andriy Gapon
on 15/04/2010 04:20 Maho NAKATA said the following: Hi Andriy and Adam, I did also the same thing as suggested. my conclusion: on Core i7 920, 2.66GHz, TurboBoost on, HyperThreading off, So HyperThreading is off. then, pinned to each core like following % procstat -t 1408 PID

How to reproduce: Re: Only 70% of theoretical peak performance on FreeBSD 8/amd64, Corei7 920

2010-04-13 Thread Maho NAKATA
Hi all, thanks for showing interest in this issue. I uploaded my test code so that you can test on your PC. Following is the instruction. 1. download my source codes. http://people.freebsd.org/~maho/dgemm/Makefile http://people.freebsd.org/~maho/dgemm/dgemm.cpp check md5. % md5 Makefile