OK here are some more numbers for the latest patch in gerrit. I will follow 
Ruperts format as it's the most clear.

MeanSquares:
Threads 3.2             4.2             4.2+patch       patch percentage of 3.20
1               0.3615  0.8214  0.4071          113%
2               0.3222  0.6055  0.3365          104%
4               0.3249  0.4448  0.3293          101%
8               0.1703  0.3093  0.1943          114%
12              0.1457  0.2031  0.1322          91%
24*             0.1062  0.1332  0.0949          89%

MutualInformation:
Threads 3.2             4.2             4.2+patch       patch percentage of 3.20
1               0.1467  0.6103  0.3353          228%
2               0.1036  0.3747  0.1774          171%
4               0.0847  0.2175  0.1262          149%
8               0.0655  0.1291  0.0681          104%
12              0.0551  0.1035  0.0486          88%
24*             0.0460  0.0829  0.0526          114%

*Hyperthreading

The observation to be made about MutualInformation is that while 4.2 it's still 
slower with one thread, there is a significant increase is speed-up due to 
threads now.

Brad

On Jul 26, 2012, at 2:02 PM, Rupert Brooks wrote:

> Ok that makes way more sense, sorry i didnt understand first time around.
> 
> Just so i've got it right
> Threads                    3.20                              4.2+patch        
>     Time 4.2 as percent of 3.20
> 1                         0.347567                            0.383342        
>         110.293%
> 2                         0.300869                            0.335328        
>         111.453
> 4                         0.348677                            0.315688        
>          90.5388
> 8                         0.182681                            0.192132        
>         105.173
> 
> So theres about 10% more time with ITK 4.2 used in the 1 and 2 thread case.  
> That is definitely better than what we were getting.  Cool.
> 
> Rupert
> 
> --------------------------------------------------------------
> Rupert Brooks
> [email protected]
> 
> 
> 
> On Thu, Jul 26, 2012 at 1:13 PM, Bradley Lowekamp <[email protected]> 
> wrote:
> Sorry for not being clear! I got too excited by finding the solution to the 
> performance issue with ITKv3 registration in ITKv4.
> 
> This first is vanilla 3.20, the second is 4.20+ the gerrit patch. The third 
> is the gerrit patch with the pre-malloc of the Jacobin outside the threaded 
> section! Vanilla 4.2 is ~2x 3.20 for this test on my system too.
> 
> Summary for the MeansSquares metric in your test:
> 
> 3.20:  1X
> 4.2: 2+X
> 4.2+gerrit patch: 1X
> 4.2+gerrit patch + single-threaded preallocation of jacobian: 1.5X

========================================================
Bradley Lowekamp  
Medical Science and Computing for
Office of High Performance Computing and Communications
National Library of Medicine 
[email protected]



_______________________________________________
Powered by www.kitware.com

Visit other Kitware open-source projects at
http://www.kitware.com/opensource/opensource.html

Kitware offers ITK Training Courses, for more information visit:
http://kitware.com/products/protraining.php

Please keep messages on-topic and check the ITK FAQ at:
http://www.itk.org/Wiki/ITK_FAQ

Follow this link to subscribe/unsubscribe:
http://www.itk.org/mailman/listinfo/insight-developers

Reply via email to