OK here are some more numbers for the latest patch in gerrit. I will follow Ruperts format as it's the most clear.
MeanSquares: Threads 3.2 4.2 4.2+patch patch percentage of 3.20 1 0.3615 0.8214 0.4071 113% 2 0.3222 0.6055 0.3365 104% 4 0.3249 0.4448 0.3293 101% 8 0.1703 0.3093 0.1943 114% 12 0.1457 0.2031 0.1322 91% 24* 0.1062 0.1332 0.0949 89% MutualInformation: Threads 3.2 4.2 4.2+patch patch percentage of 3.20 1 0.1467 0.6103 0.3353 228% 2 0.1036 0.3747 0.1774 171% 4 0.0847 0.2175 0.1262 149% 8 0.0655 0.1291 0.0681 104% 12 0.0551 0.1035 0.0486 88% 24* 0.0460 0.0829 0.0526 114% *Hyperthreading The observation to be made about MutualInformation is that while 4.2 it's still slower with one thread, there is a significant increase is speed-up due to threads now. Brad On Jul 26, 2012, at 2:02 PM, Rupert Brooks wrote: > Ok that makes way more sense, sorry i didnt understand first time around. > > Just so i've got it right > Threads 3.20 4.2+patch > Time 4.2 as percent of 3.20 > 1 0.347567 0.383342 > 110.293% > 2 0.300869 0.335328 > 111.453 > 4 0.348677 0.315688 > 90.5388 > 8 0.182681 0.192132 > 105.173 > > So theres about 10% more time with ITK 4.2 used in the 1 and 2 thread case. > That is definitely better than what we were getting. Cool. > > Rupert > > -------------------------------------------------------------- > Rupert Brooks > [email protected] > > > > On Thu, Jul 26, 2012 at 1:13 PM, Bradley Lowekamp <[email protected]> > wrote: > Sorry for not being clear! I got too excited by finding the solution to the > performance issue with ITKv3 registration in ITKv4. > > This first is vanilla 3.20, the second is 4.20+ the gerrit patch. The third > is the gerrit patch with the pre-malloc of the Jacobin outside the threaded > section! Vanilla 4.2 is ~2x 3.20 for this test on my system too. > > Summary for the MeansSquares metric in your test: > > 3.20: 1X > 4.2: 2+X > 4.2+gerrit patch: 1X > 4.2+gerrit patch + single-threaded preallocation of jacobian: 1.5X ======================================================== Bradley Lowekamp Medical Science and Computing for Office of High Performance Computing and Communications National Library of Medicine [email protected]
_______________________________________________ Powered by www.kitware.com Visit other Kitware open-source projects at http://www.kitware.com/opensource/opensource.html Kitware offers ITK Training Courses, for more information visit: http://kitware.com/products/protraining.php Please keep messages on-topic and check the ITK FAQ at: http://www.itk.org/Wiki/ITK_FAQ Follow this link to subscribe/unsubscribe: http://www.itk.org/mailman/listinfo/insight-developers
