" On Thursday 18 September 2008 13:19, James Killian wrote: > When reviewing this data the average framerate is about the same (The > average is the middle column). We can see improvement of performance from > build 8748. I do not really remember what your numbers meant exactly. I would read in that numbers that the past patch alone gave you 10-20% better framerates, true? "
Ok in short the new optimization you submitted now (i.e. without SSE) is 10-20% better than what we had (i.e. your initial optimization with my SSE optimization). Back then the Matrix Mult optimization had more of a significant impact. " But, what is your conclusion? Are you still working on cpu specific handtuning? What did the comments regarding a bit more high level optimization opportunities for your application bring? Did you have time to try that out? " My conclusion is that your new submission is good enough to not need to use SSE for the Matrix Mult. I have not been working on CPU specific handtuning, since the primary bottleneck for our code is due to thread synchronization. Currently I am learning some new thread synchronization strategies (e.g. APC calls during context switch)... once I have grasped how these work I may want to get together with the author of the atomic code in OpenThreads. In regards to performance: One thing I propose is that OSG team dedicate some research into new parallelization strategies. See http://www.threadingbuildingblocks.org/ as this covers some of the concepts. We have been able to make templated classes that are a bit more friendlier than what is written in TBB (Threaded Building Blocks). I feel that as time progresses it will be common for people to have multiple processors in their machines, so becoming proficient in threaded code will be a must, and with these new helper classes, it should make writing threaded code simple, and error proof (e.g. eliminating the need to use critical sections). ----- Original Message ----- From: "Mathias Fröhlich" <[EMAIL PROTECTED]> To: "OpenSceneGraph Submissions" <[email protected]> Sent: Friday, September 19, 2008 6:40 AM Subject: Re: [osg-submissions] Platform independentmatrixmultiplicationoptimization Hi James, On Thursday 18 September 2008 13:19, James Killian wrote: > When reviewing this data the average framerate is about the same (The > average is the middle column). We can see improvement of performance from > build 8748. I do not really remember what your numbers meant exactly. I would read in that numbers that the past patch alone gave you 10-20% better framerates, true? But, what is your conclusion? Are you still working on cpu specific handtuning? What did the comments regarding a bit more high level optimization opportunities for your application bring? Did you have time to try that out? Greetings Mathias -- Dr. Mathias Fröhlich, science + computing ag, Software Solutions Hagellocher Weg 71-75, D-72070 Tuebingen, Germany Phone: +49 7071 9457-268, Fax: +49 7071 9457-511 -- Vorstand/Board of Management: Dr. Bernd Finkbeiner, Dr. Florian Geyer, Dr. Roland Niemeier, Dr. Arno Steitz, Dr. Ingrid Zech Vorsitzender des Aufsichtsrats/ Chairman of the Supervisory Board: Prof. Dr. Hanns Ruder Sitz/Registered Office: Tuebingen Registergericht/Registration Court: Stuttgart Registernummer/Commercial Register No.: HRB 382196 _______________________________________________ osg-submissions mailing list [email protected] http://lists.openscenegraph.org/listinfo.cgi/osg-submissions-openscenegraph.org _______________________________________________ osg-submissions mailing list [email protected] http://lists.openscenegraph.org/listinfo.cgi/osg-submissions-openscenegraph.org
