On Wed, 25 Aug 2010 22:06:00 +0100 Robin Cornelius <robin.cornel...@gmail.com> wrote:
> I'm not aware of anyone publishing results for such a test, but if you
> have the time it would be interesting reading. Some things to keep in
> mind. OpenJpeg has patches floating around on its ML against 1.3 that
> reports have claimed up to 40% speed increase in places due to
> unrolling the inner loops so finding them and testing would be good.

I have only a little data, all of it from X-ray medical images (400-500 MB per image), and I'm the hw & systems monkey, not the coding one.

As of today I can supply only decoding times for a JasPer-based decoder/viewer on a *nix platform, measured from uncached (not previously opened, no memory cache) to video render time. The test snippet is basically a timer + a reading routine from disk + the decoder.

Test machine:
processor: q6...@1.6 GHz
memory: DDR3 4 GB (4x1 GB)
graphics: GTX270 on PCI-x 2.0 4x (single, no SLI)
disk: fast SATA2
compiler: gcc 4.4.5, LibC6 2.11.2
OS: Debian "stable" with GCC, development files and libs backported from SID; kernel 2.6.32-5-amd64 compiled for 32 bit (this triggers the bug where the SL viewer and Snowglobe detect a 64-bit system because they use uname wrongly, but it is a 32-bit system with extended registers enabled)

JasPer 1.900 (from the site, not from the Debian repos)
compiled w/o optimization: 19.31 msec
compiled w SSE2: 11.60 msec
compiled w SSSE3: 9.11 msec
compiled w SSE2+OpenCL (195.36.31 libs from nvidia, multithread): 4.45 msec [1]

OpenJPEG 1.3 (from Google Code)
compiled w/o optimization: 26.12 msec
compiled w SSE2: 15.33 msec
compiled w SSSE3: 13.41 msec
compiled w SSE2+OpenCL (195.36.31 libs from nvidia, multithread): 6.56 msec [1]

OpenJPEG 2.0_alpha (from Google Code)
compiled w/o optimization: 22.09 msec
compiled w SSE2: crashed (maybe our fault, still debugging)
compiled w SSSE3: crashed (maybe our fault, still debugging)
compiled w SSE2+OpenCL (195.36.31 libs from nvidia, multithread): crashed (maybe our fault, still debugging)

I cannot supply either the code snippet or the image (it is a customer's knee....)
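For anyone who wants to collect comparable numbers, a harness of the same shape (a timer + a reading routine from disk + a decoder) could be sketched roughly as below. This is not the snippet used for the timings in this post: it is Python rather than C, `time_decode` and its `decode` parameter are hypothetical names, and the decoder is passed in as a plain callable instead of binding to JasPer or OpenJPEG. It also does nothing to flush the OS page cache, so a truly uncached measurement would need a file that has not been opened before, or a cache drop between runs.

```python
import time
from pathlib import Path
from typing import Callable, List


def time_decode(path: str, decode: Callable[[bytes], object], runs: int = 1) -> List[float]:
    """Return the wall-clock time in milliseconds of each read + decode run."""
    timings_ms = []
    for _ in range(runs):
        start = time.perf_counter()       # timer starts before the disk read
        data = Path(path).read_bytes()    # reading routine from disk
        decode(data)                      # hypothetical decoder callable
        timings_ms.append((time.perf_counter() - start) * 1000.0)
    return timings_ms
```

Calling `time_decode("image.jp2", my_decoder, runs=5)` with a real decoder wrapper (both names are placeholders) would return five per-run times in milliseconds. Only the first run can be close to uncached, since later runs will most likely be served from the OS cache.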
Too few numbers, just enough to give a hint.

[1] OpenCL, in our "poor" code, caused some trouble in the OS cache and made the system unstable after 4 h of running the snippet in a loop: IRQ27 started a crazy dance across all cores of the machine, and disk transfers were overloaded because of too much swapping.

_______________________________________________
Policies and (un)subscribe information available here:
http://wiki.secondlife.com/wiki/OpenSource-Dev
Please read the policies before posting to keep unmoderated posting privileges