Hi Steve, Thanks, that's even cooler. :)
Vic On 06/11/2011 11:54 AM, S. Erisman wrote: > SUCCESS! > > I figured out the major problem with my SSE code. Apparently you have > to pay special attention to how the CPU access memory and provide the > proper cache hints so it doesn't starve itself and make you wait for > slow RAM accesses. > > This simple commit has made a huge difference on RemoteFX performance > (more optimizations to come): > > https://github.com/FreeRDP/FreeRDP/commit/220008fad7dc7eabc81e2e03b81604145d369ad4 > > before change: > > | rfx_decode_YCbCr_to_RGB_SSE2 | 15671 | 4.740000 | 0.000302 | > > after change: > > | rfx_decode_YCbCr_to_RGB_SSE2 | 24945 | 0.460000 | 0.000018 | > > > That is a> 18x improvement over the previous version, and the > difference is visually noticeable. > The SSE optimized method is now about 5-6x faster than the non-SSE method. > > Both of these results came from my Intel Atom D510 board. > > Thanks, > Steve ------------------------------------------------------------------------------ EditLive Enterprise is the world's most technically advanced content authoring tool. Experience the power of Track Changes, Inline Image Editing and ensure content is compliant with Accessibility Checking. http://p.sf.net/sfu/ephox-dev2dev _______________________________________________ Freerdp-devel mailing list Freerdp-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/freerdp-devel