Hi Steve,
I noticed the addition of cpuid.h, which is not found on Mac OS X. Is there
a more portable alternative for detecting SSE support level? Can't the
cpuinfo instruction be used for this?
On Tue, Jun 14, 2011 at 3:15 AM, Martin Fleisz
<martin.fle...@thinstuff.com>wrote:
> Awesome Steve - congrats! :)
> > SUCCESS!
> >
> > I figured out the major problem with my SSE code. Apparently you have
> > to pay special attention to how the CPU access memory and provide the
> > proper cache hints so it doesn't starve itself and make you wait for
> > slow RAM accesses.
> >
> > This simple commit has made a huge difference on RemoteFX performance
> > (more optimizations to come):
> >
> >
> https://github.com/FreeRDP/FreeRDP/commit/220008fad7dc7eabc81e2e03b81604145d369ad4
> >
> > before change:
> >
> > | rfx_decode_YCbCr_to_RGB_SSE2 | 15671 | 4.740000 |
> 0.000302 |
> >
> > after change:
> >
> > | rfx_decode_YCbCr_to_RGB_SSE2 | 24945 | 0.460000 |
> 0.000018 |
> >
> >
> > That is a> 18x improvement over the previous version, and the
> > difference is visually noticeable.
> > The SSE optimized method is now about 5-6x faster than the non-SSE
> method.
> >
> > Both of these results came from my Intel Atom D510 board.
> >
> > Thanks,
> > Steve
> >
> >
> > On 6/10/2011 1:09 AM, S. Erisman wrote:
> >> Hey Vic,
> >>
> >> On 6/10/2011 12:32 AM, Vic Lee wrote:
> >>> Hi Steve,
> >>>
> >>> Yes both is faster, but the SSE version is still quite slower than the
> >>> original one. Here is my testing.
> >>>
> >>> Before pulling:
> >>> | rfx_decode_YCbCr_to_RGB_SSE2 | 2123 | 1.750000 | 0.000824 |
> >>> | rfx_decode_YCbCr_to_RGB | 2098 | 0.260000 | 0.000124 |
> >>>
> >>> After pulling your commits:
> >>> | rfx_decode_YCbCr_to_RGB_SSE2 | 2049 | 0.690000 | 0.000337 |
> >>> | rfx_decode_YCbCr_to_RGB | 2111 | 0.240000 | 0.000114 |
> >>>
> >>> Oh by the way, the profiler is cool. :)
> >>>
> >>> Vic
> >
> >
> ------------------------------------------------------------------------------
> > EditLive Enterprise is the world's most technically advanced content
> > authoring tool. Experience the power of Track Changes, Inline Image
> > Editing and ensure content is compliant with Accessibility Checking.
> > http://p.sf.net/sfu/ephox-dev2dev
> > _______________________________________________
> > Freerdp-devel mailing list
> > Freerdp-devel@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/freerdp-devel
>
>
>
> ------------------------------------------------------------------------------
> EditLive Enterprise is the world's most technically advanced content
> authoring tool. Experience the power of Track Changes, Inline Image
> Editing and ensure content is compliant with Accessibility Checking.
> http://p.sf.net/sfu/ephox-dev2dev
> _______________________________________________
> Freerdp-devel mailing list
> Freerdp-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/freerdp-devel
>
------------------------------------------------------------------------------
EditLive Enterprise is the world's most technically advanced content
authoring tool. Experience the power of Track Changes, Inline Image
Editing and ensure content is compliant with Accessibility Checking.
http://p.sf.net/sfu/ephox-dev2dev
_______________________________________________
Freerdp-devel mailing list
Freerdp-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freerdp-devel