Noel Carboni wrote:
> By the way, as an exercise to reinforce the above, I re-coded the
> LittleCMS floating point trilinear interpolation algorithm using SSE2
> intrinsics. It ended up delivering the same performance as the C-coded
> version. Why not better? Because the table-based design
> The ideal situation is if the C code is written in such a way that
> modern optimizing compilers do the right
> thing by default and produce good code for any CPU. This should mean
> that the compilers automatically
> produce SSE code where they should if it is enabled.
Yes, a good thought.
Many time ago, the code had some inline assembly code. I removed every trace of assembly about 15 years ago and take the requirement of pure C99 code forever. Was a good idea. Worked great and survived aging. I think optimizations have to be done by arranging C code to help compiler, assembly
Hi Bob,
I totally agree with you. Any architecture dependent code should be always
optional. The vector library I used came in two versions, a generic one and a
SSE version. The usage of the SSE version, IMHO, must be explicitly indicated
by a flag during the build.
Best Regards,
Lorenzo
> On
On Sun, 30 Jul 2017, Lorenzo Ridolfi wrote:
There’s a lot of Vector Libraries that wrap the usage of SSE
instructions. I used one of that libraries in a project long time a
go and I could not find it. But the performance improvement was
great.
As a lcms user, I would definitely prefer if
Thanks!
Works great
Regards
Marti
On 7/26/2017 7:05 PM, Aaron Boxer wrote:
Hello Marti,
I noticed you don't have a .gitignore file in your repo.
I've attached one that may be useful for the project:
it filters out binaries and other artifacts produced while building.
Cheers,
Aaron
Thanks Noel,
I'm reviewing the changes an will return to you in few days.
Regards
Marti
On 7/28/2017 11:13 PM, Noel Carboni wrote:
Hi Marti,
I've completed my changes to the LittleCMS sources. I've reviewed and
tested them with the testbed.