Loic,
If you're already doing a runtime checking of these bits before
calling the functions you want to optimize then you can use the gcc
Function Specific Opt feature of GCC.
http://gcc.gnu.org/onlinedocs/gcc-4.4.0/gcc/Function-Attributes.html#index-g_t_0040code_007btarget_007d-function-attribute-2259
Basically you add a target attribute to a function (specifying use SSE version).
void my_optimized_function(void* sse_vec, size_t n)
__attribute__ ((__target__ ("sse4.2")));
It's available from GCC 4.4 and on. That happens to be the GCC version
on RHEL6, Debian Squeeze, Ubuntu 10.04 LTS. Hopefully that's good
enough and you can omit the optimization on people on platforms older
than that.
Best,
- Milosz
On Tue, Mar 25, 2014 at 7:22 AM, Laurent GUERBY <[email protected]> wrote:
> On Tue, 2014-03-25 at 10:56 +0100, Loic Dachary wrote:
>> Hi Laurent,
>
> Hi Loic,
>
>> It occurs to me that all we're after is to enable SSE functions such as
>> _mm_set_epi32. We're not trying to have the binary optimized in any implicit
>> way, it is all explicit. The problem seems to be that -msse4.2 will do both
>>
>> * activate _mm_set_epi32 etc functions
>> * optimize the binary to use sse4.2 instructions
>>
>> Do you know of a compiler flag that would only
>>
>> * activate _mm_set_epi32 etc functions
>
> This is a function part of an Intel defined standard to access processor
> feature, this standard will have one or more implementation depending on
> your compiler/libc/OS. IIRC these functions are closely aligned with
> specific processor feature, if the feature isn't there in general it
> makes no sense to use them.
>
> In the particular case of _mm_set_epi32 it seems
> to be a data formating inline function:
>
> /usr/lib/gcc/x86_64-linux-gnu/4.7.2/include/emmintrin.h
> ...
> typedef long long __m128i __attribute__ ((__vector_size__ (16),
> __may_alias__));
> ...
> extern __inline __m128i __attribute__((__gnu_inline__,
> __always_inline__, __artificial__))
> _mm_set_epi32 (int __q3, int __q2, int __q1, int __q0)
> {
> return __extension__ (__m128i)(__v4si){ __q0, __q1, __q2, __q3 };
> }
>
> Functions in this include files are using GCC builtins:
>
> http://gcc.gnu.org/onlinedocs/gcc-4.8.2/gcc/X86-Built-in-Functions.html#X86-Built-in-Functions
>
> To avoid any issue I wouldn't use these functions at all
> on a non SSE machine.
>
> Sincerely,
>
> Laurent
>
>> and not
>>
>> * optimize the binary to use sse4.2 instructions
>>
>> ? It may be a RTFM question and I apologize for that. Reading
>> http://gcc.gnu.org/onlinedocs/gcc-4.8.2/gcc/i386-and-x86-64-Options.html#i386-and-x86-64-Options
>> it looks like this is more or less what --mtune=corei7-avx would do
>> (because gf-complete uses PCLMUL when available). But it feels weird to
>> specify a specific processor model where what we need is a set of features.
>>
>> Thanks for your help :-)
>>
>> On 25/03/2014 10:43, Laurent GUERBY wrote:
>> > On Mon, 2014-03-24 at 22:27 +0100, Loic Dachary wrote:
>> >>
>> >> On 23/03/2014 23:34, Laurent GUERBY wrote:
>> >>> http://gcc.gnu.org/onlinedocs/gcc-4.8.2/gcc/i386-and-x86-64-Options.html#i386-and-x86-64-Options
>> >>>
>> >>> So unless you want to run your code a very very old x86 32 bit processor
>> >>> "-msse" shouldn't be an issue. "-msse2" is similar.
>> >>
>> >> This is good to know :) Should I be worried about unintended side effects
>> >> of -msse4.2 -mssse3 -msse4.1 or -mpclmul ? These are the flags that
>> >> gf-complete are using, specifically.
>> >
>> > Hi,
>> >
>> > SSE4.2 will be available only in more recent
>> > processors as documented on the page above.
>> >
>> > If your library already is dynamically checking for processor
>> > feature I would advise to be conservative in your
>> > -m flags, ie using what debian would use for maximum
>> > x86 portability.
>> >
>> > Sincerely,
>> >
>> > Laurent
>> >
>>
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
--
Milosz Tanski
CTO
10 East 53rd Street, 37th floor
New York, NY 10022
p: 646-253-9055
e: [email protected]
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html