Hi Yongseok,

>-----Original Message-----
>From: Yongseok Koh <ys...@mellanox.com>
>Sent: Wednesday, April 10, 2019 11:08 PM
>To: Pavan Nikhilesh Bhagavatula <pbhagavat...@marvell.com>
>Cc: Thomas Monjalon <tho...@monjalon.net>; dev <dev@dpdk.org>; Jerin
>Jacob Kollanukkaran <jer...@marvell.com>; jerinjac...@gmail.com
>Subject: [EXT] Re: [dpdk-dev] [PATCH v8 2/4] meson: add infra to support
>machine specific flags
>
>External Email
>
>----------------------------------------------------------------------
>
>> On Apr 10, 2019, at 9:13 AM, jerinjac...@gmail.com wrote:
>>
>> From: Pavan Nikhilesh <pbhagavat...@marvell.com>
>>
>> Currently, RTE_* flags are set based on the implementer ID but there
>> might be some micro arch specific differences from the same vendor eg.
>> CACHE_LINESIZE. Add support to set micro arch specific flags.
>>
>> Signed-off-by: Pavan Nikhilesh <pbhagavat...@marvell.com>
>> Signed-off-by: Jerin Jacob <jer...@marvell.com>
>> ---
>> config/arm/meson.build | 56 ++++++++++++++++++++++++------------------
>> 1 file changed, 32 insertions(+), 24 deletions(-)
>>
>> diff --git a/config/arm/meson.build b/config/arm/meson.build index
>> 170a4981a..24bce2b39 100644
>> --- a/config/arm/meson.build
>> +++ b/config/arm/meson.build
>> @@ -7,25 +7,6 @@ march_opt = '-march=@0@'.format(machine)
>>
>> arm_force_native_march = false
>>
>> -machine_args_generic = [
>> -    ['default', ['-march=armv8-a+crc+crypto']],
>> -    ['native', ['-march=native']],
>> -    ['0xd03', ['-mcpu=cortex-a53']],
>> -    ['0xd04', ['-mcpu=cortex-a35']],
>> -    ['0xd05', ['-mcpu=cortex-a55']],
>> -    ['0xd07', ['-mcpu=cortex-a57']],
>> -    ['0xd08', ['-mcpu=cortex-a72']],
>> -    ['0xd09', ['-mcpu=cortex-a73']],
>> -    ['0xd0a', ['-mcpu=cortex-a75']],
>> -    ['0xd0b', ['-mcpu=cortex-a76']],
>> -]
>> -machine_args_cavium = [
>> -    ['default', ['-march=armv8-a+crc+crypto','-mcpu=thunderx']],
>> -    ['native', ['-march=native']],
>> -    ['0xa1', ['-mcpu=thunderxt88']],
>> -    ['0xa2', ['-mcpu=thunderxt81']],
>> -    ['0xa3', ['-mcpu=thunderxt83']]]
>> -
>> flags_common_default = [
>>      # Accelarate rte_memcpy. Be sure to run unit test
>(memcpy_perf_autotest)
>>      # to determine the best threshold in code. Refer to notes in source
>> file @@ -52,12 +33,10 @@ flags_generic = [
>>      ['RTE_USE_C11_MEM_MODEL', true],
>>      ['RTE_CACHE_LINE_SIZE', 128]]
>> flags_cavium = [
>> -    ['RTE_MACHINE', '"thunderx"'],
>>      ['RTE_CACHE_LINE_SIZE', 128],
>>      ['RTE_MAX_NUMA_NODES', 2],
>>      ['RTE_MAX_LCORE', 96],
>> -    ['RTE_MAX_VFIO_GROUPS', 128],
>> -    ['RTE_USE_C11_MEM_MODEL', false]]
>> +    ['RTE_MAX_VFIO_GROUPS', 128]]
>> flags_dpaa = [
>>      ['RTE_MACHINE', '"dpaa"'],
>>      ['RTE_USE_C11_MEM_MODEL', true],
>> @@ -71,6 +50,27 @@ flags_dpaa2 = [
>>      ['RTE_MAX_NUMA_NODES', 1],
>>      ['RTE_MAX_LCORE', 16],
>>      ['RTE_LIBRTE_DPAA2_USE_PHYS_IOVA', false]]
>> +flags_default_extra = []
>> +flags_thunderx_extra = [
>> +    ['RTE_MACHINE', '"thunderx"'],
>> +    ['RTE_USE_C11_MEM_MODEL', false]]
>> +
>> +machine_args_generic = [
>> +    ['default', ['-march=armv8-a+crc+crypto']],
>> +    ['native', ['-march=native']],
>> +    ['0xd03', ['-mcpu=cortex-a53']],
>> +    ['0xd04', ['-mcpu=cortex-a35']],
>> +    ['0xd07', ['-mcpu=cortex-a57']],
>> +    ['0xd08', ['-mcpu=cortex-a72']],
>> +    ['0xd09', ['-mcpu=cortex-a73']],
>> +    ['0xd0a', ['-mcpu=cortex-a75']]]
>> +
>> +machine_args_cavium = [
>> +    ['default', ['-march=armv8-a+crc+crypto','-mcpu=thunderx']],
>> +    ['native', ['-march=native']],
>> +    ['0xa1', ['-mcpu=thunderxt88'], flags_thunderx_extra],
>> +    ['0xa2', ['-mcpu=thunderxt81'], flags_thunderx_extra],
>> +    ['0xa3', ['-mcpu=thunderxt83'], flags_thunderx_extra]]
>>
>> ## Arm implementer ID (ARM DDI 0487C.a, Section G7.2.106, Page
>> G7-5321) impl_generic = ['Generic armv8', flags_generic,
>> machine_args_generic] @@ -157,8 +157,16 @@ else
>>      endif
>>      foreach marg: machine[2]
>>              if marg[0] == impl_pn
>> -                    foreach f: marg[1]
>> -                            machine_args += f
>> +                    foreach flag: marg[1]
>> +                            if cc.has_argument(flag)
>> +                                    machine_args += flag
>> +                            endif
>> +                    endforeach
>> +                    # Apply any extra machine specific flags.
>> +                    foreach flag: marg.get(2, flags_default_extra)
>> +                            if flag.length() > 0
>> +                                    dpdk_conf.set(flag[0], flag[1])
>> +                            endif
>
>Let me continue the discussion from v7 here.
>Seems I wan't clear enough.
>
>Let me take an example. If the host is thunderx2 (0xaf) and compiler is older
>than v7, flags_thunderx2_extra isn't set. This means, for example,
>RTE_CACHE_LINE_SIZE will still be 128. Is that what you want?
>RTE_CACHE_LINE_SIZE has nothing to do with compiler support and you might
>want to set it regardless of gcc version. You could skip setting -mcpu with 
>setting
>the extra flags.
>

Thanks for the detailed explanation.
I think since we have the check to skip mcpu flag when cc doesn't support it 
(cc.has_argument(flag))
It will be safe to remove 
`
        # Primary part number based mcpu flags are supported
        # for gcc versions > 7
        if cc.version().version_compare(
                        '<7.0') or cmd_output.length() == 0
                if not meson.is_cross_build() and arm_force_native_march == true
                        impl_pn = 'native'
                else
                        impl_pn = 'default'
                endif
        endif
`

The command output check can also be removed as it is handled when calling the 
command script itself.

Thoughts?

PS. I think the safest way to set CACHELINE_SIZE is to read the cache type 
register[1] but sadly only few latest kernels 
have the support through sysfs 
(/sys/devices/system/cpu/cpu0/cache/index0/coherency_line_size) 

>Thoughts?
>
>Thanks,
>Yongseok
>
>
>

Regards,
Pavan.

Reply via email to