[jira] [Commented] (PARQUET-1841) [C++] Experiment to see if using SIMD shuffle operations for DecodeSpaced improves performance

Frank Du (Jira) Mon, 20 Apr 2020 01:12:22 -0700


    [ 
https://issues.apache.org/jira/browse/PARQUET-1841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17087475#comment-17087475
 ]


Frank Du commented on PARQUET-1841:
-----------------------------------

I wrote a draft implementation for AVX512 int32_t/int64_t path using 
mask_expand_32/mask_expand_64, seems it's working as it pass all exited test 
units. But I don't find one bench-marking which based on the decode spaced API, 
can you point me? And I will work on the SSE part then.

> [C++] Experiment to see if using SIMD shuffle operations for DecodeSpaced 
> improves performance
> ----------------------------------------------------------------------------------------------
>
>                 Key: PARQUET-1841
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1841
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-cpp
>            Reporter: Micah Kornfield
>            Assignee: Micah Kornfield
>            Priority: Major
>         Attachments: image-2020-04-14-15-01-48-222.png
>
>
> Followup from PARQUET-1840 for current benchmarks it seems that doing 
> removing the memset somehow either has no impact or is slightly worse.  We 
> should investigate using SIMD operations to speed up spacing. 
>  
> As part of this we can see if moving the memset to only cover uninitialized 
> values after moving all required values provides any speedup.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (PARQUET-1841) [C++] Experiment to see if using SIMD shuffle operations for DecodeSpaced improves performance

Reply via email to