[ 
https://issues.apache.org/jira/browse/ARROW-10058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17206060#comment-17206060
 ] 

Micah Kornfield edited comment on ARROW-10058 at 10/2/20, 9:14 AM:
-------------------------------------------------------------------

[~apitrou] [~yibo] Nice performance boost.  For [~apitrou]'s numbers, it is 
strange to me that the performance varies so significantly for 
BM_DefinitionLevelsToBitmapRepeatedAllPresent (I would only expect this to 
affect MostPresent in any meaningful way). Any thoughts on why this would be 
the case? (Better inlining, maybe?)


> [C++] Investigate performance of LevelsToBitmap without BMI2
> ------------------------------------------------------------
>
>                 Key: ARROW-10058
>                 URL: https://issues.apache.org/jira/browse/ARROW-10058
>             Project: Apache Arrow
>          Issue Type: Sub-task
>          Components: C++
>            Reporter: Antoine Pitrou
>            Priority: Major
>              Labels: pull-request-available
>         Attachments: opt-level-conv.diff
>
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> Currently, when some Parquet nested data involves some repetition levels, 
> converting the levels to bitmap goes through a slow scalar path unless the 
> BMI2 instruction set is available and efficient (the latter using the PEXT 
> instruction to process 16 levels at once).
> It may be possible to emulate PEXT for 5- or 6-bit masks by using a lookup 
> table, allowing 5-6 levels to be processed at once.
> (also, it would be good to add nested reading benchmarks for non-trivial 
> nesting; currently we only benchmark one-level struct and one-level list)
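
For illustration, here is a minimal sketch of what a lookup-table PEXT 
emulation could look like for 5-bit masks. It is not the code from the 
attached diff or from the Arrow tree; the names PextScalar, PextTable and 
PextLookup are hypothetical, and the 5-bit chunk width is just one of the 
two sizes suggested in the description. The table is indexed by 
(mask, value) and holds the already-extracted bits, so wider inputs can be 
handled 5 bits at a time.

```cpp
#include <cstdint>

// Hypothetical names throughout; a sketch, not Arrow's implementation.

// Reference bit-by-bit PEXT: gathers the bits of `value` selected by `mask`
// into the low bits of the result.
uint64_t PextScalar(uint64_t value, uint64_t mask) {
  uint64_t result = 0;
  int out_bit = 0;
  for (; mask != 0; mask &= mask - 1) {
    const uint64_t lowest = mask & (~mask + 1);  // isolate lowest set bit
    if (value & lowest) result |= uint64_t{1} << out_bit;
    ++out_bit;
  }
  return result;
}

constexpr int kLookupBits = 5;                       // assumed chunk width
constexpr uint32_t kLookupSize = 1u << kLookupBits;  // 32 masks x 32 values

struct PextTable {
  uint8_t extracted[kLookupSize][kLookupSize];  // indexed [mask][value]
  uint8_t bit_count[kLookupSize];               // set bits in each mask

  PextTable() {
    for (uint32_t mask = 0; mask < kLookupSize; ++mask) {
      bit_count[mask] = 0;
      for (uint32_t m = mask; m != 0; m &= m - 1) ++bit_count[mask];
      for (uint32_t value = 0; value < kLookupSize; ++value) {
        extracted[mask][value] = static_cast<uint8_t>(PextScalar(value, mask));
      }
    }
  }
};

const PextTable& GetPextTable() {
  static const PextTable table;  // built once on first use
  return table;
}

// Emulated PEXT: consumes `kLookupBits` bits of mask and value per step.
uint64_t PextLookup(uint64_t value, uint64_t mask) {
  const PextTable& table = GetPextTable();
  uint64_t result = 0;
  int out_shift = 0;
  while (mask != 0) {
    const uint32_t mask_chunk = static_cast<uint32_t>(mask & (kLookupSize - 1));
    const uint32_t value_chunk = static_cast<uint32_t>(value & (kLookupSize - 1));
    result |= static_cast<uint64_t>(table.extracted[mask_chunk][value_chunk])
              << out_shift;
    out_shift += table.bit_count[mask_chunk];
    mask >>= kLookupBits;
    value >>= kLookupBits;
  }
  return result;
}
```

With this layout the table takes 1 KiB plus 32 bytes of counts; a 6-bit 
variant would need 4 KiB. Whether either beats the scalar path would have 
to be measured with the benchmarks.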



