alamb commented on PR #7537:
URL: https://github.com/apache/arrow-rs/pull/7537#issuecomment-2906794445

   > And the default is selector because i use it to compute 
avg_size_of_selector.
   
   Make sense -- thank you
   
   I found 
[`SlicesIterator`](https://docs.rs/arrow/latest/arrow/compute/struct.SlicesIterator.html)
 when looking at the Bitmap --> RowSelection code the other day. I think that 
could be used to determine the "average run length" so we could continue to use 
`skip/select` for large contiguous runs but switch to bitmap when there are 
smaller
   
   The other thing I couldn't easily work out was if there was any way to 
switch from `select/skip` *within* a output batch, or if the plan needs to be 
either `RowSelector` or `BitMap` for each output batch
   
   Or maybe we could just add a third type of `ReadPlan`, namely 
`ReadPlan::Bitmap` 🤔  


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to