alamb opened a new pull request, #7521:
URL: https://github.com/apache/arrow-rs/pull/7521

   # Which issue does this PR close?
   
   - part of https://github.com/apache/arrow-rs/issues/7456
   
   # Rationale for this change
   
   I am trying to avoid potentially decoding arrays twice 
   
   Applying `ArrowPredicate` is sometimes slower than filtering afterwards. 
Part of the reason for this is that filter columns are decoded twice. 
   
   I want a way to inject a pre-calculated filter result into the record batch 
decoding machinery and one way I found that works well is to provide an 
`ArrayBuilder` instance that uses the cached result. I found it convenient to 
have a struct on which to hang the cache rather than a bunch of free functions
    
   You can see how this is used here:
   - https://github.com/apache/arrow-rs/pull/7513
   
   # What changes are included in this PR?
   * Factor the internal functions that create `ArrayBuilder`s into a new struct
   
   # Are there any user-facing changes?
   No, this code is entirely internal (e.g. this code is not public: 
https://docs.rs/parquet/latest/parquet/?search=array_builder)
   
   <!--
   If there are user-facing changes then we may require documentation to be 
updated before approving the PR.
   -->
   
   <!---
   If there are any breaking changes to public APIs, please call them out.
   -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to