alamb opened a new pull request, #12623:
URL: https://github.com/apache/datafusion/pull/12623

   ## Which issue does this PR close?
   
   Follow on to https://github.com/apache/datafusion/pull/12269 from @jayzhan211
   
   ## Rationale for this change
   
   The initial implementation of the Column-wise group checking code uses 
`Vec<bool>` to track null/validity but Arrow has a variety of optimized 
structures for doing so (`BooleanBufferBuilder`, etc)
   
   I would like to get this code into great shape before we fill out support 
for other data types (like StringView and BinaryView)
   
   I also feel like  `PrimitiveGroupValueBuilder` could largely be a wrapper 
around PrimitiveBuilder in arrow which would avoid the code duplication as well 
as take advantage of Arrow's optimizations
   
   Let's use BooleanBufferBuilder to simplify the null handling in 
PrimitiveGroupValueBuilder. I 
   
   
   ## What changes are included in this PR?
   Rewrite `PrimitiveGroupValueBuilder`
   
   ## Are these changes tested?
   
   By existing CI
   
   ## Are there any user-facing changes?
   No
   
   Performance results (RUNNING)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to