alamb opened a new pull request, #9056:
URL: https://github.com/apache/arrow-rs/pull/9056

   # Which issue does this PR close?
   
   <!--
   We generally require a GitHub issue to be filed for all bug fixes and 
enhancements and this helps us generate change logs for our releases. You can 
link an issue to this PR using the GitHub syntax.
   -->
   
   - related to https://github.com/apache/datafusion/pull/19477
   
   # Rationale for this change
   
   While working on https://github.com/apache/datafusion/pull/19477,  and 
profiling ClickBench q7, I noticed that the RowSelectors was being cloned to 
resolve the strategy -- for a large number of selections this is expensive and 
shows up in the traces
   
   <img width="1724" height="1074" alt="Screenshot 2025-12-28 at 4 49 49 PM" 
src="https://github.com/user-attachments/assets/72c6fd22-9377-48ef-ba80-6bc03b177cf7";
 />
   
   
   ```shell
   samply record -- ./datafusion-cli-alamb_enable_pushdown  -f q.sql  > 
/dev/null  2>&
   ```
   
   We should change the code to avoid cloning the RowSelectors when resolving 
the strategy.
   
   # Changes
   
   Don't clone / allocate while resolving the strategy.
   
   I don't expect this to have a massive impact, but it did show up in the 
profile
   
   FYI @hhhizzz -- perhaps you could review this PR
   
   
   # Are these changes tested?
   
   Yes by CI
   
   # Are there any user-facing changes?
   small performance improvement


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to