Re: [I] Potentially improve join performance by implementing a version of the take kernel that accepts an iterator of indices [datafusion]

via GitHub Mon, 02 Dec 2024 19:25:46 -0800


Rachelint commented on issue #13620:
URL: https://github.com/apache/datafusion/issues/13620#issuecomment-2513468121


   > I agree with @Dandandan -- and specifically it isn't clear to me that an 
iterator based approach will be faster than using the `take` kernel -- I 
suspect the bottleneck will be the copy that is happening as part of `take` not 
the actual managment of the indexes
   > 
   > If the issue is that the indices themselves take up too much space, then 
perhaps we can do some more effort to incrementally generate them and reuse the 
arrays, as suggested by @Dandandan
   > 
   > Here is an example in grouping where we reuse indexes:
   > 
   > 
https://github.com/apache/datafusion/blob/8773846859b0390ceb782602efd403e2487d8552/datafusion/physical-plan/src/aggregates/row_hash.rs#L402-L404
   
   Agree with take may be the bottleneck, I try `take + eq` appoach in 
`vectorized compare of primitive` in 
https://github.com/Rachelint/arrow-datafusion/tree/optimize-vectorized-operations-bak
   
   Even try best to reuse the buffer, `take` still cost much cpu in flamegraph 
in the new added benchmark also in this pr.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [I] Potentially improve join performance by implementing a version of the take kernel that accepts an iterator of indices [datafusion]

Reply via email to