Dandandan opened a new issue, #12131:
URL: https://github.com/apache/datafusion/issues/12131

   ### Is your feature request related to a problem or challenge?
   
   equal_rows_arr compares pairs of 2 arrays with indices for equality. 
   
   Currently this is done in the following way
   
   * `take` the values for the indices for the first pair
   * comparing the arrays using `eq` or `not_distinct`
   * doing the same for the next pairs and `and`ing the results
   * Filtering the indices based on the resulting boolean array
   
   ### Describe the solution you'd like
   
   We could optimize this in some ways:
   
   * writing a kernel that doesn't use `take` (i.e. copy the array) but 
compares arrays based on the indices.
   * writing results to a single booleanbuffer rather than creating a new one 
every time
   * removing indices from the list (e.g. using `Vec::retain`) not matching 
rather than creating a boolean array for a filter 
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to