Dandandan opened a new issue, #12131: URL: https://github.com/apache/datafusion/issues/12131
### Is your feature request related to a problem or challenge?

`equal_rows_arr` compares pairs of arrays for equality at the given indices. Currently this is done in the following way:

* `take` the values at the indices for the first pair of arrays
* compare the resulting arrays using `eq` or `not_distinct`
* do the same for the remaining pairs, `and`ing the results together
* filter the indices based on the resulting boolean array

### Describe the solution you'd like

We could optimize this in several ways:

* write a kernel that doesn't use `take` (i.e. doesn't copy the arrays) but compares the arrays directly at the given indices
* write results to a single `BooleanBuffer` rather than creating a new one every time
* remove non-matching indices from the list (e.g. using `Vec::retain`) rather than creating a boolean array for a filter

### Describe alternatives you've considered

_No response_

### Additional context

_No response_
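
To make the proposed direction concrete, here is a minimal sketch of comparing rows directly at the given indices and dropping non-matching index pairs with `Vec::retain`. It is not the existing `equal_rows_arr` code and does not use the arrow-rs API: `Column`, `values_match`, and `retain_equal_rows` are hypothetical names, columns are modelled as plain `Vec<Option<i64>>` with `None` standing in for a null slot, and the index pairs are stored as tuples purely for illustration.

```rust
/// Hypothetical stand-in for a nullable Arrow array: `None` models a null slot.
type Column = Vec<Option<i64>>;

/// Compares two slots. With `null_equals_null` set, two nulls match
/// (`not_distinct`-like); otherwise any null makes the comparison fail,
/// which is how a null result from `eq` ends up being filtered out.
fn values_match(left: Option<i64>, right: Option<i64>, null_equals_null: bool) -> bool {
    match (left, right) {
        (Some(a), Some(b)) => a == b,
        (None, None) => null_equals_null,
        _ => false,
    }
}

/// Keeps only the `(left_row, right_row)` pairs whose values are equal in
/// every column pair. No intermediate arrays are materialised: each column
/// is read in place at the given index, and the index list is pruned in
/// place instead of building a boolean mask and filtering afterwards.
fn retain_equal_rows(
    index_pairs: &mut Vec<(usize, usize)>,
    left_columns: &[Column],
    right_columns: &[Column],
    null_equals_null: bool,
) {
    index_pairs.retain(|&(left_row, right_row)| {
        left_columns
            .iter()
            .zip(right_columns.iter())
            // `all` short-circuits on the first column pair that differs.
            .all(|(left, right)| values_match(left[left_row], right[right_row], null_equals_null))
    });
}
```

Because `retain` prunes the index list directly, this variant needs no boolean buffer at all; a real kernel would additionally have to dispatch on the Arrow data type and read validity bitmaps rather than `Option` values.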
