alamb commented on code in PR #12996:
URL: https://github.com/apache/datafusion/pull/12996#discussion_r1819112999


##########
datafusion/physical-plan/src/aggregates/group_values/group_column.rs:
##########
@@ -287,6 +469,63 @@ where
         };
     }
 
+    fn vectorized_equal_to(

Review Comment:
   This same pattern of `take` over some subset of rows is basically what the 
arrow `filter` kernel does. However, it currently only accepts a single array as 
input and produces a single array as output. 
   
   I wonder if we could implement some version of `filter` that can be 
invoked multiple times, with each call appending to an in-progress array.
   
   That might be the low-level building block that generalizes `take`, 
`filter`, and `interleave` (and we could use it as part of `Repartition`) 🤔 
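   
   To make the idea concrete, here is a rough sketch in plain Rust of what such 
an "appendable" kernel could look like. Everything in it is hypothetical: 
`IncrementalFilter`, `append_filtered`, `append_taken`, and `finish` are 
illustrative names, not arrow-rs APIs, and a `Vec<Option<i64>>` stands in for a 
real array builder.

```rust
/// Hypothetical builder (NOT an arrow-rs API): accumulates selected rows
/// from many input batches into a single in-progress output.
struct IncrementalFilter {
    values: Vec<Option<i64>>,
}

impl IncrementalFilter {
    fn new() -> Self {
        Self { values: Vec::new() }
    }

    /// `filter`-style append: keep the rows of `input` whose mask entry is true.
    fn append_filtered(&mut self, input: &[Option<i64>], mask: &[bool]) {
        assert_eq!(input.len(), mask.len(), "mask must match input length");
        self.values.extend(
            input
                .iter()
                .zip(mask)
                .filter(|(_, keep)| **keep)
                .map(|(v, _)| *v),
        );
    }

    /// `take`-style append: copy the rows of `input` at the given indices.
    fn append_taken(&mut self, input: &[Option<i64>], indices: &[usize]) {
        self.values.extend(indices.iter().map(|&i| input[i]));
    }

    /// Hand back the accumulated output and reset the builder for reuse.
    fn finish(&mut self) -> Vec<Option<i64>> {
        std::mem::take(&mut self.values)
    }
}

fn main() {
    let mut out = IncrementalFilter::new();
    // First batch: select rows with a boolean mask.
    out.append_filtered(&[Some(1), None, Some(3)], &[true, true, false]);
    // Second batch: select rows by index, appended to the same output.
    out.append_taken(&[Some(10), Some(20), Some(30)], &[2, 0]);
    let result = out.finish();
    assert_eq!(result, vec![Some(1), None, Some(30), Some(10)]);
    println!("{:?}", result);
}
```

   In this sketch `interleave` falls out as repeated `append_taken` calls 
against different input batches, which is roughly what `Repartition` would need 
too.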



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

