aucahuasi commented on a change in pull request #11257:
URL: https://github.com/apache/arrow/pull/11257#discussion_r720548962



##########
File path: r/tests/testthat/test-dplyr-summarize.R
##########
@@ -227,6 +228,19 @@ test_that("Group by n_distinct() on dataset", {
       collect(),
     tbl
   )
+  # Without groupby
+  expect_dplyr_equal(
+    input %>%
+      summarize(distinct = n_distinct(lgl, na.rm = FALSE)) %>%
+      collect(),
+    tbl
+  )
+  expect_dplyr_equal(
+    input %>%
+      summarize(distinct = n_distinct(lgl, na.rm = TRUE)) %>%

Review comment:
       Good catch, @nealrichardson, thanks!
   Let me try to address this. I guess the kernel will remain unary in C++, 
but I need to change the definition of `agg_funcs$n_distinct` in 
`dplyr-functions.R` to support that behavior.
   @ianmcook, let me know if you have more ideas here; I could use some 
guidance :)




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.


