thisisnic commented on code in PR #14286:
URL: https://github.com/apache/arrow/pull/14286#discussion_r988811346
##########
r/tests/testthat/test-dplyr-join.R:
##########
@@ -338,3 +342,33 @@ test_that("arrow dplyr query can join two datasets", {
}
)
})
+
+test_that("joins on datasets handles keep", {
+ full_data_df <- tibble::tibble(
+ x = rep(c("a", "b"), each = 5),
+ y = rep(1:5, 2),
+ z = rep("zzz", 10),
+ index = 1:10
+ )
+ small_dataset_df <- tibble::tibble(
+ value = c(0.1, 0.2, 0.3, 0.4, 0.5),
+ x = c(rep("a", 3), rep("b", 2)),
+ y = 1:5,
+ z = 6:10
+ )
+ full_data <- InMemoryDataset$create(full_data_df)
Review Comment:
Just went to check the original repro, and can confirm that the behaviour is
the same for Tables and Datasets, so using a Table is fine here.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]