ianmcook opened a new pull request, #33664: URL: https://github.com/apache/arrow/pull/33664
# Which issue does this PR close? Closes #14981 # Rationale for this change dplyr 1.1.0 introduces a new function `join_by()` for specifying join conditions. This PR adds supports for `join_by()` in dplyr joins on Arrow objects. The support is limited only to equality conditions. The code added in this PR throws an error if the user specifies inequality conditions or uses helper functions in `join_by()`. https://www.tidyverse.org/blog/2022/11/dplyr-1-1-0-is-coming-soon/#join-improvements <!-- Why are you proposing this change? If this is already explained clearly in the issue then this section is not needed. Explaining clearly why changes are proposed helps reviewers understand your changes and offer better suggestions for fixes. --> # What changes are included in this PR? - Code to handle `join_by()` in dplyr joins on Arrow objects with equality conditions - Tests of handling of `join_by()`, which are skipped when the version of dplyr is less than `1.0.99.9000` which is the current version number of the development version of dplyr on GitHub which that become version `1.1.0` on CRAN. # Are these changes tested? Yes # Are there any user-facing changes? Yes, the new dplyr syntax for specifying join conditions is supported, but use of this new syntax is optional. The old dplyr join syntax will continue to work. There are no breaking changes in this PR. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
