ianmcook opened a new pull request, #33664:
URL: https://github.com/apache/arrow/pull/33664

   # Which issue does this PR close?
   
   Closes #14981
   
   # Rationale for this change
   
   dplyr 1.1.0 introduces a new function, `join_by()`, for specifying join 
conditions. This PR adds support for `join_by()` in dplyr joins on Arrow 
objects. Support is limited to equality conditions. The code added in 
this PR throws an error if the user specifies inequality conditions or uses 
helper functions in `join_by()`.
   
   
https://www.tidyverse.org/blog/2022/11/dplyr-1-1-0-is-coming-soon/#join-improvements
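   
   For illustration, a minimal sketch of the equality-only usage described 
above. The table and column names (`orders`, `customers`, `cust`, `cust_id`) 
are hypothetical and not taken from the PR, and the sketch assumes arrow with 
this change applied plus dplyr 1.1.0 or later.

```r
library(arrow)
library(dplyr)  # assumes dplyr >= 1.1.0, which provides join_by()

# Hypothetical example data held as Arrow Tables
orders <- arrow_table(order_id = 1:3, cust = c("a", "b", "c"))
customers <- arrow_table(cust_id = c("a", "b"), region = c("east", "west"))

# New syntax: an equality condition between differently named key columns
joined <- orders |>
  left_join(customers, by = join_by(cust == cust_id)) |>
  collect()

# Older syntax, which continues to work and expresses the same join
joined_old <- orders |>
  left_join(customers, by = c("cust" = "cust_id")) |>
  collect()

# Per the PR description, an inequality condition such as
#   join_by(order_id >= cust_id)
# or a join_by() helper function is expected to raise an error on Arrow objects.
```

   Arrow does not guarantee row order after a join, so sort both results 
before comparing them row by row.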
   
   
   # What changes are included in this PR?
   
   - Code to handle `join_by()` in dplyr joins on Arrow objects with equality 
conditions
   - Tests of `join_by()` handling, which are skipped when the installed 
version of dplyr is older than `1.0.99.9000`. That is the current version 
number of the development version of dplyr on GitHub, which will become 
version `1.1.0` on CRAN.
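   
   The version-gated skip described above could look roughly like the sketch 
below. It uses testthat's `skip_if_not_installed()` with a minimum version, 
which is an assumption: the PR may use a different skip mechanism. The test 
name and data are hypothetical.

```r
library(testthat)

# 1.0.99.9000 sorts below 1.1.0, so the CRAN release also passes the check
released_ok <- package_version("1.1.0") >= package_version("1.0.99.9000")

test_that("join_by() equality join works on Arrow tables (illustrative)", {
  # Skip unless dplyr is at least the development version that has join_by()
  skip_if_not_installed("dplyr", minimum_version = "1.0.99.9000")
  skip_if_not_installed("arrow")

  left <- arrow::arrow_table(x = 1:3)
  right <- arrow::arrow_table(y = 2:4, z = c("a", "b", "c"))

  out <- dplyr::collect(
    dplyr::inner_join(left, right, by = dplyr::join_by(x == y))
  )

  # Only x = 2 and x = 3 have a matching y
  expect_equal(nrow(out), 2)
})
```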
   
   
   # Are these changes tested?
   
   Yes
   
   # Are there any user-facing changes?
   
   Yes, the new dplyr syntax for specifying join conditions is supported, but 
use of this new syntax is optional. The old dplyr join syntax will continue to 
work. There are no breaking changes in this PR.

