[ 
https://issues.apache.org/jira/browse/ARROW-10415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405899#comment-17405899
 ] 

Ian Cook edited comment on ARROW-10415 at 8/27/21, 4:13 PM:
------------------------------------------------------------

{code:java}
df %>% select(x, y) %>% distinct()
# or
df %>% distinct(x, y){code}
is equivalent to
{code:java}
df %>% group_by(x, y) %>% summarize(){code}
so when ARROW-13543 gives us the ability to do the latter, we will be able to 
implement the former.


was (Author: icook):
{code:java}
df %>% select(x, y) %>% distinct(){code}
is equivalent to
{code:java}
df %>% group_by(x, y) %>% summarize(){code}
so when ARROW-13543 gives us the ability to do the latter, we will be able to 
implement the former.

> [R] Support for dplyr::distinct()
> ---------------------------------
>
>                 Key: ARROW-10415
>                 URL: https://issues.apache.org/jira/browse/ARROW-10415
>             Project: Apache Arrow
>          Issue Type: Wish
>          Components: R
>    Affects Versions: 2.0.0
>            Reporter: Christian M
>            Priority: Minor
>              Labels: dplyr, query-engine
>             Fix For: 6.0.0
>
>         Attachments: image-2020-10-28-15-01-54-198.png
>
>
> It would be nice if dplyr::distinct worked with arrow tables: 
>  
> !image-2020-10-28-15-01-54-198.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to