[ 
https://issues.apache.org/jira/browse/ARROW-13766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17461567#comment-17461567
 ] 

Dewey Dunnington commented on ARROW-13766:
------------------------------------------

Some example usage maybe useful for a test:

{code:R}
library(arrow, warn.conflicts = FALSE)
library(dplyr, warn.conflicts = FALSE)

df <- tibble(a = rep(letters, 10), b = 1:260, c = 260:1)

df %>% slice_min(a, n = 5, with_ties = TRUE)
#> # A tibble: 10 × 3
#>    a         b     c
#>    <chr> <int> <int>
#>  1 a         1   260
#>  2 a        27   234
#>  3 a        53   208
#>  4 a        79   182
#>  5 a       105   156
#>  6 a       131   130
#>  7 a       157   104
#>  8 a       183    78
#>  9 a       209    52
#> 10 a       235    26
df %>% slice_min(a, n = 5, with_ties = FALSE)
#> # A tibble: 5 × 3
#>   a         b     c
#>   <chr> <int> <int>
#> 1 a         1   260
#> 2 a        27   234
#> 3 a        53   208
#> 4 a        79   182
#> 5 a       105   156

df %>% slice_min(c, n = 5)
#> # A tibble: 5 × 3
#>   a         b     c
#>   <chr> <int> <int>
#> 1 z       260     1
#> 2 y       259     2
#> 3 x       258     3
#> 4 w       257     4
#> 5 v       256     5
df %>% slice_min(c, prop = 5 / 260)
#> # A tibble: 5 × 3
#>   a         b     c
#>   <chr> <int> <int>
#> 1 z       260     1
#> 2 y       259     2
#> 3 x       258     3
#> 4 w       257     4
#> 5 v       256     5
{code}




> [R] Add Arrow methods slice_min(), slice_max()
> ----------------------------------------------
>
>                 Key: ARROW-13766
>                 URL: https://issues.apache.org/jira/browse/ARROW-13766
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: R
>            Reporter: Ian Cook
>            Priority: Major
>              Labels: query-engine
>             Fix For: 7.0.0
>
>
> Implement [{{slice_min()}} and 
> {{slice_max()}}|https://dplyr.tidyverse.org/reference/slice.html] methods for 
> {{ArrowTabular}}, {{Dataset}}, and {{arrow_dplyr_query}} objects.
> These dplyr functions supersede the older dplyr function 
> [{{top_n()}}|https://dplyr.tidyverse.org/reference/top_n.html] which I 
> suppose we should also consider implementing a method for.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to