eitsupi opened a new issue, #35445:
URL: https://github.com/apache/arrow/issues/35445

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   In dplyr, I believe that using `across(everything())` on a grouped data 
frame will not select the column used for grouping.
   
   ``` r
   mtcars |>
     dplyr::group_by(cyl) |>
     dplyr::summarise(dplyr::across(everything(), sum))
   #> # A tibble: 3 × 11
   #>     cyl   mpg  disp    hp  drat    wt  qsec    vs    am  gear  carb
   #>   <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
   #> 1     4  293. 1156.   909  44.8  25.1  211.    10     8    45    17
   #> 2     6  138. 1283.   856  25.1  21.8  126.     4     3    27    24
   #> 3     8  211. 4943.  2929  45.2  56.0  235.     0     2    46    49
   ```
   
   <sup>Created on 2023-05-05 with [reprex 
v2.0.2](https://reprex.tidyverse.org)</sup>
   
   However, arrow does not seem to exclude the columns used for grouping. The 
following example results in an error.
   (I installed arrow 12.0.0.20230503 from R-universe)
   
   ``` r
   mtcars |>
     arrow::as_arrow_table() |>
     dplyr::group_by(cyl) |>
     dplyr::summarise(dplyr::across(everything(), sum)) |>
     dplyr::collect()
   #> Error in `compute.arrow_dplyr_query()`:
   #> ! Invalid: Multiple matches for FieldRef.Name(cyl) in mpg: double
   #> cyl: double
   #> disp: double
   #> hp: double
   #> drat: double
   #> wt: double
   #> qsec: double
   #> vs: double
   #> am: double
   #> gear: double
   #> carb: double
   #> cyl: double
   #> Backtrace:
   #>     ▆
   #>  1. ├─dplyr::collect(...)
   #>  2. └─arrow:::collect.arrow_dplyr_query(...)
   #>  3.   └─arrow:::compute.arrow_dplyr_query(x)
   #>  4.     └─base::tryCatch(...)
   #>  5.       └─base (local) tryCatchList(expr, classes, parentenv, handlers)
   #>  6.         └─base (local) tryCatchOne(expr, names, parentenv, 
handlers[[1L]])
   #>  7.           └─value[[3L]](cond)
   #>  8.             └─arrow:::augment_io_error_msg(e, call, schema = schema())
   #>  9.               └─rlang::abort(msg, call = call)
   ```
   
   <sup>Created on 2023-05-05 with [reprex 
v2.0.2](https://reprex.tidyverse.org)</sup>
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to