PMassicotte opened a new issue, #35649:
URL: https://github.com/apache/arrow/issues/35649
### Describe the bug, including details regarding any error messages, version, and platform.
I am trying to work on a dataset without having to pull it with `collect()`.
I found that if I use `pivot_longer()` in the chain of operations,
`write_dataset()` hangs and nothing seems to happen.
```r
library(tidyverse)
library(arrow)
one_level_tree <- tempfile()
mtcars |>
  to_duckdb() |>
  pivot_longer(everything()) |>
  to_arrow() |>
  # collect() |> # collecting makes it work; otherwise it hangs on write_dataset()
  write_dataset(one_level_tree, partitioning = "name")
list.files(one_level_tree, recursive = TRUE)
```
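For comparison, the variant that the comment in the reprex reports as working can be sketched as below. This is a minimal sketch, not a fix: it assumes the `duckdb` and `dbplyr` packages are installed, `out_dir` is a name introduced here for illustration, and it pulls the whole result into R with `collect()`, which is exactly what the report is trying to avoid.

```r
library(arrow)
library(dplyr)
library(tidyr)

# Hypothetical output directory, used only for this sketch
out_dir <- tempfile()

mtcars |>
  to_duckdb() |>                   # register the data frame as a DuckDB table
  pivot_longer(everything()) |>    # reshape lazily inside DuckDB
  collect() |>                     # materialise the result as a tibble in R
  write_dataset(out_dir, partitioning = "name")

# One partition directory per distinct value of `name` is expected
files <- list.files(out_dir, recursive = TRUE)
files
```

The only difference from the hanging pipeline is that `collect()` replaces `to_arrow()`, so `write_dataset()` receives an in-memory data frame rather than a stream coming out of DuckDB.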
```r
r$> sessioninfo::session_info()
─ Session info ───────────────────────────────────────────────────────────────
setting value
version R version 4.3.0 (2023-04-21)
os Ubuntu 23.04
system x86_64, linux-gnu
ui X11
language en_CA:en
collate en_CA.UTF-8
ctype en_CA.UTF-8
tz America/Toronto
date 2023-05-17
pandoc NA
─ Packages ───────────────────────────────────────────────────────────────────
package * version date (UTC) lib source
arrow * 12.0.0 2023-05-05 [1] RSPM (R 4.3.0)
assertthat 0.2.1 2019-03-21 [1] RSPM (R 4.3.0)
bit 4.0.5 2022-11-15 [1] RSPM (R 4.3.0)
bit64 4.0.5 2020-08-30 [1] RSPM (R 4.3.0)
cli 3.6.1 2023-03-23 [1] RSPM (R 4.3.0)
colorspace 2.1-0 2023-01-23 [1] RSPM (R 4.3.0)
dplyr * 1.1.2 2023-04-20 [1] RSPM (R 4.3.0)
fansi 1.0.4 2023-01-22 [1] RSPM (R 4.3.0)
forcats * 1.0.0 2023-01-29 [1] RSPM (R 4.3.0)
generics 0.1.3 2022-07-05 [1] RSPM (R 4.3.0)
ggplot2 * 3.4.2 2023-04-03 [1] RSPM (R 4.3.0)
glue 1.6.2 2022-02-24 [1] RSPM (R 4.3.0)
gtable 0.3.3 2023-03-21 [1] RSPM (R 4.3.0)
hms 1.1.3 2023-03-21 [1] RSPM (R 4.3.0)
httpuv 1.6.11 2023-05-11 [1] RSPM (R 4.3.0)
jsonlite 1.8.4 2022-12-06 [1] RSPM (R 4.3.0)
later 1.3.1 2023-05-02 [1] RSPM (R 4.3.0)
lifecycle 1.0.3 2022-10-07 [1] RSPM (R 4.3.0)
lubridate * 1.9.2 2023-02-10 [1] RSPM (R 4.3.0)
magrittr 2.0.3 2022-03-30 [1] RSPM (R 4.3.0)
munsell 0.5.0 2018-06-12 [1] RSPM (R 4.3.0)
pillar 1.9.0 2023-03-22 [1] RSPM (R 4.3.0)
pkgconfig 2.0.3 2019-09-22 [1] RSPM (R 4.3.0)
promises 1.2.0.1 2021-02-11 [1] RSPM (R 4.3.0)
purrr * 1.0.1 2023-01-10 [1] RSPM (R 4.3.0)
R6 2.5.1 2021-08-19 [1] RSPM (R 4.3.0)
Rcpp 1.0.10 2023-01-22 [1] RSPM (R 4.3.0)
readr * 2.1.4 2023-02-10 [1] RSPM (R 4.3.0)
rlang 1.1.1 2023-04-28 [1] RSPM (R 4.3.0)
scales 1.2.1 2022-08-20 [1] RSPM (R 4.3.0)
sessioninfo 1.2.2 2021-12-06 [1] RSPM (R 4.3.0)
stringi 1.7.12 2023-01-11 [1] CRAN (R 4.3.0)
stringr * 1.5.0 2022-12-02 [1] RSPM (R 4.3.0)
tibble * 3.2.1 2023-03-20 [1] RSPM (R 4.3.0)
tidyr * 1.3.0 2023-01-24 [1] RSPM (R 4.3.0)
tidyselect 1.2.0 2022-10-10 [1] RSPM (R 4.3.0)
tidyverse * 2.0.0 2023-02-22 [1] RSPM (R 4.3.0)
timechange 0.2.0 2023-01-11 [1] RSPM (R 4.3.0)
tzdb 0.4.0 2023-05-12 [1] RSPM (R 4.3.0)
utf8 1.2.3 2023-01-31 [1] RSPM (R 4.3.0)
vctrs 0.6.2 2023-04-19 [1] RSPM (R 4.3.0)
withr 2.5.0 2022-03-03 [1] RSPM (R 4.3.0)
```
### Component(s)
R