PMassicotte opened a new issue, #35649:
URL: https://github.com/apache/arrow/issues/35649

   ### Describe the bug, including details regarding any error messages, version, and platform.
   
   I am trying to work on a dataset without having to pull it with `collect()`. I found that when `pivot_longer()` is part of the chain of operations, `write_dataset()` hangs and nothing seems to happen.
   
   ```r
   library(tidyverse)
   library(arrow)
   
   
   one_level_tree <- tempfile()
   
   mtcars |>
     to_duckdb() |>
     pivot_longer(everything()) |>
     to_arrow() |>
     # collect() |> # collecting makes it work; otherwise, it hangs on write_dataset()
     write_dataset(one_level_tree, partitioning = "name")
   
   list.files(one_level_tree, recursive = TRUE)
   ```
   
   ```r
   r$> sessioninfo::session_info()
   ─ Session info ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
    setting  value
    version  R version 4.3.0 (2023-04-21)
    os       Ubuntu 23.04
    system   x86_64, linux-gnu
    ui       X11
    language en_CA:en
    collate  en_CA.UTF-8
    ctype    en_CA.UTF-8
    tz       America/Toronto
    date     2023-05-17
    pandoc   NA
   
   ─ Packages ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
    package     * version date (UTC) lib source
    arrow       * 12.0.0  2023-05-05 [1] RSPM (R 4.3.0)
    assertthat    0.2.1   2019-03-21 [1] RSPM (R 4.3.0)
    bit           4.0.5   2022-11-15 [1] RSPM (R 4.3.0)
    bit64         4.0.5   2020-08-30 [1] RSPM (R 4.3.0)
    cli           3.6.1   2023-03-23 [1] RSPM (R 4.3.0)
    colorspace    2.1-0   2023-01-23 [1] RSPM (R 4.3.0)
    dplyr       * 1.1.2   2023-04-20 [1] RSPM (R 4.3.0)
    fansi         1.0.4   2023-01-22 [1] RSPM (R 4.3.0)
    forcats     * 1.0.0   2023-01-29 [1] RSPM (R 4.3.0)
    generics      0.1.3   2022-07-05 [1] RSPM (R 4.3.0)
    ggplot2     * 3.4.2   2023-04-03 [1] RSPM (R 4.3.0)
    glue          1.6.2   2022-02-24 [1] RSPM (R 4.3.0)
    gtable        0.3.3   2023-03-21 [1] RSPM (R 4.3.0)
    hms           1.1.3   2023-03-21 [1] RSPM (R 4.3.0)
    httpuv        1.6.11  2023-05-11 [1] RSPM (R 4.3.0)
    jsonlite      1.8.4   2022-12-06 [1] RSPM (R 4.3.0)
    later         1.3.1   2023-05-02 [1] RSPM (R 4.3.0)
    lifecycle     1.0.3   2022-10-07 [1] RSPM (R 4.3.0)
    lubridate   * 1.9.2   2023-02-10 [1] RSPM (R 4.3.0)
    magrittr      2.0.3   2022-03-30 [1] RSPM (R 4.3.0)
    munsell       0.5.0   2018-06-12 [1] RSPM (R 4.3.0)
    pillar        1.9.0   2023-03-22 [1] RSPM (R 4.3.0)
    pkgconfig     2.0.3   2019-09-22 [1] RSPM (R 4.3.0)
    promises      1.2.0.1 2021-02-11 [1] RSPM (R 4.3.0)
    purrr       * 1.0.1   2023-01-10 [1] RSPM (R 4.3.0)
    R6            2.5.1   2021-08-19 [1] RSPM (R 4.3.0)
    Rcpp          1.0.10  2023-01-22 [1] RSPM (R 4.3.0)
    readr       * 2.1.4   2023-02-10 [1] RSPM (R 4.3.0)
    rlang         1.1.1   2023-04-28 [1] RSPM (R 4.3.0)
    scales        1.2.1   2022-08-20 [1] RSPM (R 4.3.0)
    sessioninfo   1.2.2   2021-12-06 [1] RSPM (R 4.3.0)
    stringi       1.7.12  2023-01-11 [1] CRAN (R 4.3.0)
    stringr     * 1.5.0   2022-12-02 [1] RSPM (R 4.3.0)
    tibble      * 3.2.1   2023-03-20 [1] RSPM (R 4.3.0)
    tidyr       * 1.3.0   2023-01-24 [1] RSPM (R 4.3.0)
    tidyselect    1.2.0   2022-10-10 [1] RSPM (R 4.3.0)
    tidyverse   * 2.0.0   2023-02-22 [1] RSPM (R 4.3.0)
    timechange    0.2.0   2023-01-11 [1] RSPM (R 4.3.0)
    tzdb          0.4.0   2023-05-12 [1] RSPM (R 4.3.0)
    utf8          1.2.3   2023-01-31 [1] RSPM (R 4.3.0)
    vctrs         0.6.2   2023-04-19 [1] RSPM (R 4.3.0)
    withr         2.5.0   2022-03-03 [1] RSPM (R 4.3.0)
   ```
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

