rkrug opened a new issue, #40711:
URL: https://github.com/apache/arrow/issues/40711

   ### Describe the bug, including details regarding any error messages, version, and platform.
   
   When saving the attached rds file to a Parquet file, I get the error shown 
in the reprex: `Error: Invalid: Problem with column 3 (au_orcid): Invalid: Expecting a character vector`.
   
   The problem seems to be that one column contains only NA values, which R 
interprets as logical, while arrow expects a character vector.
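   
   A minimal sketch of that diagnosis and a possible workaround, assuming the all-NA column really is stored as logical. The data frame here is a hypothetical stand-in for `problem.rds`; only the column name `au_orcid` comes from the error message, and this has not been checked against the attached file.
   
   ``` r
   # Hypothetical stand-in for the attached data: au_orcid holds only NA.
   df <- data.frame(id = 1:3, au_orcid = NA)
   
   # A bare NA is logical in R, so the whole column is logical.
   class(df$au_orcid)
   
   # Find columns that are logical and entirely NA ...
   all_na_logical <- vapply(
       df,
       function(col) is.logical(col) && all(is.na(col)),
       logical(1)
   )
   
   # ... and convert them to character (the values become NA_character_).
   df[all_na_logical] <- lapply(df[all_na_logical], as.character)
   
   class(df$au_orcid)
   ```
   
   If the diagnosis holds, passing the converted data frame to `arrow::write_dataset()` should avoid the type mismatch; other columns are left untouched.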
   
   ```
   arrow          15.0.1     2024-03-19 [1] https://apache.r-universe.dev (R 4.3.3)
   
   setting  value
    version  R version 4.3.3 (2024-02-29)
    os       macOS Sonoma 14.4
    system   aarch64, darwin20
    ui       X11
    language (EN)
    collate  en_US.UTF-8
    ctype    en_US.UTF-8
    tz       Europe/Zurich
    date     2024-03-21
    pandoc   3.1.12.3 @ /opt/homebrew/bin/pandoc
   ```
   
   
   ``` r
   library(arrow)
   #>
   #> Attaching package: ‘arrow’
   #> The following object is masked from ‘package:utils’:
   #>
   #>     timestamp
   data <- readRDS("problem.rds")
   arrow::write_dataset(
       data,
       path = "~/problem",
       partitioning = c("publication_year", "page"),
       format = "parquet"
   )
   #> Error: Invalid: Problem with column 3 (au_orcid): Invalid: Expecting a character vector
   ```
   <sup>Created on 2024-03-21 with [reprex v2.1.0](https://reprex.tidyverse.org/)</sup>
   
[problem.rds.zip](https://github.com/apache/arrow/files/14696782/problem.rds.zip)
   
   
   
   ### Component(s)
   
   R

