rkrug opened a new issue, #40711: URL: https://github.com/apache/arrow/issues/40711
### Describe the bug, including details regarding any error messages, version, and platform. When saving the attached rds file to a parquet file, I get the error as in the repress `Error: Invalid: Problem with column 3 (au_orcid): Invalid: Expecting a character vector`. The problem seems to be that one column only contains NA, and these are interpreted as logical by R, while arrow expects a character vector. ``` arrow 15.0.1 2024-03-19 [1] https://apache.r-universe.dev (R 4.3.3) setting value version R version 4.3.3 (2024-02-29) os macOS Sonoma 14.4 system aarch64, darwin20 ui X11 language (EN) collate en_US.UTF-8 ctype en_US.UTF-8 tz Europe/Zurich date 2024-03-21 pandoc 3.1.12.3 @ /opt/homebrew/bin/pandoc ``` ``` r library(arrow) #> #> Attaching package: ‘arrow’ #> The following object is masked from ‘package:utils’: #> #> timestamp data <- readRDS(“problem.rds”) arrow::write_dataset( data, path = “~/problem”, partitioning = c(“publication_year”, “page”), format = “parquet” ) #> Error: Invalid: Problem with column 3 (au_orcid): Invalid: Expecting a character vector ``` <sup>Created on 2024-03-21 with [reprex v2.1.0](https://reprex.tidyverse.org/)</sup> [problem.rds.zip](https://github.com/apache/arrow/files/14696782/problem.rds.zip) ### Component(s) R -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
