SHIMA Tatsuya created ARROW-15599:
-------------------------------------

             Summary: [R] can't explicitly convert a column as a timestamp from CSV (or other delimited) file
                 Key: ARROW-15599
                 URL: https://issues.apache.org/jira/browse/ARROW-15599
             Project: Apache Arrow
          Issue Type: Bug
    Affects Versions: 6.0.1
         Environment: R version 4.1.2 (2021-11-01)
                      Platform: x86_64-pc-linux-gnu (64-bit)
                      Running under: Ubuntu 20.04.3 LTS
            Reporter: SHIMA Tatsuya

I tried to read a CSV column as a timestamp, but I could only get it to work when `col_types` was not specified.
I'm sorry if I missed something and this is the expected behavior. (It would be great if you could add an example with `col_types` to the documentation.)

{code:r}
library(arrow)
#>
#> Attaching package: 'arrow'
#> The following object is masked from 'package:utils':
#>
#>     timestamp

t_string <- tibble::tibble(
  x = "2018-10-07 19:04:05"
)

write_csv_arrow(t_string, "tmp.csv")

read_csv_arrow(
  "tmp.csv",
  as_data_frame = FALSE
)
#> Table
#> 1 rows x 1 columns
#> $x <timestamp[s]>

read_csv_arrow(
  "tmp.csv",
  col_names = "x",
  col_types = "?",
  as_data_frame = FALSE
)
#> Table
#> 2 rows x 1 columns
#> $x <string>

read_csv_arrow(
  "tmp.csv",
  col_names = "x",
  col_types = "T",
  as_data_frame = FALSE
)
#> Error: Invalid: In CSV column #0: CSV conversion error to timestamp[s]: invalid value 'x'

read_csv_arrow(
  "tmp.csv",
  col_names = "x",
  col_types = "T",
  as_data_frame = FALSE,
  timestamp_parsers = "%Y-%m-%d %H:%M:%S"
)
#> Error: Invalid: In CSV column #0: CSV conversion error to timestamp[s]: invalid value 'x'
{code}
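
A possible explanation, offered as an assumption rather than a confirmed diagnosis: when `col_names` is supplied, the reader may treat the file's header line as data. That would match the `col_types = "?"` call returning 2 rows of strings, and the literal value 'x' reaching the timestamp converter. The sketch below tries the `skip` argument of `read_csv_arrow()` to drop the header line. The temporary file path is illustrative and not part of the original report.

```r
library(arrow)

# Illustrative temporary file; the path is not from the original report.
csv_path <- tempfile(fileext = ".csv")
write_csv_arrow(
  data.frame(x = "2018-10-07 19:04:05", stringsAsFactors = FALSE),
  csv_path
)

# Assumption: supplying col_names makes the reader treat the header line as
# data, so skip = 1 drops that line before the "T" (timestamp) conversion.
tab <- read_csv_arrow(
  csv_path,
  col_names = "x",
  col_types = "T",
  skip = 1,
  as_data_frame = FALSE
)

# Inspect the resulting column type.
tab$schema
```

If this reads a single row with a timestamp column, the errors above come from the header line rather than from the timestamp value itself.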