[
https://issues.apache.org/jira/browse/ARROW-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jacob Wujciak-Jens updated ARROW-16318:
---------------------------------------
Summary: [R]Timezone is not supported by to_duckdb() (was: Timezone is not
supported by to_duckdb())
> [R]Timezone is not supported by to_duckdb()
> -------------------------------------------
>
> Key: ARROW-16318
> URL: https://issues.apache.org/jira/browse/ARROW-16318
> Project: Apache Arrow
> Issue Type: Bug
> Components: R
> Affects Versions: 7.0.0
> Reporter: Zsolt Kegyes-Brassai
> Priority: Minor
>
> Here is a reproducible example:
>
> {code:java}
> library(tidyverse)
> library(arrow)
> df1 <- tibble(time = lubridate::now(tzone = "UTC"))
> str(df1)
> #> tibble [1 x 1] (S3: tbl_df/tbl/data.frame)
> #> $ time: POSIXct[1:1], format: "2022-04-25 12:50:10"
> write_dataset(df1, here::here("temp/df1"), format = "parquet")
> open_dataset(here::here("temp/df1")) |>
> to_duckdb()
> #> Error: duckdb_prepare_R: Failed to prepare query SELECT *
> #> FROM "arrow_001" AS "q01"
> #> WHERE (0 = 1)
> #> Error: Not implemented Error: Unsupported Internal Arrow Type tsu:UTC
> df2 <- tibble(time = lubridate::now())
> str(df2)
> #> tibble [1 x 1] (S3: tbl_df/tbl/data.frame)
> #> $ time: POSIXct[1:1], format: "2022-04-25 14:50:11"
> write_dataset(df2, here::here("temp/df2"), format = "parquet")
> open_dataset(here::here("temp/df2")) |>
> to_duckdb()
> #> # Source: table<arrow_002> [?? x 1]
> #> # Database: duckdb_connection
> #> time
> #> <dttm>
> #> 1 2022-04-25 12:50:11
> {code}
>
> The timestamps without timezone information are working fine.
> How one can remove easily the timezone information from {{timestamp }}type
> column from a parquet dataset?
--
This message was sent by Atlassian Jira
(v8.20.7#820007)