[ https://issues.apache.org/jira/browse/ARROW-7723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated ARROW-7723:
----------------------------------
Labels: pull-request-available (was: )
> [Python] StructArray timestamp type with timezone to_pandas convert error
> --------------------------------------------------------------------------
>
> Key: ARROW-7723
> URL: https://issues.apache.org/jira/browse/ARROW-7723
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: Bryan Cutler
> Assignee: Wes McKinney
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 0.16.0
>
>
> When a {{StructArray}} has a child that is a timestamp with a timezone, the
> {{to_pandas}} conversion outputs raw int64 values instead of timestamps:
> {code:python}
> In [1]: import pyarrow as pa
> ...: import pandas as pd
> ...: arr = pa.array([{'start': pd.Timestamp.now(), 'end': pd.Timestamp.now()}])
> ...:
>
> In [2]: arr.to_pandas()
>
> Out[2]:
> 0 {'end': 2020-01-29 11:38:02.792681, 'start': 2...
> dtype: object
> In [3]: ts = pd.Timestamp.now()
>
> In [4]: arr2 = pa.array([ts], type=pa.timestamp('us', tz='America/New_York'))
>
> In [5]: arr2.to_pandas()
>
> Out[5]:
> 0 2020-01-29 06:38:47.848944-05:00
> dtype: datetime64[ns, America/New_York]
> In [6]: arr = pa.StructArray.from_arrays([arr2, arr2], ['start', 'stop'])
>
> In [7]: arr.to_pandas()
>
> Out[7]:
> 0 {'start': 1580297927848944000, 'stop': 1580297...
> dtype: object
> {code}
> Originally raised in https://github.com/apache/arrow/pull/6312
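
To make the symptom in Out[7] concrete: the integer shown for 'start' matches the instant in Out[5] when read as nanoseconds since the Unix epoch. The standard-library sketch below decodes such a value back into a timezone-aware datetime. It is only an illustration, not the fix applied in Arrow. The helper names (ns_to_datetime, fix_struct_row) are hypothetical, the fixed UTC-5 offset is an assumption standing in for America/New_York on that date, and the row is assumed to hold plain Python ints.

```python
from datetime import datetime, timedelta, timezone

# Unix epoch as an aware datetime, used as the base for the offset arithmetic.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def ns_to_datetime(ns, tz=timezone.utc):
    """Convert integer nanoseconds since the Unix epoch to an aware datetime.

    datetime has microsecond resolution, so sub-microsecond digits are dropped.
    (Hypothetical helper, not part of pyarrow or pandas.)
    """
    micros = ns // 1000
    return (EPOCH + timedelta(microseconds=micros)).astimezone(tz)


def fix_struct_row(row, tz=timezone.utc):
    """Return a copy of one struct row (a dict) with int values decoded.

    Assumes every int in the row is a nanosecond timestamp; other values
    are passed through unchanged. (Hypothetical helper.)
    """
    return {
        key: ns_to_datetime(value, tz) if isinstance(value, int) else value
        for key, value in row.items()
    }


# Assumption: a fixed UTC-5 offset stands in for America/New_York,
# which observes EST on 2020-01-29.
EST = timezone(timedelta(hours=-5))

# The 'start' value is copied from Out[7] in the report above.
row = {'start': 1580297927848944000}
fixed = fix_struct_row(row, EST)
print(fixed['start'].isoformat())  # 2020-01-29T06:38:47.848944-05:00
```

The decoded value agrees with the timestamp that arr2.to_pandas() prints in Out[5], which suggests the struct conversion keeps the right instant but loses the timestamp type and timezone.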
--
This message was sent by Atlassian Jira
(v8.3.4#803005)