[ 
https://issues.apache.org/jira/browse/ARROW-15565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joris Van den Bossche updated ARROW-15565:
------------------------------------------
    Summary: [Python] Extension dtype in the index is not preserved on a 
roundtrip from Pandas  (was: [Python] Interval dtype is not preserved on a 
roundtrip from Pandas and IntervalIndex)

> [Python] Extension dtype in the index is not preserved on a roundtrip from 
> Pandas
> ---------------------------------------------------------------------------------
>
>                 Key: ARROW-15565
>                 URL: https://issues.apache.org/jira/browse/ARROW-15565
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Alenka Frim
>            Priority: Major
>
> Doing a roundtrip from Pandas with IntervalIndex results in an object-dtype 
> index, and not a proper Intervalndex - that means the interval dtype is not 
> preserved on a roundtrip, while this would be the case if it was a normal 
> column instead of the index.
> Example:
> {code}
> >>> import pandas as pd
> >>> import pyarrow as pa
> >>> df = pd.DataFrame(index=pd.interval_range(start=0, end=3))
> >>> table = pa.table(df)
> >>> table.to_pandas().index
> Index([{'left': 0, 'right': 1}, {'left': 1, 'right': 2},
>        {'left': 2, 'right': 3}],
>       dtype='object')
> >>> df.index
> IntervalIndex([(0, 1], (1, 2], (2, 3]], dtype='interval[int64, right]')
> >>> 
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to