[
https://issues.apache.org/jira/browse/ARROW-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Licht Takeuchi reassigned ARROW-1743:
-------------------------------------
Assignee: Licht Takeuchi
> Table to_pandas fails when index contains categorical column
> ------------------------------------------------------------
>
> Key: ARROW-1743
> URL: https://issues.apache.org/jira/browse/ARROW-1743
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.7.1
> Reporter: Brian Pendleton
> Assignee: Licht Takeuchi
>
> Categorical columns in the index of a dataframe are causing a roundtrip
> failure.
> {code}
> >>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
> >>> df['a'] = df.a.astype('category')
> >>> df = df.set_index('a')
> >>> tbl = pa.Table.from_pandas(df)
> >>> tbl.to_pandas()
> Traceback (most recent call last):
> File "<stdin>", line 1, in <module>
> File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
> File
> "C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py",
> line 303, in table_to_blockmanager
> if not values.flags.writeable:
> AttributeError: 'Categorical' object has no attribute 'flags'
> {code}
> Works as expected when you don't change have the categorical:
> {code}
> >>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
> >>> df = df.set_index('a')
> >>> tbl = pa.Table.from_pandas(df)
> >>> tbl.to_pandas()
> b
> a
> 1 1
> 2 2
> 3 3
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)