[ https://issues.apache.org/jira/browse/ARROW-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Brian Pendleton updated ARROW-1743:
-----------------------------------
Description:
A DataFrame whose index is a categorical column fails to roundtrip through
a pyarrow Table: from_pandas succeeds, but to_pandas raises AttributeError.
{code}
>>> import pandas as pd
>>> import pyarrow as pa
>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
>>> df['a'] = df.a.astype('category')
>>> df = df.set_index('a')
>>> tbl = pa.Table.from_pandas(df)
>>> tbl.to_pandas()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
  File "C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py", line 303, in table_to_blockmanager
    if not values.flags.writeable:
AttributeError: 'Categorical' object has no attribute 'flags'
{code}
Works as expected when the index column is not categorical:
{code}
>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
>>> df = df.set_index('a')
>>> tbl = pa.Table.from_pandas(df)
>>> tbl.to_pandas()
   b
a
1  1
2  2
3  3
{code}
was:
Categorical columns in the index of a dataframe are causing a roundtrip
failure.
{code}
>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
>>> df['a'] = df.a.astype('category')
>>> df = df.set_index('a')
>>> tbl = pa.Table.from_pandas(df)
>>> tbl.to_pandas()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
  File "C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py", line 303, in table_to_blockmanager
    if not values.flags.writeable:
AttributeError: 'Categorical' object has no attribute 'flags'
{code}
> Table to_pandas fails when index contains categorical column
> ------------------------------------------------------------
>
> Key: ARROW-1743
> URL: https://issues.apache.org/jira/browse/ARROW-1743
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.7.1
> Reporter: Brian Pendleton