[ 
https://issues.apache.org/jira/browse/ARROW-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Pendleton updated ARROW-1743:
-----------------------------------
    Description: 
Categorical columns in the index of a dataframe are causing a roundtrip 
failure.  

{code}
>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
>>> df['a'] = df.a.astype('category')
>>> df = df.set_index('a')
>>> tbl = pa.Table.from_pandas(df)
>>> tbl.to_pandas()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
  File 
"C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py",
 line 303, in table_to_blockmanager
    if not values.flags.writeable:
AttributeError: 'Categorical' object has no attribute 'flags'
{code}


Works as expected when you don't change have the categorical:
{code}
>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
>>> df = df.set_index('a')
>>> tbl = pa.Table.from_pandas(df)
>>> tbl.to_pandas()
   b
a
1  1
2  2
3  3
{code}


  was:
Categorical columns in the index of a dataframe are causing a roundtrip 
failure.  

{code}
>>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
>>> df['a'] = df.a.astype('category')
>>> df = df.set_index('a')
>>> tbl = pa.Table.from_pandas(df)
>>> tbl.to_pandas()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
  File 
"C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py",
 line 303, in table_to_blockmanager
    if not values.flags.writeable:
AttributeError: 'Categorical' object has no attribute 'flags'
{code}




> Table to_pandas fails when index contains categorical column
> ------------------------------------------------------------
>
>                 Key: ARROW-1743
>                 URL: https://issues.apache.org/jira/browse/ARROW-1743
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>    Affects Versions: 0.7.1
>            Reporter: Brian Pendleton
>
> Categorical columns in the index of a dataframe are causing a roundtrip 
> failure.  
> {code}
> >>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
> >>> df['a'] = df.a.astype('category')
> >>> df = df.set_index('a')
> >>> tbl = pa.Table.from_pandas(df)
> >>> tbl.to_pandas()
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "table.pxi", line 881, in pyarrow.lib.Table.to_pandas
>   File 
> "C:\Users\bpendlet\Miniconda3\envs\panpy3\lib\site-packages\pyarrow\pandas_compat.py",
>  line 303, in table_to_blockmanager
>     if not values.flags.writeable:
> AttributeError: 'Categorical' object has no attribute 'flags'
> {code}
> Works as expected when you don't change have the categorical:
> {code}
> >>> df = pd.DataFrame({'a': [1, 2, 3], 'b': [1, 2, 3]})
> >>> df = df.set_index('a')
> >>> tbl = pa.Table.from_pandas(df)
> >>> tbl.to_pandas()
>    b
> a
> 1  1
> 2  2
> 3  3
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to