[ 
https://issues.apache.org/jira/browse/ARROW-6872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-6872:
--------------------------------
    Fix Version/s: 1.0.0

> [C++][Python] Empty table with dictionary-columns raises 
> ArrowNotImplementedError
> ---------------------------------------------------------------------------------
>
>                 Key: ARROW-6872
>                 URL: https://issues.apache.org/jira/browse/ARROW-6872
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, Python
>    Affects Versions: 0.15.0
>            Reporter: Marco Neumann
>            Priority: Minor
>             Fix For: 1.0.0
>
>
> h2. Abstract
> As a pyarrow user, I would expect that I can create an empty table out of 
> every schema that I created via pandas. This does not work for dictionary 
> types (e.g. {{"category"}} dtypes).
> h2. Test Case
> This code:
> {code:python}
> import pandas as pd
> import pyarrow as pa
> df = pd.DataFrame({"x": pd.Series(["x", "y"], dtype="category")})
> table = pa.Table.from_pandas(df)
> schema = table.schema
> table_empty = schema.empty_table()  # boom
> {code}
> produces this exception:
> {noformat}
> Traceback (most recent call last):
>   File "arrow_bug.py", line 8, in <module>
>     table_empty = schema.empty_table()
>   File "pyarrow/types.pxi", line 860, in __iter__
>   File "pyarrow/array.pxi", line 211, in pyarrow.lib.array
>   File "pyarrow/array.pxi", line 36, in pyarrow.lib._sequence_to_array
>   File "pyarrow/error.pxi", line 86, in pyarrow.lib.check_status
> pyarrow.lib.ArrowNotImplementedError: Sequence converter for type 
> dictionary<values=string, indices=int8, ordered=0> not implemented
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to