[
https://issues.apache.org/jira/browse/ARROW-2593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17659619#comment-17659619
]
Rok Mihevc commented on ARROW-2593:
-----------------------------------
This issue has been migrated to [issue
#18978|https://github.com/apache/arrow/issues/18978] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [Python] TypeError: data type "mixed-integer" not understood
> ------------------------------------------------------------
>
> Key: ARROW-2593
> URL: https://issues.apache.org/jira/browse/ARROW-2593
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.9.0
> Reporter: Dima Ryazanov
> Assignee: Krisztian Szucs
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.10.0
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> Pyarrow 0.9 raises an exception when converting some tables to pandas
> dataframes. Earlier versions work fine. Repro steps:
> {{In [1]: import pandas as pd}}
> {{In [2]: import pyarrow as pa}}
> {{In [3]: df = pd.DataFrame(\{'foo': [], 123: []})}}
> {{In [4]: table = pa.Table.from_pandas(df)}}
> {{In [5]: table.to_pandas()}}
> {{---------------------------------------------------------------------------}}
> {{KeyError Traceback (most recent call
> last)}}
> {{~/envs/cli3/lib/python3.6/site-packages/pyarrow/pandas_compat.py in
> _pandas_type_to_numpy_type(pandas_type)}}
> {{ 666 try:}}
> {{--> 667 return _pandas_logical_type_map[pandas_type]}}
> {{ 668 except KeyError:}}
> {{KeyError: 'mixed-integer'}}
> (I ended up with a dataframe with mixed string/integer columns by using
> pd.read_excel(..., skiprows=[0]) - which skipped the header, and treated the
> first line of data as column names.)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)