Ged Steponavicius created ARROW-8223:
----------------------------------------
Summary: Schema.from_pandas breaks with pandas nullable integer dtype
Key: ARROW-8223
URL: https://issues.apache.org/jira/browse/ARROW-8223
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.15.1, 0.16.0, 0.15.0
Environment: pyarrow 0.16
Reporter: Ged Steponavicius
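{code:python}
import pandas as pd
import pyarrow as pa

# Build a small frame, then convert the column to the pandas nullable integer dtype
df = pd.DataFrame([{'int_col': 1}, {'int_col': 2}])
df['int_col'] = df['int_col'].astype(pd.Int64Dtype())

# Raises ArrowTypeError on the affected versions
schema = pa.Schema.from_pandas(df)
{code}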
This raises ArrowTypeError: Did not pass numpy.dtype object
However, obtaining the schema through Table.from_pandas works fine:
{code:python}
schema = pa.Table.from_pandas(df).schema
{code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)