Augusto Radtke created ARROW-2722:
-------------------------------------
Summary: ndarray to arrow conversion fails when downcasted from
pandas to_numeric
Key: ARROW-2722
URL: https://issues.apache.org/jira/browse/ARROW-2722
Project: Apache Arrow
Issue Type: Bug
Components: C++, Python
Affects Versions: 0.9.0
Environment: Windows 10 64-bit
Reporter: Augusto Radtke
The following snippet:
{code:java}
import numpy as np
import pandas as pd
import pyarrow as pa
pa.array(pd.to_numeric(pd.Series(np.array([65536,2,3], dtype=np.uint64)),
downcast='unsigned'),
from_pandas=True, type='uint32')
{code}
fails to convert with message:
{noformat}
ArrowNotImplementedError Traceback (most recent call last)
<ipython-input-2-b259c5cb7044> in <module>()
4
5 pa.array(pd.to_numeric(pd.Series(np.array([65536,2,3], dtype=np.uint64)),
downcast='unsigned'),
----> 6 from_pandas=True, type='uint32')
array.pxi in pyarrow.lib.array()
array.pxi in pyarrow.lib._ndarray_to_array()
error.pxi in pyarrow.lib.check_status()
ArrowNotImplementedError: Unsupported numpy type 6{noformat}
This is a Windows 64-bit machine, running Python 3.6.5, pyarrow 0.9.0, pandas
0.23.1 and numpy 1.14.5.
Seems to be fine for uint16 or uint8 downcasting. Unfortunately I didn't had
the time to dig deeper or try on a Linux machine but it feels like its related
to the LLP64 model.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)