Victor Jimenez created ARROW-1999:
-------------------------------------
Summary: [Python] from_numpy_dtype returns wrong types
Key: ARROW-1999
URL: https://issues.apache.org/jira/browse/ARROW-1999
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.8.0
Environment: Windows 10 Build 15063.850
Numpy: 1.14.0
Reporter: Victor Jimenez
The following code shows multiple issues when using {{from_numpy_dtype}}:
{code}
import numpy as np
import pyarrow as pa
pa.from_numpy_dtype(np.unicode) # returns DataType(bool)
pa.from_numpy_dtype(np.int) # returns DataType(bool)
pa.from_numpy_dtype(np.int64) # Fails with the following message:
#
# ArrowNotImplementedError Traceback (most recent call last)
# <ipython-input-14-ca0855a7dda8> in <module>()
# ----> 1 pa.from_numpy_dtype(np.int64)
# 2
#
# types.pxi in pyarrow.lib.from_numpy_dtype()
#
# error.pxi in pyarrow.lib.check_status()
#
# ArrowNotImplementedError: Unsupported numpy type 32760
{code}
Additionally, a potentially related issue is also seen when using
{{to_pandas_dtype}}:
{code}
pa.DataType.to_pandas_dtype(pa.string()) # Returns numpy.object_
# (shouldn't it be numpy.unicode?)
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)