[
https://issues.apache.org/jira/browse/ARROW-8122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney resolved ARROW-8122.
---------------------------------
Resolution: Fixed
Issue resolved by pull request 6624
[https://github.com/apache/arrow/pull/6624]
> [Python] Empty numpy arrays with shape cannot be deserialized
> -------------------------------------------------------------
>
> Key: ARROW-8122
> URL: https://issues.apache.org/jira/browse/ARROW-8122
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.16.0
> Reporter: Wenjun Si
> Assignee: Wenjun Si
> Priority: Major
> Labels: pull-request-available, serialization
> Fix For: 0.17.0
>
> Time Spent: 1h
> Remaining Estimate: 0h
>
> In PyArrow 0.16.0, deserializing a serialized empty NumPy array that has a
> multi-dimensional shape, for instance np.array([[], []]) (shape (2, 0)),
> raises ArrowInvalid.
> Code reproducing this error:
> {code:python}
> import numpy as np
> import pyarrow
> arr = np.array([[], []])
> pyarrow.deserialize(pyarrow.serialize(arr).to_buffer())  # raises ArrowInvalid in 0.16.0
> {code}
> The resulting traceback is:
> {code:python}
> Traceback (most recent call last):
>   File "/Users/wenjun/miniconda3/lib/python3.7/site-packages/IPython/core/interactiveshell.py", line 3326, in run_code
>     exec(code_obj, self.user_global_ns, self.user_ns)
>   File "<ipython-input-4-0ace9226dd72>", line 1, in <module>
>     pyarrow.deserialize(pyarrow.serialize(arr).to_buffer())
>   File "pyarrow/serialization.pxi", line 476, in pyarrow.lib.deserialize
>   File "pyarrow/serialization.pxi", line 438, in pyarrow.lib.deserialize_from
>   File "pyarrow/serialization.pxi", line 414, in pyarrow.lib.read_serialized
>   File "pyarrow/error.pxi", line 84, in pyarrow.lib.check_status
> pyarrow.lib.ArrowInvalid: strides must not involve buffer over run
> {code}
> The same code works in PyArrow 0.15.x.
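
The error message suggests a stride bounds check that rejects arrays with a zero-length dimension. The sketch below is one plausible way such a check can misfire. It is not taken from the Arrow source or from pull request 6624. The function names, the strides (8, 8), and the 8-byte item size are all assumptions chosen for illustration.

```python
def naive_strides_fit(shape, strides, itemsize, buffer_size):
    """Hypothetical check: does the farthest element end inside the buffer?"""
    # Byte offset of the element at index (dim - 1) along every axis.
    last_offset = sum((dim - 1) * stride for dim, stride in zip(shape, strides))
    return last_offset + itemsize <= buffer_size


def strides_fit(shape, strides, itemsize, buffer_size):
    """Same check, but an array with a zero-length dimension has nothing to read."""
    if any(dim == 0 for dim in shape):
        return True
    return naive_strides_fit(shape, strides, itemsize, buffer_size)


# Illustrative values only: shape (2, 0) as in np.array([[], []]),
# assumed strides (8, 8), 8-byte items, and an empty 0-byte buffer.
print(naive_strides_fit((2, 0), (8, 8), 8, 0))  # the naive check rejects the empty array
print(strides_fit((2, 0), (8, 8), 8, 0))        # the guarded check accepts it
```

Under these assumptions the naive version reports an overrun for an array that holds no elements, while the guarded version still rejects a non-empty array whose buffer is too small.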