[
https://issues.apache.org/jira/browse/ARROW-14686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-14686:
-----------------------------------
Labels: pull-request-available (was: )
> [Python] Current endian decision for numpy does not work for built-in data
> type.
> --------------------------------------------------------------------------------
>
> Key: ARROW-14686
> URL: https://issues.apache.org/jira/browse/ARROW-14686
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Affects Versions: 7.0.0
> Reporter: Kazuaki Ishizaki
> Assignee: Kazuaki Ishizaki
> Priority: Major
> Labels: pull-request-available
> Fix For: 7.0.0
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Although the current NumPy converter code tries to detect endian against a
> given object, it does not work for a built-in object. This is because a
> build-in object may have {{native |}} byte order.
> This issue causes test failure as follows:
> {code:java}
> FAILED pyarrow/tests/test_array.py::test_array_from_numpy_unicode -
> UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3:
> code point not in range(0x110000)
> FAILED pyarrow/tests/test_array.py::test_array_from_strided -
> UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3:
> code point not in range(0x110000)
> FAILED pyarrow/tests/test_array.py::test_array_from_numpy_str_utf8 -
> UnicodeDecodeError: 'utf-32-le' codec can't decode bytes in position 0-3:
> code point not in range(0x110000) {code}
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)