[ https://issues.apache.org/jira/browse/ARROW-2141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16384358#comment-16384358 ]
ASF GitHub Bot commented on ARROW-2141: --------------------------------------- BryanCutler opened a new pull request #1689: ARROW-2141: [Python] Support variable length binary conversion from Pandas URL: https://github.com/apache/arrow/pull/1689 Currently, when performing `from_pandas` conversion with binary data and the user specifies the type as variable length binary `pa.binary()` then the type is inferred and a cast from binary to binary is attempted. The casting then fails because the cast kernel does not support binary types. This PR checks if the user specifies a variable length binary type in conversion, and then copies data to a BinaryArray instead of trying to infer the type and then casting. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > [Python] Conversion from Numpy object array to varsize binary unimplemented > --------------------------------------------------------------------------- > > Key: ARROW-2141 > URL: https://issues.apache.org/jira/browse/ARROW-2141 > Project: Apache Arrow > Issue Type: Improvement > Components: Python > Affects Versions: 0.8.0 > Reporter: Antoine Pitrou > Assignee: Bryan Cutler > Priority: Major > Labels: pull-request-available > > {code:python} > >>> arr = np.array([b'xx'], dtype=np.object) > >>> pa.array(arr, type=pa.binary(2)) > <pyarrow.lib.FixedSizeBinaryArray object at 0x7fe1ecaefa98> > [ > b'xx' > ] > >>> pa.array(arr, type=pa.binary()) > Traceback (most recent call last): > File "<ipython-input-12-e40948b94b33>", line 1, in <module> > pa.array(arr, type=pa.binary()) > File "array.pxi", line 177, in pyarrow.lib.array > File "error.pxi", line 77, in pyarrow.lib.check_status > File "error.pxi", line 85, in pyarrow.lib.check_status > ArrowNotImplementedError: > /home/antoine/arrow/cpp/src/arrow/python/numpy_to_arrow.cc:1585 code: > converter.Convert() > /home/antoine/arrow/cpp/src/arrow/python/numpy_to_arrow.cc:1098 code: > compute::Cast(&context, *arr, type_, options, &casted) > /home/antoine/arrow/cpp/src/arrow/compute/kernels/cast.cc:1022 code: > Cast(ctx, Datum(array.data()), out_type, options, &datum_out) > /home/antoine/arrow/cpp/src/arrow/compute/kernels/cast.cc:1009 code: > GetCastFunction(*value.type(), out_type, options, &func) > No cast implemented from binary to binary > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)