Tor Eivind McKenzie-Syvertsen created ARROW-13681:
-----------------------------------------------------
Summary: pyarow.compute.list_parent_indices only computes for
first chunk
Key: ARROW-13681
URL: https://issues.apache.org/jira/browse/ARROW-13681
Project: Apache Arrow
Issue Type: Bug
Reporter: Tor Eivind McKenzie-Syvertsen
I came across this issue due to very unexpected behaviour from the "explode"
function obtained here:
https://issues.apache.org/jira/browse/ARROW-12099
indices = pc.list_parent_indices(table[col_name])
if table[column] in this example contains several chunks, the indices will look
perfectly fine for that chunk, but erratic and unexpected results for second
chunk.
No warning or info was given either
A workaround that solved the problem for me is:
{code:java}
indices = pc.list_parent_indices(table.combine_chunks()[col_name])
{code}
The behaviour then changes dramatically.
I'm assuming this isnt expected and should be fixed?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)