Tor Eivind McKenzie-Syvertsen created ARROW-13681:
-----------------------------------------------------

             Summary:  pyarrow.compute.list_parent_indices only computes for 
first chunk
                 Key: ARROW-13681
                 URL: https://issues.apache.org/jira/browse/ARROW-13681
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: Tor Eivind McKenzie-Syvertsen


I came across this issue because of very unexpected behaviour from the "explode" 
function posted here:
https://issues.apache.org/jira/browse/ARROW-12099
The relevant call is:
indices = pc.list_parent_indices(table[col_name])

If table[col_name] in this example contains several chunks, the indices look 
perfectly fine for the first chunk, but the results for the second chunk are 
erratic and unexpected.
No warning or other message is given either.

A workaround that solved the problem for me is:
{code:python}
indices = pc.list_parent_indices(table.combine_chunks()[col_name])
{code}
The result then changes dramatically: with the chunks combined, the indices 
come out as expected.
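
As a rough illustration of the expected result, here is a plain-Python sketch. It makes no pyarrow calls; the helper name expected_parent_indices and the sample data are made up for this example, and chunks are modelled as ordinary lists of rows. The assumption is that each flattened value should map to the index of its parent row counted across all chunks, so the answer should not depend on how the column happens to be chunked.

```python
# Plain-Python model of a chunked list column; no pyarrow calls are made here.
# Each chunk is a list of rows, and each row is a list of values.

def expected_parent_indices(chunks):
    """Return, for every flattened value, the index of its parent row,
    counting rows across all chunks (hypothetical helper)."""
    indices = []
    row_offset = 0  # number of rows seen in earlier chunks
    for chunk in chunks:
        for local_row, values in enumerate(chunk):
            # Every value in this row points back to the same global row index.
            indices.extend([row_offset + local_row] * len(values))
        row_offset += len(chunk)
    return indices


chunks = [
    [[1, 2], [3]],      # chunk 0: global rows 0 and 1
    [[4], [], [5, 6]],  # chunk 1: global rows 2, 3 and 4
]
# The same rows stored as a single chunk, analogous to combine_chunks().
combined = [row for chunk in chunks for row in chunk]

chunked_result = expected_parent_indices(chunks)
combined_result = expected_parent_indices([combined])
print(chunked_result)   # [0, 0, 1, 2, 4, 4]
print(combined_result)  # [0, 0, 1, 2, 4, 4]
```

In this model the chunked and combined layouts give identical indices, which is the behaviour I would expect from pc.list_parent_indices on a multi-chunk column as well.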

I'm assuming this isn't expected and should be fixed?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
