Steve M. Kim created ARROW-9594:
-----------------------------------

             Summary: [Python] DictionaryArray.to_numpy does not correctly 
convert null indexes to null values
                 Key: ARROW-9594
                 URL: https://issues.apache.org/jira/browse/ARROW-9594
             Project: Apache Arrow
          Issue Type: Bug
    Affects Versions: 1.0.0
            Reporter: Steve M. Kim


Example
{code:java}

 >>> a = pa.DictionaryArray.from_arrays(pa.array([0, 1, None, 0], 
 >>> type=pa.int32()), pa.array(['foo', 'bar']))
>>> a
<pyarrow.lib.DictionaryArray object at 0x7f12fc94ccf0>-- dictionary:
  [
    "foo",
    "bar"
  ]
-- indices:
  [
    0,
    1,
    null,
    0
  ]
>>> a.to_pandas()  # this works
0    foo
1    bar
2    NaN
3    foo
dtype: category
Categories (2, object): [foo, bar]
>>> a.to_numpy(zero_copy_only=False)  # this is broken
array(['foo', 'bar', 'bar', 'foo'], dtype=object)

{code}
 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to