Wes McKinney created ARROW-1741:
-----------------------------------
Summary: [C++] Comparison function for DictionaryArray to
determine if indices are "compatible"
Key: ARROW-1741
URL: https://issues.apache.org/jira/browse/ARROW-1741
Project: Apache Arrow
Issue Type: New Feature
Components: C++
Reporter: Wes McKinney
For example, if one array's dictionary is larger than the other, but the
overlapping beginning portion is the same, then the respective dictionary
indices correspond to the same values. Therefore, in analytics, one may choose
to drop the smaller dictionary in favor of the larger dictionary, and this need
not incur any computational overhead (beyond comparing the dictionary prefixes
-- there may be some way to engineer "dictionary lineage" to make this
comparison even cheaper)
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)