[
https://issues.apache.org/jira/browse/ARROW-5345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17662366#comment-17662366
]
Rok Mihevc commented on ARROW-5345:
-----------------------------------
This issue has been migrated to [issue
#21804|https://github.com/apache/arrow/issues/21804] on GitHub. Please see the
[migration documentation|https://github.com/apache/arrow/issues/14542] for
further details.
> [C++] Relax Field hashing in DictionaryMemo
> -------------------------------------------
>
> Key: ARROW-5345
> URL: https://issues.apache.org/jira/browse/ARROW-5345
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
>
> Follow up to ARROW-3144
> Currently we associate dictionaries with a hash table mapping a Field's
> memory address to a dictionary id. This poses an issue if two RecordBatches
> are equal (equal field names, equal types) but were instantiated separately.
> We don't have a hash function in C++ for Field so we should consider
> implementing one and using that instead (if it is not too expensive) so that
> same but "different" (different C++ objects) won't blow up in the user's face
> with an unintuitive error (this did in fact occur once in the Python test
> suite, not sure exactly why it wasn't a problem before, I think it worked "by
> accident")
--
This message was sent by Atlassian Jira
(v8.20.10#820010)