[
https://issues.apache.org/jira/browse/ARROW-6853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ji Liu updated ARROW-6853:
--------------------------
Description:
The Hasher interface was introduced in ARROW-5898 and now has two
implementations ({{MurmurHasher}} and {{SimpleHasher}}); more may be added in
the future.
Currently, {{ValueVector#hashCode}} and {{DictionaryHashTable}} use only
{{SimpleHasher}} to calculate hash codes. This issue enables them to use a
different hasher, or even a user-defined one, for specific use cases.
was:
The Hasher interface was introduced in ARROW-5898 and now has two
implementations ({{MurmurHasher and }}{{SimpleHasher}}); more may be added in
the future.
Currently, {{ValueVector#hashCode}} and {{DictionaryHashTable}} use only
{{SimpleHasher}} to calculate hash codes. This issue enables them to use a
different hasher, or even a user-defined one, for specific use cases.
> [Java] Support vector and dictionary encoder use different hasher for
> calculating hashCode
> ------------------------------------------------------------------------------------------
>
> Key: ARROW-6853
> URL: https://issues.apache.org/jira/browse/ARROW-6853
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Java
> Reporter: Ji Liu
> Assignee: Ji Liu
> Priority: Major
>
> The Hasher interface was introduced in ARROW-5898 and now has two
> implementations ({{MurmurHasher}} and {{SimpleHasher}}); more may be added in
> the future.
> Currently, {{ValueVector#hashCode}} and {{DictionaryHashTable}} use only
> {{SimpleHasher}} to calculate hash codes. This issue enables them to use a
> different hasher, or even a user-defined one, for specific use cases.
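
The idea in the description can be illustrated with a minimal, self-contained
sketch. It does not use the Arrow API: {{ByteHasher}}, {{ToyVector}} and
{{ToyDictionary}} are hypothetical stand-ins invented for this example, and the
FNV-1a hasher is only a placeholder for "some other hasher" (it is not
{{MurmurHasher}}). The sketch shows a vector whose hashCode accepts a hasher
argument, and a dictionary lookup table that is built with whichever hasher the
caller supplies.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class HasherSketch {

  /** Hypothetical pluggable hasher; NOT Arrow's actual interface. */
  public interface ByteHasher {
    int hash(byte[] data);
  }

  /** Simple polynomial hash (h = 31 * h + b), standing in for a default hasher. */
  public static final ByteHasher SIMPLE = data -> {
    int h = 0;
    for (byte b : data) {
      h = 31 * h + b;
    }
    return h;
  };

  /** 32-bit FNV-1a, standing in for an alternative or user-defined hasher. */
  public static final ByteHasher FNV1A = data -> {
    int h = 0x811C9DC5;            // FNV offset basis
    for (byte b : data) {
      h ^= (b & 0xff);
      h *= 0x01000193;             // FNV prime; int overflow wraps as intended
    }
    return h;
  };

  /** Toy variable-width vector whose hashCode takes the hasher as a parameter. */
  public static final class ToyVector {
    private final List<byte[]> values = new ArrayList<>();

    public void add(String s) {
      values.add(s.getBytes(StandardCharsets.UTF_8));
    }

    public int size() {
      return values.size();
    }

    public byte[] get(int index) {
      return values.get(index);
    }

    /** Caller chooses the hasher. */
    public int hashCode(int index, ByteHasher hasher) {
      return hasher.hash(values.get(index));
    }

    /** Previous behaviour: always the default hasher. */
    public int hashCode(int index) {
      return hashCode(index, SIMPLE);
    }
  }

  /** Toy dictionary lookup table that buckets entries with the supplied hasher. */
  public static final class ToyDictionary {
    private final ToyVector dictionary;
    private final ByteHasher hasher;
    private final Map<Integer, List<Integer>> buckets = new HashMap<>();

    public ToyDictionary(ToyVector dictionary, ByteHasher hasher) {
      this.dictionary = dictionary;
      this.hasher = hasher;
      for (int i = 0; i < dictionary.size(); i++) {
        buckets.computeIfAbsent(dictionary.hashCode(i, hasher), k -> new ArrayList<>()).add(i);
      }
    }

    /** Returns the dictionary index of the value, or -1 if it is absent. */
    public int indexOf(String value) {
      byte[] key = value.getBytes(StandardCharsets.UTF_8);
      for (int i : buckets.getOrDefault(hasher.hash(key), Collections.emptyList())) {
        if (Arrays.equals(dictionary.get(i), key)) {
          return i;
        }
      }
      return -1;
    }
  }

  public static void main(String[] args) {
    ToyVector dict = new ToyVector();
    dict.add("foo");
    dict.add("bar");
    // The hasher changes the hash values, not the lookup result.
    System.out.println(new ToyDictionary(dict, SIMPLE).indexOf("bar"));
    System.out.println(new ToyDictionary(dict, FNV1A).indexOf("bar"));
  }
}
```

Passing the hasher as an argument keeps the default path unchanged for existing
callers while letting others trade hashing speed against collision behaviour.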