[ https://issues.apache.org/jira/browse/ARROW-6853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ji Liu updated ARROW-6853: -------------------------- Description: Hasher interface was introduce in ARROW-5898 and now have two different implementations ({{MurmurHasher and SimpleHasher}}) and it could be more in the future. And currently {{ValueVector#hashCode}} and {{DictionaryHashTable}} only use {{SimpleHasher}} for calculating hashCode. This issue enables them to use different hasher or even user-defined hasher for their own use cases. was: Hasher interface was introduce in ARROW-5898 and now have two different implementations ({{MurmurHasher and }}{{SimpleHasher}}) and it could be more in the future. And currently {{ValueVector#hashCode}} and {{DictionaryHashTable}} only use {{SimpleHasher}} for calculating hashCode. This issue enables them to use different hasher or even user-defined hasher for their own use cases. > [Java] Support vector and dictionary encoder use different hasher for > calculating hashCode > ------------------------------------------------------------------------------------------ > > Key: ARROW-6853 > URL: https://issues.apache.org/jira/browse/ARROW-6853 > Project: Apache Arrow > Issue Type: New Feature > Components: Java > Reporter: Ji Liu > Assignee: Ji Liu > Priority: Major > > Hasher interface was introduce in ARROW-5898 and now have two different > implementations ({{MurmurHasher and SimpleHasher}}) and it could be more in > the future. > And currently {{ValueVector#hashCode}} and {{DictionaryHashTable}} only use > {{SimpleHasher}} for calculating hashCode. This issue enables them to use > different hasher or even user-defined hasher for their own use cases. -- This message was sent by Atlassian Jira (v8.3.4#803005)