[ 
https://issues.apache.org/jira/browse/ARROW-6853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ji Liu updated ARROW-6853:
--------------------------
    Description: 
Hasher interface was introduce in ARROW-5898 and now have two different 
implementations ({{MurmurHasher and SimpleHasher}}) and it could be more in the 
future.

And currently {{ValueVector#hashCode}} and {{DictionaryHashTable}} only use 
{{SimpleHasher}} for calculating hashCode. This issue enables them to use 
different hasher or even user-defined hasher for their own use cases.

  was:
Hasher interface was introduce in ARROW-5898 and now have two different 
implementations ({{MurmurHasher and }}{{SimpleHasher}}) and it could be more in 
the future.

And currently {{ValueVector#hashCode}} and {{DictionaryHashTable}} only use 
{{SimpleHasher}} for calculating hashCode. This issue enables them to use 
different hasher or even user-defined hasher for their own use cases.


> [Java] Support vector and dictionary encoder use different hasher for 
> calculating hashCode
> ------------------------------------------------------------------------------------------
>
>                 Key: ARROW-6853
>                 URL: https://issues.apache.org/jira/browse/ARROW-6853
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Java
>            Reporter: Ji Liu
>            Assignee: Ji Liu
>            Priority: Major
>
> Hasher interface was introduce in ARROW-5898 and now have two different 
> implementations ({{MurmurHasher and SimpleHasher}}) and it could be more in 
> the future.
> And currently {{ValueVector#hashCode}} and {{DictionaryHashTable}} only use 
> {{SimpleHasher}} for calculating hashCode. This issue enables them to use 
> different hasher or even user-defined hasher for their own use cases.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to