[
https://issues.apache.org/jira/browse/ARROW-16513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17534479#comment-17534479
]
Weston Pace commented on ARROW-16513:
-------------------------------------
Yes, it is. Thanks.
> [C++] Add a compute function to hash inputs
> -------------------------------------------
>
> Key: ARROW-16513
> URL: https://issues.apache.org/jira/browse/ARROW-16513
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++
> Reporter: Weston Pace
> Priority: Major
>
> We have a lot of internal logic for hashing inputs, and it might be nice to
> expose some of this to users (e.g.
> https://stackoverflow.com/questions/72177022/how-to-get-hash-of-string-column-in-polars-or-pyarrow).
> The `HashBatch` method in `key_hash.h` (not quite merged but close) is likely
> to be the most performant. However, it sacrifices some uniqueness of hashes
> for the sake of performance, so we should make sure to document these
> trade-offs.
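
To make the request concrete, here is a minimal sketch of what "hash each element of a string column" could look like. It is not Arrow's `HashBatch` and does not use any Arrow API: it assumes a plain `std::vector<std::string>` as a stand-in for a column and uses 64-bit FNV-1a as a stand-in for a fast, non-cryptographic hash. The names `Fnv1a64` and `HashColumn` are hypothetical and chosen only for illustration.

```cpp
#include <cstdint>
#include <string>
#include <vector>

// 64-bit FNV-1a: a simple, fast, non-cryptographic hash.
// Assumption: this is a stand-in only; it is NOT the algorithm used by
// Arrow's HashBatch.
inline std::uint64_t Fnv1a64(const std::string& value) {
  std::uint64_t hash = 14695981039346656037ULL;  // FNV-1a 64-bit offset basis
  for (unsigned char byte : value) {
    hash ^= byte;               // mix in the next byte
    hash *= 1099511628211ULL;   // multiply by the FNV 64-bit prime
  }
  return hash;
}

// Hypothetical helper: returns one hash per element of a string "column".
// Equal inputs always produce equal hashes; distinct inputs usually, but not
// always, produce distinct hashes.
inline std::vector<std::uint64_t> HashColumn(
    const std::vector<std::string>& column) {
  std::vector<std::uint64_t> hashes;
  hashes.reserve(column.size());
  for (const std::string& value : column) {
    hashes.push_back(Fnv1a64(value));
  }
  return hashes;
}
```

Calling `HashColumn({"x", "y", "x"})` yields three hashes in which the first and third are equal. As with any fixed-width hash, two different inputs can collide, which is the kind of limitation a user-facing hash function would need to document.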