[ https://issues.apache.org/jira/browse/ARROW-1900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16336732#comment-16336732 ]
Wes McKinney commented on ARROW-1900:
-------------------------------------
No. With int8 and uint8, the range is always small (at most 256 distinct
values), so there is no need to examine the data. With int32 or int64 data, if
the range of the data is small, you can avoid a normal hash table scan. You
need access to the whole array at once to do this, so it may not be practical
in Arrow in general.
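
To make the "small range" idea concrete, here is a minimal sketch that counts
distinct values. It assumes the inclusive bounds are already known. The names
CountDistinct and kMaxDenseRange are hypothetical and not part of Arrow, the
threshold is an arbitrary illustrative choice, and nulls are not handled.

```cpp
#include <cstddef>
#include <cstdint>
#include <unordered_set>
#include <vector>

// Hypothetical threshold (an assumption, not an Arrow constant): ranges
// narrower than this use a dense table instead of a hash table.
constexpr uint64_t kMaxDenseRange = 1 << 16;

// Hypothetical helper, not an Arrow API. Counts distinct values, given
// inclusive bounds min_value <= v <= max_value for every v in `values`.
inline size_t CountDistinct(const std::vector<int64_t>& values,
                            int64_t min_value, int64_t max_value) {
  if (values.empty()) return 0;
  // Unsigned subtraction avoids signed overflow when the bounds are far apart.
  const uint64_t range =
      static_cast<uint64_t>(max_value) - static_cast<uint64_t>(min_value);
  if (range < kMaxDenseRange) {
    // Small range: a dense table indexed by (value - min) replaces hashing.
    std::vector<uint8_t> seen(static_cast<size_t>(range) + 1, 0);
    size_t distinct = 0;
    for (const int64_t v : values) {
      const uint64_t slot =
          static_cast<uint64_t>(v) - static_cast<uint64_t>(min_value);
      if (!seen[slot]) {
        seen[slot] = 1;
        ++distinct;
      }
    }
    return distinct;
  }
  // Large range: fall back to an ordinary hash set.
  const std::unordered_set<int64_t> unique(values.begin(), values.end());
  return unique.size();
}
```

The dense path only works when the bounds cover the whole array, which is why
the technique needs access to all of the data before it can choose a strategy.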
> [C++] Add utility functions for determining value range (maximum and minimum)
> of integer arrays
> -----------------------------------------------------------------------------------------------
>
> Key: ARROW-1900
> URL: https://issues.apache.org/jira/browse/ARROW-1900
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
>
> These functions don't need to be kernels right away; they are useful
> internally for determining when a "small range" alternative to a hash table
> can be used for integer arrays. The maximum and minimum are determined in a
> single scan.
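
A single-scan minimum/maximum utility of the kind described in the issue might
look roughly like the sketch below. GetMinMax is a hypothetical name and
signature, not an existing Arrow function, and the sketch assumes a plain
contiguous buffer with no null (validity) bitmap.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical helper, not an Arrow API. Writes the minimum and maximum of
// `values` to the output pointers in one pass over the data.
// Returns false (leaving the outputs untouched) when the input is empty.
template <typename T>
bool GetMinMax(const std::vector<T>& values, T* min_out, T* max_out) {
  if (values.empty()) return false;
  T lo = values[0];
  T hi = values[0];
  // Single scan: both bounds are updated for each element.
  for (const T v : values) {
    lo = std::min(lo, v);
    hi = std::max(hi, v);
  }
  *min_out = lo;
  *max_out = hi;
  return true;
}
```

A caller would compare the resulting bounds against some threshold to decide
whether a "small range" path is worthwhile.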