[
https://issues.apache.org/jira/browse/HIVE-16151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15957494#comment-15957494
]
Gopal V edited comment on HIVE-16151 at 4/5/17 7:25 PM:
--------------------------------------------------------
This came up as a ~4% performance loss in an extra null check, but does allow
for larger hash tables.
LGTM - +1.
was (Author: gopalv):
This came up as a ~4% performance in an extra null check, but does allow for
larger hash tables.
LGTM - +1.
> BytesBytesHashTable allocates large arrays
> ------------------------------------------
>
> Key: HIVE-16151
> URL: https://issues.apache.org/jira/browse/HIVE-16151
> Project: Hive
> Issue Type: Bug
> Reporter: Prasanth Jayachandran
> Assignee: Sergey Shelukhin
> Attachments: HIVE-16151.patch
>
>
> These arrays cause GC pressure and also impose key count limitations on the
> table. Wrt the latter, we won't be able to get rid of it without a 64-bit
> hash function, but for now we can get rid of the former. If we need the
> latter we'd add murmur64 and probably account for it differently for resize
> (we don't want to blow up the hashtable by 4 bytes/key in the common case
> where #of keys is less than ~1.5B :))
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)