[
https://issues.apache.org/jira/browse/HIVE-25142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Panagiotis Garefalakis updated HIVE-25142:
------------------------------------------
Fix Version/s: 4.0.0
> Rehashing in map join fast hash table causing corruption for large keys
> ------------------------------------------------------------------------
>
> Key: HIVE-25142
> URL: https://issues.apache.org/jira/browse/HIVE-25142
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Reporter: mahesh kumar behera
> Assignee: mahesh kumar behera
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> In map join the hash table is created using the keys. To support rehashing,
> the keys are stored in write buffer. The hash table contains the offset of
> the keys along with the hash code. When rehashing is done, the offset is
> extracted from the hash table and then hash code is generated again. For
> large keys of size greater than 255, the key length is also stored along with
> the key. In case of fast hash table implementation the way key is extracted
> is not proper. There is a code bug and thats causing the wrong key to be
> extracted and causing wrong hash code generation. This is causing the
> corruption in the hash table.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)