[
https://issues.apache.org/jira/browse/HIVE-12945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15124659#comment-15124659
]
Gopal V commented on HIVE-12945:
--------------------------------
Ran all the failed test-cases on my laptop and only
tez_dynpart_hashjoin_1.q.out had changes related to JDK8.
Looks like a constant folding miss case though (the BETWEEN should fold the
IS_NULL).
{code}
@@ -695,7 +695,7 @@ STAGE PLANS:
alias: a
Statistics: Num rows: 12288 Data size: 2641964 Basic stats:
COMPLETE Column stats: NONE
Filter Operator
- predicate: ((cbigint is not null and cint BETWEEN 1000000
AND 3000000) and cint is not null) (type: boolean)
+ predicate: ((cbigint is not null and cint is not null) and
cint BETWEEN 1000000 AND 3000000) (type: boolean)
Statistics: Num rows: 6144 Data size: 1320982 Basic stats:
COMPLETE Column stats: NONE
Select Operator
expressions: cint (type: int)
{code}
> Bucket pruning: bucketing for -ve hashcodes have historical issues
> ------------------------------------------------------------------
>
> Key: HIVE-12945
> URL: https://issues.apache.org/jira/browse/HIVE-12945
> Project: Hive
> Issue Type: Bug
> Components: Tez
> Affects Versions: 2.0.0
> Reporter: Gopal V
> Assignee: Gopal V
> Priority: Critical
> Attachments: HIVE-12945.02.patch, HIVE-12945.1.patch
>
>
> The different ETL pathways differed in reducer choice slightly for -ve
> hashcodes.
> {code}
> (hashCode & Integer.MAX_VALUE) % numberOfBuckets;
> !=
> Math.abs(hashCode) % numberOfBuckets
> {code}
> Add a backwards compat option, which can be used to protect against old data
> left over from 0.13.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)