[
https://issues.apache.org/jira/browse/HIVE-17209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated HIVE-17209:
------------------------------------
Resolution: Fixed
Hadoop Flags: Reviewed
Fix Version/s: 3.0.0
Status: Resolved (was: Patch Available)
Created ORC-221 for orc related change and it got committed as well.
Thanks [~sershe]. Committed this patch to master.
> ObjectCacheFactory should return null when tez shared object registry is not
> setup
> ----------------------------------------------------------------------------------
>
> Key: HIVE-17209
> URL: https://issues.apache.org/jira/browse/HIVE-17209
> Project: Hive
> Issue Type: Bug
> Reporter: Rajesh Balamohan
> Assignee: Rajesh Balamohan
> Priority: Minor
> Fix For: 3.0.0
>
> Attachments: HIVE-17209.1.patch
>
>
> HIVE-15269 introduced dynamic min/max bloom filter
> ("hive.tez.dynamic.semijoin.reduction=true"). This needs to access
> ObjectCache and in tez, ObjectCache can only be created by {{TezProcessor}}.
> In the following case {{AM --> splits -->
> OrcInputFormat.pickStripes::evaluatePredicateMinMax -->
> DynamicValue.getLiteral --> objectCache access}}, AM ends up throwing lots of
> NPE since AM has not created ObjectCache.
> Orc reader catches these exceptions, skips PPD and proceeds further. For e.g,
> in Q95 it ends up throwing ~30,000 NPE before completing split information.
> ObjectCacheFactory should return null when tez shared object registry is not
> setup.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)