[
https://issues.apache.org/jira/browse/HIVE-17209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16108161#comment-16108161
]
Rajesh Balamohan commented on HIVE-17209:
-----------------------------------------
Thanks [~sershe]. This would also need a fix in ORC
(https://github.com/apache/orc/blob/master/java/core/src/java/org/apache/orc/impl/RecordReaderImpl.java#L554).
Since {{getLiteralList()}} would return an empty list, the reader needs to
check for an empty list before indexing into it, to avoid an
IndexOutOfBoundsException.
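A minimal sketch of the kind of guard meant here, in Java. Every name in it ({{LiteralListGuard}}, {{Outcome}}, {{evaluateIn}}) is hypothetical and this is not the actual ORC code; it only illustrates checking for an empty literal list before reading from it and falling back to an "unknown" result.

```java
import java.util.List;

// Hypothetical illustration only; not the ORC RecordReaderImpl code.
class LiteralListGuard {
    /** Result of comparing a column's min/max range against a list of literals. */
    enum Outcome { MATCH, NO_MATCH, UNKNOWN }

    /**
     * Returns MATCH if any literal falls inside [min, max], NO_MATCH if none do,
     * and UNKNOWN if the literal list is missing or empty.
     */
    static Outcome evaluateIn(long min, long max, List<Long> literals) {
        // Guard: an empty list means the literals were not available, so the
        // range cannot be ruled out. Report UNKNOWN instead of indexing into it.
        if (literals == null || literals.isEmpty()) {
            return Outcome.UNKNOWN;
        }
        for (long value : literals) {
            if (value >= min && value <= max) {
                return Outcome.MATCH;
            }
        }
        return Outcome.NO_MATCH;
    }
}
```

With a guard like this the caller can treat UNKNOWN as "do not skip the stripe" without relying on a caught exception.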
> ObjectCacheFactory should return null when tez shared object registry is not
> setup
> ----------------------------------------------------------------------------------
>
> Key: HIVE-17209
> URL: https://issues.apache.org/jira/browse/HIVE-17209
> Project: Hive
> Issue Type: Bug
> Reporter: Rajesh Balamohan
> Assignee: Rajesh Balamohan
> Priority: Minor
> Attachments: HIVE-17209.1.patch
>
>
> HIVE-15269 introduced the dynamic min/max bloom filter
> ("hive.tez.dynamic.semijoin.reduction=true"). This needs to access the
> ObjectCache, and in Tez the ObjectCache can only be created by {{TezProcessor}}.
> In the following case, {{AM --> splits -->
> OrcInputFormat.pickStripes::evaluatePredicateMinMax -->
> DynamicValue.getLiteral --> objectCache access}}, the AM ends up throwing many
> NPEs because it has not created the ObjectCache.
> The ORC reader catches these exceptions, skips PPD, and proceeds. For example,
> in Q95 it ends up throwing ~30,000 NPEs before completing split information.
> ObjectCacheFactory should return null when the Tez shared object registry is
> not set up.
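The behaviour asked for in the description could look roughly like the following Java sketch. It is self-contained and does not use Hive or Tez classes; {{ObjectCacheSketch}}, {{Registry}}, {{getCache}}, and {{lookupOrNull}} are hypothetical names chosen for illustration. The assumption is that a process which never set up the registry (the AM in the case above) gets null from the factory and the caller checks for it, rather than hitting an NPE.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; not Hive's ObjectCacheFactory.
class ObjectCacheSketch {
    /** Stand-in for a per-process shared object registry. */
    static final class Registry {
        private final Map<String, Object> entries = new HashMap<>();
        Object get(String key) { return entries.get(key); }
        void put(String key, Object value) { entries.put(key, value); }
    }

    // Stays null until some processor-like component sets it up.
    private static Registry registry;

    static void setRegistry(Registry r) { registry = r; }

    /** Returns the registry-backed cache, or null if no registry was set up. */
    static Registry getCache() { return registry; }

    /** Caller side: a missing cache means "value unknown", not an exception. */
    static Object lookupOrNull(String key) {
        Registry cache = getCache();
        if (cache == null) {
            return null;
        }
        return cache.get(key);
    }
}
```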