[
https://issues.apache.org/jira/browse/HIVE-10704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14563842#comment-14563842
]
Mostafa Mokhtar commented on HIVE-10704:
----------------------------------------
Table size of 0 is usually an error in the statistics and the actual table can
be arbitrary big, so evenly dividing the memory is a much safer bet.
And in the case you mentioned above each hash table will get
hive.auto.convert.join.noconditionaltask.size / 3.
By default this is Container size/3 which is more than enough for smaller
tables.
> Errors in Tez HashTableLoader when estimated table size is 0
> ------------------------------------------------------------
>
> Key: HIVE-10704
> URL: https://issues.apache.org/jira/browse/HIVE-10704
> Project: Hive
> Issue Type: Bug
> Components: Query Processor
> Reporter: Jason Dere
> Assignee: Mostafa Mokhtar
> Fix For: 1.2.1
>
> Attachments: HIVE-10704.1.patch, HIVE-10704.2.patch,
> HIVE-10704.3.patch
>
>
> Couple of issues:
> - If the table sizes in MapJoinOperator.getParentDataSizes() are 0 for all
> tables, the largest small table selection is wrong and could select the large
> table (which results in NPE)
> - The memory estimates can either divide-by-zero, or allocate 0 memory if the
> table size is 0. Try to come up with a sensible default for this.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)