Paramvir109 opened a new pull request, #5845:
URL: https://github.com/apache/hive/pull/5845

   
   ### What changes were proposed in this pull request?
   
   Need to properly propagate the exception which can occur while parallel 
loading of fast Hash tables by the threads.
   In the current scenario the thread is silently dying causing 
incorrect(lesser number)of HT entries to be reported which is giving incorrect 
results
   
   ### Why are the changes needed?
   It fixes bug caused by https://issues.apache.org/jira/browse/HIVE-25149 that 
hides actual exceptions and gives incorrect results.  Example query is added in 
the jira : https://issues.apache.org/jira/browse/HIVE-28735
   
   ### Is the change a dependency upgrade?
   No
   
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   ### How was this patch tested?
   Tested using the sample query. 
   Hive 3 
   ```
   Caused by: java.lang.OutOfMemoryError: Java heap space
       at 
org.apache.hadoop.hive.ql.exec.vector.mapjoin.fast.VectorMapJoinFastLongHashTable.expandAndRehash(VectorMapJoinFastLongHashTable.java:166)
       at 
org.apache.hadoop.hive.ql.exec.vector.mapjoin.fast.VectorMapJoinFastLongHashTable.add(VectorMapJoinFastLongHashTable.java:100)
       at 
org.apache.hadoop.hive.ql.exec.vector.mapjoin.fast.VectorMapJoinFastLongHashTable.adaptPutRow(VectorMapJoinFastLongHashTable.java:91)
       at 
org.apache.hadoop.hive.ql.exec.vector.mapjoin.fast.VectorMapJoinFastLongHashMap.putRow(VectorMapJoinFastLongHashMap.java:147)
       at 
org.apache.hadoop.hive.ql.exec.vector.mapjoin.fast.VectorMapJoinFastTableContainer.putRow(VectorMapJoinFastTableContainer.java:184)
       at 
org.apache.hadoop.hive.ql.exec.vector.mapjoin.fast.VectorMapJoinFastHashTableLoader.load(VectorMapJoinFastHashTableLoader.java:130)
       at 
org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTableInternal(MapJoinOperator.java:388)
       at 
org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable(MapJoinOperator.java:457)
       at 
org.apache.hadoop.hive.ql.exec.MapJoinOperator.lambda$initializeOp$0(MapJoinOperator.java:241)
       ... 3 more
   ```
   
   Before the fix : Wrong results on each run
   After the fix : Getting same OOM exception as in hive 3
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org
For additional commands, e-mail: gitbox-h...@hive.apache.org

Reply via email to