[ https://issues.apache.org/jira/browse/HIVE-7629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14087366#comment-14087366 ]
Suma Shivaprasad commented on HIVE-7629: ---------------------------------------- The issue is due to DataWritableReadSupport trying to access column indexes which are not present in the table. Have the fix for the same. Will be attaching the patch for it shortly > Problem in SMB Joins between two Parquet tables > ----------------------------------------------- > > Key: HIVE-7629 > URL: https://issues.apache.org/jira/browse/HIVE-7629 > Project: Hive > Issue Type: Bug > Components: Serializers/Deserializers > Affects Versions: 0.13.1 > Reporter: Suma Shivaprasad > > The issue is clearly seen when two bucketed and sorted parquet tables with > different number of columns are involved in the join . The following > exception is seen > Caused by: java.lang.IndexOutOfBoundsException: Index: 2, Size: 2 > at java.util.ArrayList.rangeCheck(ArrayList.java:635) > at java.util.ArrayList.get(ArrayList.java:411) > at > org.apache.hadoop.hive.ql.io.parquet.read.DataWritableReadSupport.init(DataWritableReadSupport.java:101) > at > org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWrapper.getSplit(ParquetRecordReaderWrapper.java:204) > at > org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWrapper.<init>(ParquetRecordReaderWrapper.java:79) > at > org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWrapper.<init>(ParquetRecordReaderWrapper.java:66) > at > org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat.getRecordReader(MapredParquetInputFormat.java:51) > at > org.apache.hadoop.hive.ql.io.CombineHiveRecordReader.<init>(CombineHiveRecordReader.java:65) -- This message was sent by Atlassian JIRA (v6.2#6252)