[
https://issues.apache.org/jira/browse/HIVE-7186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alex Nastetsky updated HIVE-7186:
---------------------------------
Environment: Hortonworks Data Platform 2.0.6.0 (was: Hortonworks Data
Platform 2.0)
> Unable to perform join on table
> -------------------------------
>
> Key: HIVE-7186
> URL: https://issues.apache.org/jira/browse/HIVE-7186
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.12.0
> Environment: Hortonworks Data Platform 2.0.6.0
> Reporter: Alex Nastetsky
>
> Occasionally, a table will start exhibiting behavior that will prevent it
> from being used in a JOIN.
> When doing a map join, it will just stall at "Starting to launch local task
> to process map join; ".
> When doing a regular join, it will make progress but then error out with a
> IndexOutOfBoundsException:
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException:
> java.lang.IndexOutOfBoundsException
> at
> org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.processOp(ReduceSinkOperator.java:365)
> at org.apache.hadoop.hive.ql.exec.Operator.process(Operator.java:504)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:842)
> at
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:91)
> at org.apache.hadoop.hive.ql.exec.Operator.process(Operator.java:504)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:842)
> at
> org.apache.hadoop.hive.ql.exec.MapOperator.process(MapOperator.java:534)
> ... 9 more
> Caused by: java.lang.IndexOutOfBoundsException
> at java.nio.Buffer.checkIndex(Buffer.java:532)
> at
> java.nio.ByteBufferAsIntBufferL.put(ByteBufferAsIntBufferL.java:131)
> at
> org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:1153)
> at
> org.apache.hadoop.mapred.MapTask$OldOutputCollector.collect(MapTask.java:586)
> at
> org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.collect(ReduceSinkOperator.java:372)
> at
> org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.processOp(ReduceSinkOperator.java:334)
> ... 15 more
>
> Doing simple selects against this table work fine and do not show any
> apparent problems with the data.
> Assume that the table in question is called tableA and was created by queryA.
> Doing either of the following has helped resolve the issue in the past.
> 1) create table tableB as select * from tableA;
> Then just use tableB instead in the JOIN.
> 2) regenerate tableA using queryA
> Then use tableA in the JOIN again. It usually works the second time.
>
> When doing a "describe formatted" on the tables, the totalSize will be
> different between the original tableA and tableB, and sometimes (but not
> always) between the original tableA and the regenerated tableA. The numRows
> will be the same across all versions of the tables.
> This problem can not be reproduced consistently, but the issue always happens
> when we try to use an affected table in a JOIN.
--
This message was sent by Atlassian JIRA
(v6.2#6252)