[
https://issues.apache.org/jira/browse/ORC-672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Taraka Rama Rao Lethavadla updated ORC-672:
-------------------------------------------
Description:
Scenario:
Orc file contains data(more than 1024 entries) in array<float> and an external
table is created with one of the column as array<decimal(9,2)>. Now
{noformat}
LOAD DATA INPATH '/tmp/test.orc' OVERWRITE INTO TABLE test_orc_table{noformat}
query is failing with below exception
{noformat}
Caused by: java.lang.RuntimeException:
java.lang.ArrayIndexOutOfBoundsException: 1024
at
org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:133)
at
org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:56)
at
org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:361)
... 23 more Caused by:
java.lang.ArrayIndexOutOfBoundsException: 1024 at
org.apache.orc.impl.TreeReaderFactory$TreeReader.nextVector(TreeReaderFactory.java:270)
at
org.apache.orc.impl.TreeReaderFactory$FloatTreeReader.nextVector(TreeReaderFactory.java:667)
at
org.apache.orc.impl.ConvertTreeReaderFactory$DecimalFromFloatTreeReader.nextVector(ConvertTreeReaderFactory.java:1251)
at
org.apache.orc.impl.TreeReaderFactory$ListTreeReader.nextVector(TreeReaderFactory.java:2199)
at
org.apache.orc.impl.TreeReaderFactory$StructTreeReader.nextBatch(TreeReaderFactory.java:2013)
at
org.apache.orc.impl.RecordReaderImpl.nextBatch(RecordReaderImpl.java:1290) at
org.apache.hadoop.hive.ql.io.orc.RecordReaderImpl.nextBatch(RecordReaderImpl.java:156)
at
org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:129)
... 25 more
{noformat}
So fix similar to ORC-598 can be done to other readers like
DecimalFromFloatTreeReader
was:
Scenario:
Orc file contains data(more than 1024 entries) in array<float> and an external
table is created with one of the column as array<decimal(9,2)>. Now LOAD DATA
INPATH '/tmp/test.orc' OVERWRITE INTO TABLE test_orc_table query is failing
with below exception
{noformat}
Caused by: java.lang.RuntimeException:
java.lang.ArrayIndexOutOfBoundsException: 1024
at
org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:133)
at
org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:56)
at
org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:361)
... 23 more Caused by:
java.lang.ArrayIndexOutOfBoundsException: 1024 at
org.apache.orc.impl.TreeReaderFactory$TreeReader.nextVector(TreeReaderFactory.java:270)
at
org.apache.orc.impl.TreeReaderFactory$FloatTreeReader.nextVector(TreeReaderFactory.java:667)
at
org.apache.orc.impl.ConvertTreeReaderFactory$DecimalFromFloatTreeReader.nextVector(ConvertTreeReaderFactory.java:1251)
at
org.apache.orc.impl.TreeReaderFactory$ListTreeReader.nextVector(TreeReaderFactory.java:2199)
at
org.apache.orc.impl.TreeReaderFactory$StructTreeReader.nextBatch(TreeReaderFactory.java:2013)
at
org.apache.orc.impl.RecordReaderImpl.nextBatch(RecordReaderImpl.java:1290) at
org.apache.hadoop.hive.ql.io.orc.RecordReaderImpl.nextBatch(RecordReaderImpl.java:156)
at
org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:129)
... 25 more
{noformat}
So fix similar to ORC-598 can be done to other readers like
DecimalFromFloatTreeReader
> Fix made in ORC-598 need to be extended to other readers like
> DecimalFromFloatTreeReader
> ----------------------------------------------------------------------------------------
>
> Key: ORC-672
> URL: https://issues.apache.org/jira/browse/ORC-672
> Project: ORC
> Issue Type: Bug
> Components: Java
> Affects Versions: 1.5.1
> Reporter: Taraka Rama Rao Lethavadla
> Priority: Major
>
> Scenario:
> Orc file contains data(more than 1024 entries) in array<float> and an
> external table is created with one of the column as array<decimal(9,2)>. Now
> {noformat}
> LOAD DATA INPATH '/tmp/test.orc' OVERWRITE INTO TABLE test_orc_table{noformat}
> query is failing with below exception
> {noformat}
> Caused by: java.lang.RuntimeException:
> java.lang.ArrayIndexOutOfBoundsException: 1024
> at
> org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:133)
> at
> org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:56)
> at
> org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:361)
> ... 23 more Caused by:
> java.lang.ArrayIndexOutOfBoundsException: 1024 at
> org.apache.orc.impl.TreeReaderFactory$TreeReader.nextVector(TreeReaderFactory.java:270)
> at
> org.apache.orc.impl.TreeReaderFactory$FloatTreeReader.nextVector(TreeReaderFactory.java:667)
> at
> org.apache.orc.impl.ConvertTreeReaderFactory$DecimalFromFloatTreeReader.nextVector(ConvertTreeReaderFactory.java:1251)
> at
> org.apache.orc.impl.TreeReaderFactory$ListTreeReader.nextVector(TreeReaderFactory.java:2199)
> at
> org.apache.orc.impl.TreeReaderFactory$StructTreeReader.nextBatch(TreeReaderFactory.java:2013)
> at
> org.apache.orc.impl.RecordReaderImpl.nextBatch(RecordReaderImpl.java:1290) at
> org.apache.hadoop.hive.ql.io.orc.RecordReaderImpl.nextBatch(RecordReaderImpl.java:156)
> at
> org.apache.hadoop.hive.ql.io.orc.VectorizedOrcInputFormat$VectorizedOrcRecordReader.next(VectorizedOrcInputFormat.java:129)
> ... 25 more
> {noformat}
> So fix similar to ORC-598 can be done to other readers like
> DecimalFromFloatTreeReader
--
This message was sent by Atlassian Jira
(v8.3.4#803005)