[ https://issues.apache.org/jira/browse/HIVE-11102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14609589#comment-14609589 ]
Gopal V commented on HIVE-11102:
--------------------------------
I added it to my nightlies, and I see some strange logs from Query27:
{code}
2015-07-01 01:36:18,903 WARN [ORC_GET_SPLITS #2] orc.ReaderImpl: Cannot find field for: cd_demo_sk in _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8,
2015-07-01 01:36:18,904 WARN [ORC_GET_SPLITS #2] orc.ReaderImpl: Cannot find field for: cd_gender in _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8,
2015-07-01 01:36:18,904 WARN [ORC_GET_SPLITS #2] orc.ReaderImpl: Cannot find field for: cd_marital_status in _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8,
2015-07-01 01:36:18,904 WARN [ORC_GET_SPLITS #2] orc.ReaderImpl: Cannot find field for: cd_education_status in _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8,
2015-07-01 01:36:18,903 WARN [ORC_GET_SPLITS #1] orc.ReaderImpl: Cannot find field for: cd_demo_sk in _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8,
{code}
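To make the symptom concrete, here is a rough standalone sketch of what the warnings suggest: the list being searched only holds positional names (_col0 to _col8), so a lookup by table column name comes back empty. This is not the actual ReaderImpl code; the class ColumnLookupSketch and the method indexOfField are hypothetical names used only for illustration.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not taken from Hive's ReaderImpl.
final class ColumnLookupSketch {

    // Returns the position of `wanted` among the field names, or -1 if absent.
    static int indexOfField(List<String> fileFieldNames, String wanted) {
        return fileFieldNames.indexOf(wanted);
    }

    public static void main(String[] args) {
        // Field names exactly as they appear in the warnings above.
        List<String> fileFieldNames = Arrays.asList(
            "_col0", "_col1", "_col2", "_col3", "_col4",
            "_col5", "_col6", "_col7", "_col8");

        // Table column names taken from the same warnings.
        for (String wanted : Arrays.asList("cd_demo_sk", "cd_gender")) {
            int idx = indexOfField(fileFieldNames, wanted);
            if (idx < 0) {
                // Mirrors the shape of the logged message, nothing more.
                System.out.println("Cannot find field for: " + wanted
                    + " in " + String.join(", ", fileFieldNames));
            }
        }
    }
}
```

If this reading is right, any size estimate keyed on column names would have nothing to match against for these files, which would explain why every requested column produces a warning.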
> ReaderImpl: getColumnIndicesFromNames does not work for ACID tables
> -------------------------------------------------------------------
>
> Key: HIVE-11102
> URL: https://issues.apache.org/jira/browse/HIVE-11102
> Project: Hive
> Issue Type: Bug
> Components: File Formats
> Affects Versions: 1.3.0, 1.2.1, 2.0.0
> Reporter: Gopal V
> Assignee: Sergey Shelukhin
> Attachments: HIVE-11102.patch
>
>
> ORC reader impl does not estimate the size of ACID data files correctly.
> {code}
> Caused by: java.lang.IndexOutOfBoundsException: Index: 0
> at java.util.Collections$EmptyList.get(Collections.java:3212)
> at org.apache.hadoop.hive.ql.io.orc.OrcProto$Type.getSubtypes(OrcProto.java:12240)
> at org.apache.hadoop.hive.ql.io.orc.ReaderImpl.getColumnIndicesFromNames(ReaderImpl.java:651)
> at org.apache.hadoop.hive.ql.io.orc.ReaderImpl.getRawDataSizeOfColumns(ReaderImpl.java:634)
> at org.apache.hadoop.hive.ql.io.orc.OrcInputFormat$SplitGenerator.populateAndCacheStripeDetails(OrcInputFormat.java:938)
> at org.apache.hadoop.hive.ql.io.orc.OrcInputFormat$SplitGenerator.call(OrcInputFormat.java:847)
> at org.apache.hadoop.hive.ql.io.orc.OrcInputFormat$SplitGenerator.call(OrcInputFormat.java:713)
> at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> {code}