[ https://issues.apache.org/jira/browse/HIVE-18106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16260142#comment-16260142 ]
Ferdinand Xu commented on HIVE-18106:
-------------------------------------
On which version did you hit this error? I tried the latest upstream and cannot
reproduce it.
> analyze table fails on parquet table
> ------------------------------------
>
> Key: HIVE-18106
> URL: https://issues.apache.org/jira/browse/HIVE-18106
> Project: Hive
> Issue Type: Bug
> Reporter: Rajesh Balamohan
>
> {noformat}
> hive> analyze table item compute statistics for columns;
> Caused by: org.apache.parquet.io.ParquetDecodingException: Can not read value at 0 in block -1 in file hdfs://...
> at org.apache.parquet.hadoop.InternalParquetRecordReader.nextKeyValue(InternalParquetRecordReader.java:223)
> at org.apache.parquet.hadoop.ParquetRecordReader.nextKeyValue(ParquetRecordReader.java:212)
> at org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWrapper.<init>(ParquetRecordReaderWrapper.java:98)
> at org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWrapper.<init>(ParquetRecordReaderWrapper.java:60)
> at org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat.getRecordReader(MapredParquetInputFormat.java:87)
> at org.apache.hadoop.hive.ql.io.HiveInputFormat.getRecordReader(HiveInputFormat.java:415)
> ... 27 more
> Caused by: java.lang.UnsupportedOperationException: org.apache.parquet.column.values.dictionary.PlainValuesDictionary$PlainIntegerDictionary
> at org.apache.parquet.column.Dictionary.decodeToBinary(Dictionary.java:44)
> at org.apache.hadoop.hive.ql.io.parquet.convert.ETypeConverter$BinaryConverter.setDictionary(ETypeConverter.java:283)
> at org.apache.parquet.column.impl.ColumnReaderImpl.<init>(ColumnReaderImpl.java:346)
> at org.apache.parquet.column.impl.ColumnReadStoreImpl.newMemColumnReader(ColumnReadStoreImpl.java:82)
> at org.apache.parquet.column.impl.ColumnReadStoreImpl.getColumnReader(ColumnReadStoreImpl.java:77)
> at org.apache.parquet.io.RecordReaderImplementation.<init>(RecordReaderImplementation.java:270)
> at org.apache.parquet.io.MessageColumnIO$1.visit(MessageColumnIO.java:140)
> at org.apache.parquet.io.MessageColumnIO$1.visit(MessageColumnIO.java:106)
> at org.apache.parquet.filter2.compat.FilterCompat$NoOpFilter.accept(FilterCompat.java:154)
> at org.apache.parquet.io.MessageColumnIO.getRecordReader(MessageColumnIO.java:106)
> at org.apache.parquet.hadoop.InternalParquetRecordReader.checkRead(InternalParquetRecordReader.java:136)
> at org.apache.parquet.hadoop.InternalParquetRecordReader.nextKeyValue(InternalParquetRecordReader.java:194)
> ... 32 more
> hive> select count(*) from item;
> 300000
> {noformat}