[ https://issues.apache.org/jira/browse/PARQUET-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165597#comment-17165597 ]
Gabor Szadovszky commented on PARQUET-1852:
-------------------------------------------
I think we need more information here. First, this is not a parquet-format
issue: based on the stack trace, it occurred in parquet-mr. We don't know which
version of parquet-mr, though. It would also be nice to know all the details of
this issue. What is the Parquet schema of the file being written? What kind of
data is written to the file? How can the issue be reproduced?
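
If the parquet-mr version is not obvious from the build file, something like
the following rough sketch may help report it. It uses only standard JDK
reflection to print where a class was loaded from and, if the jar's manifest
happens to carry an Implementation-Version entry, that version. The class name
ParquetVersionProbe and the helper describe() are made up for illustration; the
probed class name is taken from the stack trace in this issue.

```java
import java.security.CodeSource;

public class ParquetVersionProbe {

    /**
     * Hypothetical helper: reports the manifest version (if any) and the
     * load location (usually the jar path) of the named class.
     */
    static String describe(String className) {
        try {
            // Load without initializing, so no static initializers run.
            Class<?> cls = Class.forName(
                    className, false, ParquetVersionProbe.class.getClassLoader());

            // Implementation-Version is only present if the jar manifest sets it.
            Package pkg = cls.getPackage();
            String version = (pkg == null) ? null : pkg.getImplementationVersion();

            // The code source is null for classes loaded by the bootstrap loader.
            CodeSource src = cls.getProtectionDomain().getCodeSource();
            String location = (src == null || src.getLocation() == null)
                    ? "unknown"
                    : src.getLocation().toString();

            return className
                    + " version=" + (version == null ? "unknown" : version)
                    + " location=" + location;
        } catch (ClassNotFoundException e) {
            return className + " not found on classpath";
        }
    }

    public static void main(String[] args) {
        // Class name copied from the stack trace in this issue.
        System.out.println(describe(
                "org.apache.parquet.column.values.dictionary.DictionaryValuesWriter"));
    }
}
```

Run it with the same classpath as the failing job. Even when the version comes
back as "unknown", the jar file name in the location normally contains the
version number.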
> Array Index OutOf Bounds Exception when fall Back Dictionary Encoded Data
> -------------------------------------------------------------------------
>
> Key: PARQUET-1852
> URL: https://issues.apache.org/jira/browse/PARQUET-1852
> Project: Parquet
> Issue Type: Bug
> Components: parquet-format
> Reporter: jiangbo
> Priority: Major
>
> java.lang.ArrayIndexOutOfBoundsException: 39782
>     at org.apache.parquet.column.values.dictionary.DictionaryValuesWriter$PlainBinaryDictionaryValuesWriter.fallBackDictionaryEncodedData(DictionaryValuesWriter.java:284)
>     at org.apache.parquet.column.values.dictionary.DictionaryValuesWriter.fallBackAllValuesTo(DictionaryValuesWriter.java:123)
>     at org.apache.parquet.column.values.fallback.FallbackValuesWriter.fallBack(FallbackValuesWriter.java:147)
>     at org.apache.parquet.column.values.fallback.FallbackValuesWriter.checkFallback(FallbackValuesWriter.java:141)
>     at org.apache.parquet.column.values.fallback.FallbackValuesWriter.writeBytes(FallbackValuesWriter.java:163)
>     at org.apache.parquet.column.impl.ColumnWriterV1.write(ColumnWriterV1.java:201)
>     at org.apache.parquet.io.MessageColumnIO$MessageColumnIORecordConsumer.addBinary(MessageColumnIO.java:467)
>     at org.apache.parquet.io.RecordConsumerLoggingWrapper.addBinary(RecordConsumerLoggingWrapper.java:119)
>     at org.apache.parquet.example.data.simple.BinaryValue.writeValue(BinaryValue.java:45)
>     at org.apache.parquet.example.data.simple.SimpleGroup.writeValue(SimpleGroup.java:229)
>     at org.apache.parquet.example.data.GroupWriter.writeGroup(GroupWriter.java:51)
>     at org.apache.parquet.example.data.GroupWriter.write(GroupWriter.java:37)
>     at org.apache.parquet.hadoop.example.GroupWriteSupport.write(GroupWriteSupport.java:79)
>     at org.apache.parquet.hadoop.example.GroupWriteSupport.write(GroupWriteSupport.java:36)
>     at org.apache.parquet.hadoop.InternalParquetRecordWriter.write(InternalParquetRecordWriter.java:123)
>     at org.apache.parquet.hadoop.ParquetWriter.write(ParquetWriter.java:293)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)