[ https://issues.apache.org/jira/browse/PARQUET-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gabor Szadovszky resolved PARQUET-2052.
---------------------------------------
    Resolution: Fixed

> Integer overflow when writing huge binary using dictionary encoding
> -------------------------------------------------------------------
>
>                 Key: PARQUET-2052
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2052
>             Project: Parquet
>          Issue Type: Bug
>            Reporter: Chao Sun
>            Assignee: Chao Sun
>            Priority: Major
>
> To check whether it should fall back to plain encoding,
> {{DictionaryValuesWriter}} currently uses two variables,
> {{dictionaryByteSize}} and {{maxDictionaryByteSize}}, both of which are
> integers. This causes a problem when one first writes a relatively small
> binary within the threshold and then writes a huge string, which causes
> {{dictionaryByteSize}} to overflow and become negative. A negative size
> never exceeds {{maxDictionaryByteSize}}, so the fallback is not triggered.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
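
The overflow described in this issue can be reproduced without Parquet. The following is a minimal sketch, not the actual DictionaryValuesWriter code: the class name, method names, and the 1 MiB threshold are hypothetical, and value lengths stand in for real binaries so that no large allocation is needed. It compares an int accumulator with a long one, which is one possible remedy and not necessarily the fix that was committed.

```java
public class DictionarySizeOverflowDemo {
    // Hypothetical threshold, standing in for maxDictionaryByteSize.
    static final int MAX_DICTIONARY_BYTE_SIZE = 1024 * 1024;

    /** Sums value lengths in an int, as in the reported bug. */
    static boolean shouldFallBackInt(int[] valueLengths) {
        int dictionaryByteSize = 0;
        for (int len : valueLengths) {
            // Java int addition wraps silently on overflow,
            // so this sum can become negative.
            dictionaryByteSize += len;
        }
        return dictionaryByteSize > MAX_DICTIONARY_BYTE_SIZE;
    }

    /** Sums value lengths in a long, which cannot wrap for these inputs. */
    static boolean shouldFallBackLong(int[] valueLengths) {
        long dictionaryByteSize = 0L;
        for (int len : valueLengths) {
            dictionaryByteSize += len;
        }
        return dictionaryByteSize > MAX_DICTIONARY_BYTE_SIZE;
    }

    public static void main(String[] args) {
        // A small value within the threshold, followed by a huge one.
        int[] lengths = {100, Integer.MAX_VALUE};
        System.out.println("int check falls back:  " + shouldFallBackInt(lengths));
        System.out.println("long check falls back: " + shouldFallBackLong(lengths));
    }
}
```

With these inputs the int-based check reports that no fallback is needed, because the wrapped sum is negative, while the long-based check correctly reports that the threshold was exceeded.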