[
https://issues.apache.org/jira/browse/PARQUET-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gabor Szadovszky resolved PARQUET-2052.
---------------------------------------
Resolution: Fixed
> Integer overflow when writing huge binary using dictionary encoding
> -------------------------------------------------------------------
>
> Key: PARQUET-2052
> URL: https://issues.apache.org/jira/browse/PARQUET-2052
> Project: Parquet
> Issue Type: Bug
> Reporter: Chao Sun
> Assignee: Chao Sun
> Priority: Major
>
> To check whether it should fallback to plain encoding,
> {{DictionaryValuesWriter}} currently use two variables:
> {{dictionaryByteSize}} and {{maxDictionaryByteSize}}, both of which are
> integer. This will cause issue when one first writes a relatively small
> binary within the threshold and then write a huge string which cause
> {{dictionaryByteSize}} overflow and becoming negative.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)