[
https://issues.apache.org/jira/browse/FLINK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jingsong Lee closed FLINK-22202.
--------------------------------
Resolution: Fixed
master (1.13): 413ff6ae7e7a1bd6c5fe495a141f02d383165776
> Thread safety in ParquetColumnarRowInputFormat
> ----------------------------------------------
>
> Key: FLINK-22202
> URL: https://issues.apache.org/jira/browse/FLINK-22202
> Project: Flink
> Issue Type: Bug
> Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
> Reporter: Jingsong Lee
> Assignee: Jingsong Lee
> Priority: Critical
> Labels: pull-request-available
> Fix For: 1.13.0
>
>
> In a {{VectorizedColumnBatch}}, the dictionary will be lazied deserialized.
> If there are multiple batches at the same time, there may be thread safety
> problems, because the deserialization of the dictionary depends on some
> internal structures.
> We need set numBatchesToCirculate to 1 for ParquetColumnarRowInputFormat.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)