[
https://issues.apache.org/jira/browse/FLINK-28018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17553484#comment-17553484
]
Zhu Zhu commented on FLINK-28018:
---------------------------------
I have assigned you the ticket. Feel free to open a fix for it.
> the start index to create empty splits in BinaryInputFormat#createInputSplits
> is inappropriate
> ----------------------------------------------------------------------------------------------
>
> Key: FLINK-28018
> URL: https://issues.apache.org/jira/browse/FLINK-28018
> Project: Flink
> Issue Type: Bug
> Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
> Affects Versions: 1.15.0, 1.14.4, 1.16.0
> Reporter: zl
> Assignee: zl
> Priority: Major
> Fix For: 1.16.0, 1.15.2
>
>
> when the number of created split is smaller than the minimum desired number
> of file splits,
> [BinaryInputFormat.java#L150|https://github.com/apache/flink/blob/master/flink-core/src/main/java/org/apache/flink/api/common/io/BinaryInputFormat.java#L150]
> use `{_}*files.size()*{_}` as the start index to create empty splits. That
> is inappropriate, the start index should be `{_}*inputSplits.size()*{_}`.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)