[
https://issues.apache.org/jira/browse/SPARK-33962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun resolved SPARK-33962.
-----------------------------------
Fix Version/s: 3.2.0
Resolution: Fixed
Issue resolved by pull request 30994
[https://github.com/apache/spark/pull/30994]
> Fix incorrect min partition condition in getRanges
> --------------------------------------------------
>
> Key: SPARK-33962
> URL: https://issues.apache.org/jira/browse/SPARK-33962
> Project: Spark
> Issue Type: Bug
> Components: Structured Streaming
> Affects Versions: 3.2.0
> Reporter: L. C. Hsieh
> Assignee: L. C. Hsieh
> Priority: Minor
> Fix For: 3.2.0
>
>
> When calculating offset ranges, we consider minPartitions configuration. If
> minPartitions is not set or is less than or equal the size of given ranges,
> it means there are enough partitions at Kafka so we don't need to split
> offsets to satisfy min partition requirement. But the current condition is
> offsetRanges.size > minPartitions.get and is not correct. Currently getRanges
> will split offsets in unnecessary case.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]