[
https://issues.apache.org/jira/browse/BEAM-1725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17546438#comment-17546438
]
Kenneth Knowles commented on BEAM-1725:
---------------------------------------
This issue has been migrated to https://github.com/apache/beam/issues/18306
> SparkRunner should deduplicate when an UnboundedSource requires Deduping
> ------------------------------------------------------------------------
>
> Key: BEAM-1725
> URL: https://issues.apache.org/jira/browse/BEAM-1725
> Project: Beam
> Issue Type: Bug
> Components: runner-spark
> Reporter: Thomas Groh
> Priority: P3
>
> The SparkRunner's implementation of an unbounded Read does not inspect the
> requiresDeduping property of the source, so it does not deduplicate records
> read from sources that require it.
> https://github.com/apache/beam/blob/master/runners/spark/src/main/java/org/apache/beam/runners/spark/io/SparkUnboundedSource.java
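
As a rough illustration of the behavior the issue asks for, here is a minimal sketch in plain Java. It does not use Beam or Spark classes. IdentifiedRecord and DedupSketch are hypothetical names, and the boolean flag stands in for the source's requiresDeduping property. It is an assumption about the general technique (drop records whose ID has already been seen), not the SparkRunner's actual code.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for a record read from an unbounded source:
// a payload plus the record ID reported for it.
final class IdentifiedRecord {
    final byte[] recordId;
    final String value;

    IdentifiedRecord(byte[] recordId, String value) {
        this.recordId = recordId;
        this.value = value;
    }
}

final class DedupSketch {
    // Drops records whose ID was already seen, but only when the source
    // declares that it requires deduping; otherwise all records pass through.
    static List<String> read(List<IdentifiedRecord> records, boolean requiresDeduping) {
        Set<ByteBuffer> seenIds = new HashSet<>();
        List<String> output = new ArrayList<>();
        for (IdentifiedRecord record : records) {
            // ByteBuffer.wrap gives content-based equals/hashCode for byte[].
            if (requiresDeduping && !seenIds.add(ByteBuffer.wrap(record.recordId))) {
                continue; // ID already seen: skip the duplicate
            }
            output.add(record.value);
        }
        return output;
    }
}
```

The sketch keeps every seen ID in memory. A streaming runner would presumably need to bound and checkpoint that state, which is outside the scope of this illustration.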