[ https://issues.apache.org/jira/browse/SPARK-35880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368965#comment-17368965 ]
Apache Spark commented on SPARK-35880: -------------------------------------- User 'vkorukanti' has created a pull request for this issue: https://github.com/apache/spark/pull/33065 > [SS] Track the number of duplicates dropped in streaming dedupe operator > ------------------------------------------------------------------------ > > Key: SPARK-35880 > URL: https://issues.apache.org/jira/browse/SPARK-35880 > Project: Spark > Issue Type: Improvement > Components: Structured Streaming > Affects Versions: 3.1.2 > Reporter: Venki Korukanti > Priority: Minor > > Currently there is no way to find how many duplicates in the input are > dropped. Having this metric will help track down incorrect results issues. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org