[
https://issues.apache.org/jira/browse/SPARK-20296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15964756#comment-15964756
]
Apache Spark commented on SPARK-20296:
--------------------------------------
User 'jtoka' has created a pull request for this issue:
https://github.com/apache/spark/pull/17609
> UnsupportedOperationChecker text on distinct aggregations differs from docs
> ---------------------------------------------------------------------------
>
> Key: SPARK-20296
> URL: https://issues.apache.org/jira/browse/SPARK-20296
> Project: Spark
> Issue Type: Documentation
> Components: Structured Streaming
> Affects Versions: 2.1.0
> Reporter: Jason Tokayer
> Priority: Trivial
>
> In the unsupported operations section in the docs
> https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
> it states that "Distinct operations on streaming Datasets are not
> supported.". However, in
> ```org.apache.spark.sql.catalyst.analysis.UnsupportedOperationChecker.scala```,
> the error message is ```Distinct aggregations are not supported on streaming
> DataFrames/Datasets, unless it is on aggregated DataFrame/Dataset in Complete
> output mode. Consider using approximate distinct aggregation```.
> It seems that the error message is incorrect.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]