Jason Tokayer created SPARK-20296:
-------------------------------------
Summary: UnsupportedOperationChecker text on distinct aggregations
differs from docs
Key: SPARK-20296
URL: https://issues.apache.org/jira/browse/SPARK-20296
Project: Spark
Issue Type: Documentation
Components: Structured Streaming
Affects Versions: 2.1.0
Reporter: Jason Tokayer
Priority: Minor
The "Unsupported Operations" section of the Structured Streaming programming guide
https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
states that "Distinct operations on streaming Datasets are not supported."
However, in
```org.apache.spark.sql.catalyst.analysis.UnsupportedOperationChecker.scala```,
the error message is ```Distinct aggregations are not supported on streaming
DataFrames/Datasets, unless it is on aggregated DataFrame/Dataset in Complete
output mode. Consider using approximate distinct aggregation```.
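The two statements disagree: the docs rule out distinct operations entirely,
while the error message allows an exception for an aggregated DataFrame/Dataset
in Complete output mode. It seems that the error message is incorrect.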
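
To make the discrepancy concrete, here is a minimal sketch of the kind of query
that should trigger the checker, next to the approximate alternative the error
message suggests. The socket source on localhost:9999, the object name, and the
local master are assumptions for illustration only and are not part of this
report. I have not verified the exact point at which the exception is raised,
so treat the comments as expectations rather than confirmed behavior.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{approx_count_distinct, countDistinct}

// Hypothetical reproduction object; the name is illustrative only.
object DistinctOnStreamSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder
      .master("local[2]")
      .appName("distinct-on-stream-sketch")
      .getOrCreate()

    // Assumed input: a text socket source (e.g. fed by `nc -lk 9999`).
    // It yields a single string column named "value".
    val lines = spark.readStream
      .format("socket")
      .option("host", "localhost")
      .option("port", 9999)
      .load()

    // Exact distinct aggregation directly on a streaming Dataset.
    // Starting this query is expected to fail with an AnalysisException
    // carrying the "Distinct aggregations are not supported ..." message.
    val exact = lines.agg(countDistinct("value"))
    // exact.writeStream.outputMode("complete").format("console").start()

    // Approximate distinct aggregation, as the error message suggests.
    val approx = lines.agg(approx_count_distinct("value"))
    approx.writeStream
      .outputMode("complete")
      .format("console")
      .start()
      .awaitTermination()
  }
}
```

Uncommenting the `exact` query is the quickest way to see the message quoted
above and compare it with the wording in the docs.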