Wei Liu created SPARK-48411:
-------------------------------
Summary: Add E2E test for DropDuplicateWithinWatermark
Key: SPARK-48411
URL: https://issues.apache.org/jira/browse/SPARK-48411
Project: Spark
Issue Type: New Feature
Components: Connect, SS
Affects Versions: 4.0.0
Reporter: Wei Liu
Currently we do not have a e2e test for DropDuplicateWithinWatermark, we should
add one. We can simply use one of the test written in Scala here (with the
testStream API) and replicate it to python:
[https://github.com/apache/spark/commit/0e9e34c1bd9bd16ad5efca77ce2763eb950f3103]
The change should happen in
[https://github.com/apache/spark/blob/eee179135ed21dbdd8b342d053c9eda849e2de77/python/pyspark/sql/tests/streaming/test_streaming.py#L29]
so we can test it in both connect and non-connect.
Test with:
```
python/run-tests --testnames pyspark.sql.tests.streaming.test_streaming
python/run-tests --testnames
pyspark.sql.tests.connect.streaming.test_parity_streaming
```
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]