rangadi commented on code in PR #40834:
URL: https://github.com/apache/spark/pull/40834#discussion_r1170693559
##########
connector/connect/common/src/main/protobuf/spark/connect/relations.proto:
##########
@@ -363,6 +364,23 @@ message Deduplicate {
optional bool all_columns_as_keys = 3;
}
+// Relation of type [[DeduplicateWithinWatermark]] which have duplicate rows removed within the time
+// range of watermark, could consider either only the subset of columns or all the columns.
+message DeduplicateWithinWatermark {
Review Comment:
Optional: we could simply reuse the existing Deduplicate message inside this new message :).
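
A rough sketch of what that reuse might look like, purely for illustration. The field name `deduplicate` and the field number 1 are assumptions and are not taken from the PR. The fragment assumes it sits in the same relations.proto file where Deduplicate is already defined:

```protobuf
// Hypothetical sketch only: wraps the existing Deduplicate message instead of
// repeating its fields (such as all_columns_as_keys) in the new message.
message DeduplicateWithinWatermark {
  // (Assumed field name and number) Carries the same options that Deduplicate
  // already defines, so both relations share one definition of the keys.
  Deduplicate deduplicate = 1;
}
```

With this shape, any field later added to Deduplicate would be available here without a second edit, at the cost of one extra level of nesting for clients that build the message.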
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]