cloud-fan commented on code in PR #39240:
URL: https://github.com/apache/spark/pull/39240#discussion_r1058942699


##########
connector/connect/common/src/main/protobuf/spark/connect/relations.proto:
##########
@@ -353,9 +353,10 @@ message Sample {
   // (Optional) The random seed.
   optional int64 seed = 5;
 
-  // (Optional) Explicitly sort the underlying plan to make the ordering 
deterministic.
-  // This flag is only used to randomly splits DataFrame with the provided 
weights.
-  optional bool force_stable_sort = 6;
+  // (Required) Explicitly sort the underlying plan to make the ordering 
deterministic or cache it.
+  // This flag is true when invoking `dataframe.randomSplit` to randomly 
splits DataFrame with the
+  // provided weights. Otherwise, it is false.
+  bool deterministic_order = 6;

Review Comment:
   The default value `false` is fine here. @amaliujia what's the principle to 
mark field as required or optional?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to