GitHub user viirya commented on a diff in the pull request:
https://github.com/apache/spark/pull/21857#discussion_r205560454
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala ---
@@ -1947,6 +1947,21 @@ class Dataset[T] private[sql](
Except(planWithBarrier, other.planWithBarrier)
}
+ /**
+ * Returns a new Dataset containing rows in this Dataset but not in another Dataset while
+ * preserving the duplicates.
--- End diff --
I think it would be good to mention `resolves columns by position (not by name)`
here too.
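
To illustrate why the note matters, here is a minimal sketch in plain Scala collections. It is a toy model of the documented semantics, not Spark's implementation: `Table`, `Row`, and `ExceptAllSketch` are hypothetical names invented for this example, and it assumes rows are compared purely by column position while duplicates are kept.

```scala
// Toy model only: hypothetical types, not part of Spark.
object ExceptAllSketch {
  type Row = Seq[Any]

  // A "table" is a list of column names plus positional rows.
  final case class Table(columns: Seq[String], rows: Seq[Row])

  // Multiset difference: each row on the right cancels at most one equal
  // row on the left, so surplus duplicates on the left survive.
  // The right side's column names are never consulted; only positions matter.
  def exceptAll(left: Table, right: Table): Table = {
    require(left.columns.length == right.columns.length,
      "both sides must have the same number of columns")
    // Seq.diff computes a multiset difference, which keeps remaining duplicates.
    Table(left.columns, left.rows.diff(right.rows))
  }

  // Left has a duplicated row; right uses swapped column NAMES on purpose.
  val left: Table = Table(
    Seq("id", "tag"),
    Seq[Row](Seq(1, "a"), Seq(1, "a"), Seq(2, "b")))
  val right: Table = Table(
    Seq("tag", "id"),
    Seq[Row](Seq(1, "a"), Seq(3, "c")))

  val result: Table = exceptAll(left, right)
}
```

In this model `result.rows` keeps one `(1, "a")` and the `(2, "b")` row: the single matching row on the right removes only one of the two duplicates, and the match happens even though the right side names its columns `tag, id`. A by-name resolution would have matched nothing, which is the surprise the doc note would warn about.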