Github user gatorsmile commented on a diff in the pull request:
https://github.com/apache/spark/pull/21587#discussion_r196301308
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala
---
@@ -99,16 +99,19 @@ case class ClusteredDistribution(
* This is a strictly stronger guarantee than [[ClusteredDistribution]].
Given a tuple and the
* number of partitions, this distribution strictly requires which
partition the tuple should be in.
*/
-case class HashClusteredDistribution(expressions: Seq[Expression]) extends
Distribution {
+case class HashClusteredDistribution(
+ expressions: Seq[Expression],
+ requiredNumPartitions: Option[Int] = None) extends Distribution {
--- End diff --
Do we need to update
https://github.com/cloud-fan/spark/blob/b69a7271e4c5c4c1b46f6a4837e12ac714ab33b4/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala#L214-L217?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]