Github user jose-torres commented on a diff in the pull request:
https://github.com/apache/spark/pull/21560#discussion_r197935989
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala
---
@@ -349,6 +349,17 @@ object UnsupportedOperationChecker {
_: DeserializeToObject | _: SerializeFromObject | _: SubqueryAlias |
_: TypedFilter) =>
case node if node.nodeName == "StreamingRelationV2" =>
+ case Repartition(1, false, _) =>
+ case node: Aggregate =>
+ val aboveSinglePartitionCoalesce = node.find {
+ case Repartition(1, false, _) => true
--- End diff --
I don't think there's any particular reason we need to. There's no reason
we couldn't execute multiple repartitions if the optimizer isn't smart enough
to combine them.
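
As a rough illustration, here is a minimal sketch of the `find`-based check from the diff, run against stacked repartitions. The case classes below are toy stand-ins I made up for this example, not the real Catalyst plan nodes, and `isAboveSinglePartitionCoalesce` is a hypothetical helper name. It only shows that the pattern match does not care how many `Repartition(1, false, _)` nodes sit below the aggregate.

```scala
// Toy stand-ins for Catalyst plan nodes (assumption: NOT the real Spark classes).
sealed trait Plan { def children: Seq[Plan] }
case class Relation(name: String) extends Plan {
  val children: Seq[Plan] = Nil
}
case class Repartition(numPartitions: Int, shuffle: Boolean, child: Plan) extends Plan {
  val children: Seq[Plan] = Seq(child)
}
case class Aggregate(child: Plan) extends Plan {
  val children: Seq[Plan] = Seq(child)
}

object RepartitionCheck {
  // Pre-order search over the tree, similar in spirit to TreeNode.find.
  def find(plan: Plan)(p: Plan => Boolean): Option[Plan] =
    if (p(plan)) Some(plan)
    else plan.children.foldLeft(Option.empty[Plan]) { (acc, c) =>
      acc.orElse(find(c)(p)) // orElse is by-name, so later children are skipped once found
    }

  // Hypothetical helper mirroring the check in the diff:
  // true if a non-shuffling coalesce to one partition exists anywhere in the subtree.
  def isAboveSinglePartitionCoalesce(agg: Aggregate): Boolean =
    find(agg) {
      case Repartition(1, false, _) => true
      case _ => false
    }.isDefined
}

object Demo extends App {
  import RepartitionCheck._

  // Two stacked single-partition coalesces that an optimizer did not combine.
  val stacked = Aggregate(Repartition(1, false, Repartition(1, false, Relation("stream"))))
  val plain = Aggregate(Relation("stream"))

  println(isAboveSinglePartitionCoalesce(stacked))
  println(isAboveSinglePartitionCoalesce(plain))
}
```

In this toy model the stacked plan is accepted just like a plan with a single coalesce, which is the point: nothing in the check depends on the repartitions having been merged first.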
---