[GitHub] spark pull request #21560: [SPARK-24386][SS] coalesce(1) aggregates in conti...

jose-torres Mon, 25 Jun 2018 13:45:02 -0700

GitHub user jose-torres commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21560#discussion_r197935989
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala ---
    @@ -349,6 +349,17 @@ object UnsupportedOperationChecker {
                   _: DeserializeToObject | _: SerializeFromObject | _: SubqueryAlias |
                   _: TypedFilter) =>
             case node if node.nodeName == "StreamingRelationV2" =>
    +        case Repartition(1, false, _) =>
    +        case node: Aggregate =>
    +          val aboveSinglePartitionCoalesce = node.find {
    +            case Repartition(1, false, _) => true
    --- End diff --
    
    I don't think there's any particular reason we need to. There's no reason
    we couldn't execute multiple repartitions if the optimizer isn't smart enough
    to combine them.


