[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-11 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115987034 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-11 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115925754 --- Diff: core/src/main/scala/org/apache/spark/MapOutputStatistics.scala --- @@ -23,5 +23,9 @@ package org.apache.spark * @param shuffleId ID of the

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-11 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115925532 --- Diff: core/src/main/scala/org/apache/spark/MapOutputStatistics.scala --- @@ -23,5 +23,9 @@ package org.apache.spark * @param shuffleId ID of the

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115680727 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115642947 --- Diff: core/src/main/scala/org/apache/spark/MapOutputStatistics.scala --- @@ -23,5 +23,9 @@ package org.apache.spark * @param shuffleId ID of the

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115642967 --- Diff: core/src/main/scala/org/apache/spark/MapOutputTracker.scala --- @@ -156,12 +156,14 @@ private[spark] abstract class MapOutputTracker(conf:

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115644428 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchange.scala --- @@ -221,6 +221,12 @@ object ShuffleExchange {

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115667172 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115645927 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115657718 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115643011 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -39,16 +40,18 @@ private[spark] sealed trait MapStatus { * necessary

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-26 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r103140028 --- Diff: core/src/main/scala/org/apache/spark/MapOutputStatistics.scala --- @@ -23,5 +23,9 @@ package org.apache.spark * @param shuffleId ID of the

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-26 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r103140017 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -39,16 +40,18 @@ private[spark] sealed trait MapStatus { * necessary

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-26 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r103138824 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-26 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r103134610 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-23 Thread watermen
Github user watermen commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r102892814 --- Diff: core/src/main/scala/org/apache/spark/MapOutputStatistics.scala --- @@ -23,5 +23,9 @@ package org.apache.spark * @param shuffleId ID of the

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-23 Thread witgo
Github user witgo commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r102891969 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -39,16 +40,18 @@ private[spark] sealed trait MapStatus { * necessary

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-08 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r100033968 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-04 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r99465564 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala --- @@ -216,7 +216,7 @@ class PlannerSuite extends SharedSQLContext {

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-02-04 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r99463628 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/PlannerSuite.scala --- @@ -216,7 +216,7 @@ class PlannerSuite extends SharedSQLContext

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-25 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97786814 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-25 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97786175 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-25 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97783672 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,101 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97712511 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97710485 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97710405 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97710396 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97701247 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97700863 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97700723 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97700670 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,100 @@ trait BaseLimitExec extends UnaryExecNode with

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-01-24 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r97700568 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala --- @@ -230,6 +230,21 @@ case object SinglePartition