[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600957 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600110 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600909 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600707 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600885 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117601249 --- Diff: common/unsafe/src/test/java/org/apache/spark/unsafe/types/UTF8StringSuite.java --- @@ -730,4 +726,62 @@ public void testToLong() throws

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117601293 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/StringExpressionsSuite.scala --- @@ -375,24 +374,61 @@ class

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600921 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117601306 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala --- @@ -523,6 +524,14 @@ class SQLQuerySuite extends QueryTest

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600718 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117600840 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117601121 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,69 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117601355 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLQuerySuite.scala --- @@ -2015,4 +2015,121 @@ class SQLQuerySuite extends QueryTest

[GitHub] spark issue #12646: [SPARK-14878][SQL] Trim characters string function suppo...

2017-05-19 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/12646 Jenkins is about to shut down, we can retest this later --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #12646: [SPARK-14878][SQL] Trim characters string function suppo...

2017-05-19 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/12646 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #18031: [SPARK-20801] Record accurate size of blocks in M...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/18031#discussion_r117491182 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -193,8 +219,27 @@ private[spark] object HighlyCompressedMapStatus

[GitHub] spark pull request #18031: [SPARK-20801] Record accurate size of blocks in M...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/18031#discussion_r117489904 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -193,8 +219,27 @@ private[spark] object HighlyCompressedMapStatus

[GitHub] spark pull request #18031: [SPARK-20801] Record accurate size of blocks in M...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/18031#discussion_r117443085 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -193,8 +219,27 @@ private[spark] object HighlyCompressedMapStatus

[GitHub] spark pull request #18031: [SPARK-20801] Record accurate size of blocks in M...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/18031#discussion_r117440204 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -121,48 +126,69 @@ private[spark] class CompressedMapStatus

[GitHub] spark pull request #18031: [SPARK-20801] Record accurate size of blocks in M...

2017-05-19 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/18031#discussion_r117440029 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -193,8 +219,27 @@ private[spark] object HighlyCompressedMapStatus

[GitHub] spark issue #17985: Add "full_outer" name to join types

2017-05-18 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17985 @BartekH Yes, I think we can add that to exception message. Please also add a test case for checking supported join types. --- If your project is set up for it, you can reply to this email and have

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117175951 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsSuite.scala --- @@ -175,7 +178,7 @@ class StatisticsSuite extends

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117176108 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsSuite.scala --- @@ -175,7 +178,7 @@ class StatisticsSuite extends

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117174966 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala --- @@ -414,6 +415,50 @@ private[hive] class HiveClientImpl

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117169136 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveComparisonTest.scala --- @@ -192,13 +192,7 @@ abstract class HiveComparisonTest

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117168812 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsSuite.scala --- @@ -215,6 +218,217 @@ class StatisticsSuite extends

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117156770 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala --- @@ -414,6 +415,50 @@ private[hive] class HiveClientImpl

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117172781 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsSuite.scala --- @@ -215,6 +218,217 @@ class StatisticsSuite extends

[GitHub] spark pull request #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated S...

2017-05-18 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/14971#discussion_r117169168 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsSuite.scala --- @@ -232,7 +446,8 @@ class StatisticsSuite extends

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r117148664 --- Diff: common/unsafe/src/test/java/org/apache/spark/unsafe/types/UTF8StringSuite.java --- @@ -730,4 +726,49 @@ public void testToLong() throws

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116947912 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,67 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116947565 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,67 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116925628 --- Diff: common/unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java --- @@ -510,6 +510,67 @@ public UTF8String trim

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116924800 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala --- @@ -1069,6 +1069,8 @@ class AstBuilder(conf: SQLConf

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116924583 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,246 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116924283 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,246 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-17 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116924073 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-16 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116903142 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116659903 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116656846 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116656190 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116655649 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116655664 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116655632 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116646341 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -2306,7 +2306,15 @@ object functions { * @group string_funcs

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116656332 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116646257 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -2382,7 +2390,15 @@ object functions { * @group string_funcs

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116646329 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -2382,7 +2390,15 @@ object functions { * @group string_funcs

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116656230 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116645110 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala --- @@ -1086,10 +1106,22 @@ class AstBuilder extends

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116656069 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116655023 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -2447,7 +2463,15 @@ object functions { * @group string_funcs

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116644643 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala --- @@ -1106,10 +1126,22 @@ class AstBuilder(conf: SQLConf

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116655691 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116646246 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/functions.scala --- @@ -2306,7 +2306,15 @@ object functions { * @group string_funcs

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116644377 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala --- @@ -1091,6 +1094,23 @@ class AstBuilder(conf: SQLConf

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116655997 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #12646: [SPARK-14878][SQL] Trim characters string functio...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/12646#discussion_r116642218 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/stringExpressions.scala --- @@ -461,68 +462,249 @@ case class FindInSet(left

[GitHub] spark pull request #17935: [SPARK-20690][SQL] Analyzer shouldn't add missing...

2017-05-15 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17935#discussion_r116482273 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SubquerySuite.scala --- @@ -868,6 +868,29 @@ class SubquerySuite extends QueryTest

[GitHub] spark issue #17977: [SPARK-20735][SQL][TEST] Enable cross join in TPCDSQuery...

2017-05-15 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17977 LGTM. I always set this true when running tpcds. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17985: Add "full_outer" name to join types

2017-05-15 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17985 "full_outer" will be replaced by "fullouter" after `.replace("_", "")`, right? --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request #17962: [SPARK-20718][SQL][followup] Fix canonicalization...

2017-05-12 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17962#discussion_r116174540 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/QueryPlan.scala --- @@ -442,4 +442,14 @@ object QueryPlan

[GitHub] spark pull request #17962: [SPARK-20718][SQL][followup] Fix canonicalization...

2017-05-12 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17962#discussion_r116171180 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/QueryPlan.scala --- @@ -442,4 +442,14 @@ object QueryPlan

[GitHub] spark issue #17962: [SPARK-20718][SQL][followup] Fix canonicalization for Hi...

2017-05-12 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17962 cc @gatorsmile @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17962: [SPARK-20718][SQL][followup] Fix canonicalization...

2017-05-12 Thread wzhfy
GitHub user wzhfy opened a pull request: https://github.com/apache/spark/pull/17962 [SPARK-20718][SQL][followup] Fix canonicalization for HiveTableScanExec ## What changes were proposed in this pull request? Fix canonicalization for different filter orders

[GitHub] spark issue #17959: [SPARK-20718][SQL] FileSourceScanExec with different fil...

2017-05-12 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17959 @gatorsmile Right, thanks for pointing this out! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #17959: [SPARK-20718][SQL] FileSourceScanExec with differ...

2017-05-12 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17959#discussion_r116162790 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala --- @@ -519,8 +519,18 @@ case class FileSourceScanExec

[GitHub] spark issue #17959: [SPARK-20718][SQL] FileSourceScanExec with different fil...

2017-05-11 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17959 also cc @gatorsmile --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17959: [SPARK-20718][SQL] FileSourceScanExec with different fil...

2017-05-11 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17959 cc @cloud-fan @hvanhovell --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17959: [SPARK-20718][SQL] FileSourceScanExec with differ...

2017-05-11 Thread wzhfy
GitHub user wzhfy opened a pull request: https://github.com/apache/spark/pull/17959 [SPARK-20718][SQL] FileSourceScanExec with different filter orders should be the same after canonicalization ## What changes were proposed in this pull request? Since `constraints

[GitHub] spark issue #16677: [SPARK-19355][SQL] Use map output statistices to improve...

2017-05-10 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/16677 @hvanhovell @cloud-fan We have seen value of this PR in our customer scenarios, and that's why we started a discussion in dev list before. And thank @viirya to discuss with us and implement

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115680727 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode

[GitHub] spark issue #17930: [SPARK-20688][SQL] correctly check analysis for scalar s...

2017-05-10 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17930 Not introduced by this pr, but I just saw a style nit [here](https://github.com/apache/spark/pull/17930/files#diff-1d14ac233eac6f233c027dba0bdf871dR140) ``` if (conditions.isEmpty

[GitHub] spark pull request #17918: [SPARK-20678][SQL] Ndv for columns not in filter ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17918#discussion_r115672685 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/statsEstimation/FilterEstimationSuite.scala --- @@ -421,7 +420,7 @@ class

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115642947 --- Diff: core/src/main/scala/org/apache/spark/MapOutputStatistics.scala --- @@ -23,5 +23,9 @@ package org.apache.spark * @param shuffleId ID

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115642967 --- Diff: core/src/main/scala/org/apache/spark/MapOutputTracker.scala --- @@ -156,12 +156,14 @@ private[spark] abstract class MapOutputTracker(conf

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115644428 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/exchange/ShuffleExchange.scala --- @@ -221,6 +221,12 @@ object ShuffleExchange

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115667172 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115645927 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115657718 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/limit.scala --- @@ -90,25 +95,102 @@ trait BaseLimitExec extends UnaryExecNode

[GitHub] spark pull request #16677: [SPARK-19355][SQL] Use map output statistices to ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/16677#discussion_r115643011 --- Diff: core/src/main/scala/org/apache/spark/scheduler/MapStatus.scala --- @@ -39,16 +40,18 @@ private[spark] sealed trait MapStatus { * necessary

[GitHub] spark pull request #17918: [SPARK-20678][SQL] Ndv for columns not in filter ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17918#discussion_r115664182 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/JoinEstimation.scala --- @@ -217,32 +217,18 @@ case class

[GitHub] spark pull request #17918: [SPARK-20678][SQL] Ndv for columns not in filter ...

2017-05-10 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17918#discussion_r115664061 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/JoinEstimation.scala --- @@ -217,32 +217,18 @@ case class

[GitHub] spark pull request #17918: [SPARK-20678][SQL] Ndv for columns not in filter ...

2017-05-09 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17918#discussion_r115489131 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/statsEstimation/JoinEstimation.scala --- @@ -217,32 +217,18 @@ case class

[GitHub] spark issue #17918: [SPARK-20678][SQL] Ndv for columns not in filter conditi...

2017-05-09 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17918 @ron8hu You can look at tpcds q18. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17918: [SPARK-20678][SQL] Ndv for columns not in filter conditi...

2017-05-09 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17918 @ron8hu Yes, but I forgot the query numbers. The case is, first we filter a large part of table A, then join table B on another column. Since ndv of that column is not scaled down, the estimated

[GitHub] spark issue #17918: [SPARK-20678][SQL] Ndv for columns not in filter conditi...

2017-05-09 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17918 cc @cloud-fan @gatorsmile @ron8hu --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17918: [SPARK-20678][SQL] Ndv for columns not in filter ...

2017-05-09 Thread wzhfy
GitHub user wzhfy opened a pull request: https://github.com/apache/spark/pull/17918 [SPARK-20678][SQL] Ndv for columns not in filter condition should also be updated ## What changes were proposed in this pull request? In filter estimation, we update column stats for those

[GitHub] spark issue #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17897 @cloud-fan OK, let me close this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy closed the pull request at: https://github.com/apache/spark/pull/17897 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17897 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17897#discussion_r115238974 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/tables.scala --- @@ -1025,3 +1025,5 @@ case class ShowCreateTableCommand(table

[GitHub] spark pull request #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17897#discussion_r115189581 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala --- @@ -114,7 +114,7 @@ class QueryExecution(val sparkSession

[GitHub] spark pull request #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17897#discussion_r115189148 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala --- @@ -2561,6 +2561,14 @@ class SQLQuerySuite extends QueryTest

[GitHub] spark issue #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17897 @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #17897: [SPARK-20635][SQL] No SQL tab in Spark UI

2017-05-08 Thread wzhfy
GitHub user wzhfy opened a pull request: https://github.com/apache/spark/pull/17897 [SPARK-20635][SQL] No SQL tab in Spark UI ## What changes were proposed in this pull request? There's no SQL tab in Spark UI since `SQLExecution.withNewExecutionId` is removed in `def

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-04-25 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17649 The changes look good to me if we don't care about the case sensitivity issue. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #17649: [SPARK-20380][SQL] Output table comment for DESC FORMATT...

2017-04-23 Thread wzhfy
Github user wzhfy commented on the issue: https://github.com/apache/spark/pull/17649 @gatorsmile Hive treats comment simply as a key in the string-string parameter map, while spark extracts comment from the map as a field in `CatalogTable`. So the question is, should spark consider

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-21 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r112635039 --- Diff: sql/core/src/test/resources/sql-tests/inputs/describe-table-after-alter-table.sql --- @@ -0,0 +1,29 @@ +CREATE TABLE table_with_comment

[GitHub] spark pull request #17649: [SPARK-20380][SQL] Output table comment for DESC ...

2017-04-21 Thread wzhfy
Github user wzhfy commented on a diff in the pull request: https://github.com/apache/spark/pull/17649#discussion_r112634259 --- Diff: sql/core/src/test/resources/sql-tests/inputs/describe-table-after-alter-table.sql --- @@ -0,0 +1,29 @@ +CREATE TABLE table_with_comment

<    2   3   4   5   6   7   8   9   10   11   >