[GitHub] spark pull request #22895: [SPARK-25886][SQL][Minor] Improve error message o...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22895#discussion_r229563086 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/AvroDataToCatalyst.scala --- @@ -100,9 +100,14 @@ case class AvroDataToCatalyst

[GitHub] spark issue #22883: [SPARK-25837] [Core] Fix potential slowdown in AppStatus...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22883 Seems reasonable. Ping @vanzin --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands

[GitHub] spark pull request #22895: [SPARK-25886][SQL][Minor] Improve error message o...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22895#discussion_r229553580 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/AvroDataToCatalyst.scala --- @@ -100,9 +100,14 @@ case class AvroDataToCatalyst

[GitHub] spark pull request #22895: [SPARK-25886][SQL][Minor] Improve error message o...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22895#discussion_r229553381 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/AvroDataToCatalyst.scala --- @@ -100,9 +100,14 @@ case class AvroDataToCatalyst

[GitHub] spark issue #22864: [SPARK-25861][Minor][WEBUI] Remove unused refreshInterva...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22864 Looks like the parameter is not used from the first day? https://github.com/apache/spark/pull/290/files#diff-b8adb646ef90f616c34eb5c98d1ebd16R140

[GitHub] spark pull request #22883: [SPARK-25837] [Core] Fix potential slowdown in Ap...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22883#discussion_r229535706 --- Diff: core/src/main/scala/org/apache/spark/status/AppStatusListener.scala --- @@ -1105,6 +1095,15 @@ private[spark] class AppStatusListener

[GitHub] spark issue #22887: [SPARK-25880][CORE] user set's hadoop conf should not ov...

2018-10-30 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22887 Hi @gjhkael , can you explain more about why you make the change? Did you try `spark.SessionState.newHadoopConf

[GitHub] spark pull request #22895: [SPARK-25886][SQL] Improve error message of `Fail...

2018-10-30 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22895 [SPARK-25886][SQL] Improve error message of `FailureSafeParser` and `from_avro` in FAILFAST mode ## What changes were proposed in this pull request? Currently in `FailureSafeParser

[GitHub] spark pull request #22890: [SPARK-25883][SQL][Minor] Override method `pretty...

2018-10-30 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22890 [SPARK-25883][SQL][Minor] Override method `prettyName` in `from_avro`/`to_avro` ## What changes were proposed in this pull request? Previously in from_avro/to_avro, we override

[GitHub] spark issue #22878: [SPARK-25789][SQL] Support for Dataset of Avro

2018-10-29 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22878 @xuanyuanking , thanks for the work! You can try editing the previous commit message https://help.github.com/articles/creating-a-commit-with-multiple-authors/ , and then `push -f

[GitHub] spark issue #22861: [SPARK-25663][SPARK-25661][SQL][TEST] Refactor BuiltInDa...

2018-10-28 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22861 Personally I am against accessing the main args in such way. It looks a bit ugly. But if we have to move everything to `BenchmarkBase`, then maybe this is the way

[GitHub] spark pull request #22847: [SPARK-25850][SQL] Make the split threshold for t...

2018-10-26 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22847#discussion_r228600757 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -812,6 +812,17 @@ object SQLConf { .intConf

[GitHub] spark pull request #22848: [SPARK-25851][SQL][MINOR] Fix deprecated API warn...

2018-10-26 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22848#discussion_r228559067 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/ui/SQLListener.scala --- @@ -89,12 +89,12 @@ private class LongLongTupleConverter

[GitHub] spark pull request #22848: [SPARK-25851][SQL] Fix deprecated API warning in ...

2018-10-26 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22848 [SPARK-25851][SQL] Fix deprecated API warning in SQLListener ## What changes were proposed in this pull request? In https://github.com/apache/spark/pull/21596, Jackson is upgraded to

[GitHub] spark pull request #22823: [SPARK-25676][SQL][TEST] Improve BenchmarkWideTab...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22823#discussion_r228407600 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala --- @@ -910,12 +910,14 @@ class

[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22814#discussion_r228392145 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/package.scala --- @@ -31,10 +32,32 @@ package object avro { * @since 2.4.0

[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22814#discussion_r228388968 --- Diff: docs/sql-data-sources-avro.md --- @@ -177,6 +180,19 @@ Data source options of Avro can be set using the `.option` method on `DataFrameR

[GitHub] spark issue #22814: [SPARK-25819][SQL] Support parse mode option for the fun...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22814 retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #22814: [SPARK-25819][SQL] Support parse mode option for the fun...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22814 @cloud-fan @HyukjinKwon I have changed the default value to "FAILFAST" and remove the migration guide. See the PR description f

[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22814#discussion_r228148950 --- Diff: external/avro/src/test/scala/org/apache/spark/sql/avro/AvroFunctionsSuite.scala --- @@ -61,6 +59,24 @@ class AvroFunctionsSuite extends

[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22814#discussion_r228086740 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/AvroDataToCatalyst.scala --- @@ -21,16 +21,31 @@ import org.apache.avro.Schema

[GitHub] spark pull request #22814: [SPARK-25819][SQL] Support parse mode option for ...

2018-10-25 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22814#discussion_r228064742 --- Diff: docs/sql-data-sources-avro.md --- @@ -177,6 +180,18 @@ Data source options of Avro can be set using the `.option` method on `DataFrameR

[GitHub] spark issue #22814: [WIP][SPARK-25819][SQL] Support parse mode option for th...

2018-10-24 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22814 The major code is ready. I will add a new section in doc soon. --- - To unsubscribe, e-mail: reviews-unsubscr

[GitHub] spark pull request #22814: [WIP][SPARK-25819][SQL] Support parse mode option...

2018-10-24 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22814#discussion_r227755767 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionEvalHelper.scala --- @@ -60,9 +60,11 @@ trait

[GitHub] spark pull request #22814: [WIP][SPARK-25819][SQL] Support parse mode option...

2018-10-24 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22814 [WIP][SPARK-25819][SQL] Support parse mode option for the function `from_avro` ## What changes were proposed in this pull request? Current the function `from_avro` throws exception

[GitHub] spark issue #22663: [SPARK-25490][SQL][TEST] Fix OOM of KryoBenchmark due to...

2018-10-23 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22663 retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #22808: [SPARK-25812][UI][TEST] Fix test failure in PagedTableSu...

2018-10-23 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22808 Hi @dongjoon-hyun , I see. I have updated the title. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

[GitHub] spark issue #22808: [SPARK-25675][UI][FOLLOWUP][TEST] Fix test failure in Pa...

2018-10-23 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22808 @srowen @felixcheung @shivusondur --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #22808: [SPARK-25675][UI][FOLLOWUP][TEST] Fix test failur...

2018-10-23 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22808 [SPARK-25675][UI][FOLLOWUP][TEST] Fix test failure in PagedTableSuite ## What changes were proposed in this pull request? In https://github.com/apache/spark/pull/22668, the PR was

[GitHub] spark issue #22668: [SPARK-25675] [Spark Job History] Job UI page does not s...

2018-10-23 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22668 @srowen I think it is easy fix, let me fix it now. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For

[GitHub] spark issue #22237: [SPARK-25243][SQL] Use FailureSafeParser in from_json

2018-10-23 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22237 @HyukjinKwon Got it. I won't take it since you will do it. --- - To unsubscribe, e-mail: reviews-uns

[GitHub] spark issue #22663: [SPARK-25490][SQL][TEST] Fix OOM of KryoBenchmark due to...

2018-10-18 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22663 @dongjoon-hyun Sorry to reply late. I have updated the title and description. --- - To unsubscribe, e-mail: reviews

[GitHub] spark issue #22663: [SPARK-25490][SQL][TEST] Fix OOM of KryoBenchmark due to...

2018-10-18 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22663 Retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #22746: [SPARK-24499][SQL][DOC] Split the page of sql-programmin...

2018-10-17 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22746 This is cool +1 👍 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark pull request #22746: [SPARK-24499][SQL][DOC] Split the page of sql-pro...

2018-10-17 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22746#discussion_r225921716 --- Diff: docs/sql-reference.md --- @@ -0,0 +1,641 @@ +--- +layout: global +title: Reference +displayTitle: Reference

[GitHub] spark pull request #22752: [SPARK-24787][CORE] Revert hsync in EventLoggingL...

2018-10-17 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22752#discussion_r225846631 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/config.scala --- @@ -64,4 +64,11 @@ private[spark] object config

[GitHub] spark pull request #22752: [SPARK-24787][CORE] Revert hsync in EventLoggingL...

2018-10-17 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22752#discussion_r225845782 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/FsHistoryProvider.scala --- @@ -449,7 +450,7 @@ private[history] class

[GitHub] spark pull request #22753: [SPARK-25754][DOC] Change CDN for MathJax

2018-10-17 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22753#discussion_r225810815 --- Diff: docs/_layouts/global.html --- @@ -184,7 +184,8 @@ {{ page.title

[GitHub] spark pull request #22753: [SPARK-25754][DOC] Change CDN for MathJax

2018-10-17 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22753#discussion_r225809921 --- Diff: docs/_layouts/global.html --- @@ -184,7 +184,8 @@ {{ page.title

[GitHub] spark issue #22742: [SPARK-25588][WIP] SchemaParseException: Can't redefine:...

2018-10-16 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22742 Hi @heuermh , I left some comments in JIRA yesterday. I tried the test case in branch-2.3(with tag v2.3.1 and v2.3.0), the case is still reproduced by running: ``` ./build/sbt

[GitHub] spark issue #22753: [SPARK-25754][DOC] Change CDN for MathJax

2018-10-16 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22753 @srowen --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark pull request #22753: [SPARK-25754][DOC] Change CDN for MathJax

2018-10-16 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22753 [SPARK-25754][DOC] Change CDN for MathJax ## What changes were proposed in this pull request? Currently when we open our doc site: https://spark.apache.org/docs/latest/index.html

[GitHub] spark pull request #22746: [SPARK-24499][SQL][DOC] Split the page of sql-pro...

2018-10-16 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22746#discussion_r225783658 --- Diff: docs/sql-reference.md --- @@ -0,0 +1,641 @@ +--- +layout: global +title: Reference +displayTitle: Reference

[GitHub] spark issue #22744: [SPARK-25741][WebUI] Long URLs are not rendered properly...

2018-10-16 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22744 @srowen @vanzin --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark pull request #22744: [SPARK-25741][WebUI] Long URLs are not rendered p...

2018-10-16 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22744 [SPARK-25741][WebUI] Long URLs are not rendered properly in web UI ## What changes were proposed in this pull request? When the URL for description column in the table of job/stage

[GitHub] spark pull request #22727: [SPARK-25735][CORE][MINOR]Improve start-thriftser...

2018-10-16 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22727#discussion_r225418151 --- Diff: sbin/start-thriftserver.sh --- @@ -39,6 +39,10 @@ function usage { pattern+="\|Spark Command: "

[GitHub] spark pull request #22727: [SPARK-25735][CORE][MINOR]Improve start-thriftser...

2018-10-15 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22727#discussion_r225392908 --- Diff: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2.scala --- @@ -71,6 +71,12 @@ object

[GitHub] spark pull request #22727: [SPARK-25735][CORE][MINOR]Improve start-thriftser...

2018-10-15 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22727#discussion_r225175351 --- Diff: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2.scala --- @@ -71,6 +71,12 @@ object

[GitHub] spark issue #22727: [SPARK-25735][CORE][MINOR]Improve start-thriftserver.sh:...

2018-10-15 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22727 @srowen @dongjoon-hyun @jiangxb1987 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #22727: [SPARK-25735][CORE][MINOR]Improve start-thriftser...

2018-10-15 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22727 [SPARK-25735][CORE][MINOR]Improve start-thriftserver.sh: print clean usage and exit with code 1 ## What changes were proposed in this pull request? Currently if we run ``` sh

[GitHub] spark issue #22714: [SPARK-25720][WEBUI] Support auto refresh page for the W...

2018-10-14 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22714 BTW there is another approach: https://github.com/apache/spark/pull/21512 --- - To unsubscribe, e-mail: reviews-unsubscr

[GitHub] spark issue #22699: [SPARK-25711][Core] Improve start-history-server.sh: sho...

2018-10-13 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22699 retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark pull request #22709: [SPARK-25718][SQL]Detect recursive reference in A...

2018-10-12 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22709#discussion_r224831076 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/SchemaConverters.scala --- @@ -67,21 +71,28 @@ object SchemaConverters

[GitHub] spark pull request #22709: [SPARK-25718][SQL]Detect recursive reference in A...

2018-10-12 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22709 [SPARK-25718][SQL]Detect recursive reference in Avro schema and throw exception ## What changes were proposed in this pull request? Avro schema allows recursive reference, e.g. the

[GitHub] spark issue #22699: [SPARK-25711][Core] Improve start-history-server.sh: sho...

2018-10-12 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22699 retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #22699: [SPARK-25711][Core] Improve start-history-server.sh: sho...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22699 retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark pull request #22700: [SPARK-25712][Core][Minor] Improve usage message ...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22700#discussion_r224549311 --- Diff: sbin/start-master.sh --- @@ -31,7 +31,8 @@ if [[ "$@" = *--help ]] || [[ "$@" = *-h ]]; then echo "Usag

[GitHub] spark issue #22699: [SPARK-25711][Core] Allow start-history-server.sh to sho...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22699 Agree. Let me remove them in this PR. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #22700: [SPARK-25712][Core][Minor] Improve usage message ...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22700#discussion_r224537568 --- Diff: sbin/start-master.sh --- @@ -31,7 +31,8 @@ if [[ "$@" = *--help ]] || [[ "$@" = *-h ]]; then echo "Usag

[GitHub] spark pull request #22699: [SPARK-25711][Core] Allow history server to show ...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22699#discussion_r224508786 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServerArguments.scala --- @@ -34,26 +34,25 @@ private[history] class

[GitHub] spark pull request #22699: [SPARK-25711][Core] Allow history server to show ...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22699#discussion_r224506748 --- Diff: core/src/main/scala/org/apache/spark/deploy/history/HistoryServerArguments.scala --- @@ -34,26 +34,25 @@ private[history] class

[GitHub] spark pull request #22699: [SPARK-25711][Core] Allow history server to show ...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22699#discussion_r224506025 --- Diff: sbin/start-history-server.sh --- @@ -28,7 +28,22 @@ if [ -z "${SPARK_HOME}" ]; then export SPARK_HOME="$(cd "`di

[GitHub] spark pull request #22700: [SPARK-25712][Core][Minor] Improve usage message

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22700#discussion_r224502646 --- Diff: sbin/start-master.sh --- @@ -31,7 +31,8 @@ if [[ "$@" = *--help ]] || [[ "$@" = *-h ]]; then echo "Usag

[GitHub] spark pull request #22700: [SPARK-25712][Core][Minor] Improve usage message

2018-10-11 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22700 [SPARK-25712][Core][Minor] Improve usage message ## What changes were proposed in this pull request? Currently if we run ``` ./sbin/start-master.sh -h ``` We get

[GitHub] spark pull request #22699: [SPARK-25711][Core] Allow history server to show ...

2018-10-11 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22699 [SPARK-25711][Core] Allow history server to show usage ## What changes were proposed in this pull request? Currently, if we try run ``` ./start-history-server.sh -h

[GitHub] spark issue #22668: [SPARK-25675] [Spark Job History] Job UI page does not s...

2018-10-11 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22668 LGTM --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark pull request #22668: [SPARK-25675] [Spark Job History] Job UI page doe...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22668#discussion_r224323509 --- Diff: core/src/main/scala/org/apache/spark/ui/PagedTable.scala --- @@ -154,9 +150,6 @@ private[ui] trait PagedTable[T

[GitHub] spark pull request #22684: [SPARK-25699][SQL] Partially push down conjunctiv...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22684#discussion_r224186347 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcFilters.scala --- @@ -138,39 +138,75 @@ private[sql] object

[GitHub] spark issue #22595: [SPARK-25577][Web UI] Add an on-off switch to display th...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22595 I read the original PR https://github.com/apache/spark/pull/14617, and it is wired that the two columns are hidden in the very beginning. Also the screenshot in PR description doesn&#

[GitHub] spark pull request #22668: [SPARK-25675] [Spark Job History] Job UI page doe...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22668#discussion_r224101829 --- Diff: core/src/main/scala/org/apache/spark/ui/PagedTable.scala --- @@ -123,10 +123,9 @@ private[ui] trait PagedTable[T

[GitHub] spark pull request #22687: [SPARK-25702][SQL] Push down filters with `Not` o...

2018-10-10 Thread gengliangwang
Github user gengliangwang closed the pull request at: https://github.com/apache/spark/pull/22687 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark issue #22687: [SPARK-25702][SQL] Push down filters with `Not` operator...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22687 It's OK. Close this one. Thanks for reviewing @viirya @HyukjinKwon --- - To unsubscribe, e-mail: reviews-uns

[GitHub] spark issue #22687: [SPARK-25702][SQL] Push down filters with `Not` operator...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22687 @viirya @HyukjinKwon I did the code changes and then I found the condition is not reachable, as I have stated in PR description. Just feel that it won't hurt to have such handli

[GitHub] spark pull request #22687: [SPARK-25702][SQL] Push down filters with `Not` o...

2018-10-10 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22687 [SPARK-25702][SQL] Push down filters with `Not` operator in Parquet ## What changes were proposed in this pull request? Currently, in ParquetFilters, predicates inside `Not` operator

[GitHub] spark issue #22684: [SPARK-25699][SQL] Partially push down conjunctive predi...

2018-10-10 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22684 @dbtsai @gatorsmile --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark pull request #22684: [SPARK-25699][SQL] Partially push down conjunctiv...

2018-10-10 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22684 [SPARK-25699][SQL] Partially push down conjunctive predicated in Orc ## What changes were proposed in this pull request? Inspired by https://github.com/apache/spark/pull/22574

[GitHub] spark issue #22663: [SPARK-25490][SQL][TEST] Refactor KryoBenchmark to use m...

2018-10-08 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22663 @dongjoon-hyun Is the changes OK to you? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #22643: [SPARK-25630][TEST] Reduce test time of HadoopFsRelation...

2018-10-07 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22643 @dongjoon-hyun please take another look, thanks! --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For

[GitHub] spark pull request #22663: [SPARK-25490][SQL][TEST] Refactor KryoBenchmark t...

2018-10-07 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22663#discussion_r223221336 --- Diff: core/benchmarks/KryoBenchmark-results.txt --- @@ -0,0 +1,29

[GitHub] spark pull request #22663: [SPARK-25490][SQL][TEST] Refactor KryoBenchmark t...

2018-10-07 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22663 [SPARK-25490][SQL][TEST] Refactor KryoBenchmark to use main method ## What changes were proposed in this pull request? Also make the scale smaller(reduce to 20%) to avoid OOM

[GitHub] spark issue #22643: [SPARK-25630][TEST] Reduce test time of HadoopFsRelation...

2018-10-06 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22643 retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail

[GitHub] spark issue #22642: [SPARK-25653][TEST] Add tag ExtendedHiveTest for HiveSpa...

2018-10-05 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22642 @dongjoon-hyun thanks, I have created new Jira ticket: https://issues.apache.org/jira/browse/SPARK-25653 --- - To

[GitHub] spark pull request #22643: [SPARK-25630][TEST] Reduce test time of HadoopFsR...

2018-10-05 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22643 [SPARK-25630][TEST] Reduce test time of HadoopFsRelationTest ## What changes were proposed in this pull request? There was 5 suites extends `HadoopFsRelationTest`, for testing &quo

[GitHub] spark pull request #22642: [SPARK-25613][SPARK-25614][TEST] Add tag Extended...

2018-10-05 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22642 [SPARK-25613][SPARK-25614][TEST] Add tag ExtendedHiveTest for HiveSparkSubmitSuite ## What changes were proposed in this pull request? The total run time of `HiveSparkSubmitSuite

[GitHub] spark pull request #22640: [SPARK-25621][SPARK-25622][TEST] Reduce test time...

2018-10-05 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22640 [SPARK-25621][SPARK-25622][TEST] Reduce test time of BucketedReadWithHiveSupportSuite ## What changes were proposed in this pull request? By reduce the number of loops. - `read

[GitHub] spark issue #22611: [SPARK-25595] Ignore corrupt Avro files if flag IGNORE_C...

2018-10-03 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22611 @HyukjinKwon Thanks for the review :) --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #22611: [SPARK-25595] Ignore corrupt Avro files if flag I...

2018-10-03 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22611#discussion_r16887 --- Diff: external/avro/src/main/scala/org/apache/spark/sql/avro/AvroFileFormat.scala --- @@ -100,6 +77,50 @@ private[avro] class AvroFileFormat

[GitHub] spark issue #22609: [SPARK-25594] [Core] Avoid maintaining task information ...

2018-10-02 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22609 Hi @mridulm , it seems that the changes will affect the metrics in `SparkStatusTracker`, e.g. number of active tasks in `getExecutorInfos` or `getStageInfo` . Maybe we

[GitHub] spark issue #22599: [SPARK-25581][SQL] Rename method `benchmark` as `runBenc...

2018-10-02 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22599 Discuss with @cloud-fan offline. Rename method `benchmark` as `runBenchmarkSuite`. Also add comment to guide developers to use `runBenchmark` for each scenario in implementations

[GitHub] spark pull request #22611: [SPARK-25595] Ignore corrupt Avro files if flag I...

2018-10-02 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22611 [SPARK-25595] Ignore corrupt Avro files if flag IGNORE_CORRUPT_FILES enabled ## What changes were proposed in this pull request? With flag IGNORE_CORRUPT_FILES enabled, schema

[GitHub] spark pull request #22599: [SPARK-25581][SQL] Rename method `benchmark` as `...

2018-10-01 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22599#discussion_r221637241 --- Diff: core/src/test/scala/org/apache/spark/benchmark/BenchmarkBase.scala --- @@ -25,7 +25,10 @@ import java.io.{File, FileOutputStream

[GitHub] spark pull request #22599: [SPARK-25581][SQL] Rename method `benchmark` as `...

2018-10-01 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22599#discussion_r221636295 --- Diff: core/src/test/scala/org/apache/spark/benchmark/BenchmarkBase.scala --- @@ -25,7 +25,10 @@ import java.io.{File, FileOutputStream

[GitHub] spark issue #22599: [SPARK-25581][SQL] Rename method `benchmark` as `benchma...

2018-10-01 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22599 @wangyum @dongjoon-hyun @cloud-fan --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #22599: [SPARK-25581][SQL] Rename method `benchmark` as `...

2018-10-01 Thread gengliangwang
GitHub user gengliangwang opened a pull request: https://github.com/apache/spark/pull/22599 [SPARK-25581][SQL] Rename method `benchmark` as `benchmarkSuite` in `BenchmarkBase` ## What changes were proposed in this pull request? Rename method `benchmark` in `BenchmarkBase

[GitHub] spark pull request #22484: [SPARK-25476][SPARK-25510][TEST] Refactor Aggrega...

2018-09-28 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22484#discussion_r221417293 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/SqlBasedBenchmark.scala --- @@ -0,0 +1,60 @@ +/* + * Licensed to

[GitHub] spark pull request #22484: [SPARK-25476][SPARK-25510][TEST] Refactor Aggrega...

2018-09-28 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22484#discussion_r221416202 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/SqlBasedBenchmark.scala --- @@ -0,0 +1,60 @@ +/* + * Licensed to

[GitHub] spark pull request #22484: [SPARK-25476][SPARK-25510][TEST] Refactor Aggrega...

2018-09-28 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22484#discussion_r221415701 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/SqlBasedBenchmark.scala --- @@ -0,0 +1,60 @@ +/* + * Licensed to

[GitHub] spark pull request #22484: [SPARK-25476][SPARK-25510][TEST] Refactor Aggrega...

2018-09-28 Thread gengliangwang
Github user gengliangwang commented on a diff in the pull request: https://github.com/apache/spark/pull/22484#discussion_r221415642 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/SqlBasedBenchmark.scala --- @@ -0,0 +1,60 @@ +/* + * Licensed to

[GitHub] spark issue #22237: [SPARK-25243][SQL] Use FailureSafeParser in from_json

2018-09-27 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22237 I manually ran some queries. My point was about the value of the column `from_json`, `null` vs `Row(null, null, ...)`. Now I am +1 with the proposal of this PR, making the column

[GitHub] spark issue #22237: [SPARK-25243][SQL] Use FailureSafeParser in from_json

2018-09-27 Thread gengliangwang
Github user gengliangwang commented on the issue: https://github.com/apache/spark/pull/22237 Hi @MaxGekk , I just reviewed this PR. I noticed that there is one behavior change. The column value of `from_json(corrupt_record...)` become `Row(null, nulll, ...)`, instead of `null

<    1   2   3   4   5   6   7   8   9   >