[GitHub] [spark] SparkQA removed a comment on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654168938 **[Test build #125068 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125068/testReport)** for PR 28987 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28859: [SPARK-32024][WEBUI] Update ApplicationStoreInfo.size during HistoryServerDiskManager initializing

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28859: URL: https://github.com/apache/spark/pull/28859#issuecomment-654194495 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28957: [WIP][SPARK-32138] Drop Python 2.7, 3.4 and 3.5

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28957: URL: https://github.com/apache/spark/pull/28957#issuecomment-654193397 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] HeartSaVioR commented on pull request #29011: [WIP][SPARK-XXXXX][SQL] Parallelize HashAggregationQueryWithControlledFallbackSuite

2020-07-06 Thread GitBox
HeartSaVioR commented on pull request #29011: URL: https://github.com/apache/spark/pull/29011#issuecomment-654199443 retest this, please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29012: [SPARK-31710][SQL][FOLLOWUP] Allow cast numeric to timestamp by default

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #29012: URL: https://github.com/apache/spark/pull/29012#issuecomment-654198906 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] attilapiros commented on pull request #28967: [SPARK-32149][SHUFFLE] Improve file path name normalisation at block resolution within the external shuffle service

2020-07-06 Thread GitBox
attilapiros commented on pull request #28967: URL: https://github.com/apache/spark/pull/28967#issuecomment-654206994 Jenkins retest this please This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] GuoPhilipse removed a comment on pull request #29009: [SPARK-32193][SQL] Update migrate guide docs on regexp function

2020-07-06 Thread GitBox
GuoPhilipse removed a comment on pull request #29009: URL: https://github.com/apache/spark/pull/29009#issuecomment-654168933 > No, @GuoPhilipse, I meant if it's supported in `SELECT REGEXP('abc', '([a-z]+)');` way. No,it's a different way.

[GitHub] [spark] maropu commented on pull request #29009: [SPARK-32193][SQL] Update migrate guide docs on regexp function

2020-07-06 Thread GitBox
maropu commented on pull request #29009: URL: https://github.com/apache/spark/pull/29009#issuecomment-654218179 Yea, I checked the doc @HyukjinKwon put above, and it seems the current hive only supports `REGEXP` only in a SQL syntax.

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28808: [SPARK-31975][SQL] Show AnalysisException when WindowFunction is used without WindowExpression

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28808: URL: https://github.com/apache/spark/pull/28808#issuecomment-654221616 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] LantaoJin edited a comment on pull request #28901: [SPARK-32064][SQL] Supporting create temporary table

2020-07-06 Thread GitBox
LantaoJin edited a comment on pull request #28901: URL: https://github.com/apache/spark/pull/28901#issuecomment-654224633 > If I write the output to a temp location and then create a temp view to read from this temp location, ... Ah, I knew your meaning now, using `CREATE TEMP VIEW

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28970: [SPARK-31723][CORE][TEST] Reenable one test case in HistoryServerSuite

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28970: URL: https://github.com/apache/spark/pull/28970#issuecomment-654007871 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654226098 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins commented on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654243444 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
SparkQA commented on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654242914 **[Test build #125086 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125086/testReport)** for PR 28986 at commit

[GitHub] [spark] SparkQA commented on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
SparkQA commented on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654258213 **[Test build #125086 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125086/testReport)** for PR 28986 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28996: [SPARK-29358][SQL] Make unionByName optionally fill missing columns with nulls

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28996: URL: https://github.com/apache/spark/pull/28996#issuecomment-654262794 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28996: [SPARK-29358][SQL] Make unionByName optionally fill missing columns with nulls

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28996: URL: https://github.com/apache/spark/pull/28996#issuecomment-654262786 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28833: [SPARK-20680][SQL] Spark-sql do not support for creating table with void column datatype

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28833: URL: https://github.com/apache/spark/pull/28833#issuecomment-654274016 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28833: [SPARK-20680][SQL] Spark-sql do not support for creating table with void column datatype

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28833: URL: https://github.com/apache/spark/pull/28833#issuecomment-654274016 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28961: [SPARK-32143][SQL] Prevent a skewed join from producing too many partition splits

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28961: URL: https://github.com/apache/spark/pull/28961#issuecomment-654279739 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] ulysses-you opened a new pull request #29013: [SPARK-32196][SQL] Extract In convertible part if it is not convertible

2020-07-06 Thread GitBox
ulysses-you opened a new pull request #29013: URL: https://github.com/apache/spark/pull/29013 ### What changes were proposed in this pull request? Modify `OptimizeIn`, extract In convertible part if it is not convertible. ### Why are the changes needed? Try to

[GitHub] [spark] Ngone51 commented on a change in pull request #28979: [SPARK-32154][SQL] Use ExpressionEncoder for the return type of ScalaUDF to convert to catalyst type

2020-07-06 Thread GitBox
Ngone51 commented on a change in pull request #28979: URL: https://github.com/apache/spark/pull/28979#discussion_r450268964 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala ## @@ -102,6 +105,28 @@ case class ScalaUDF( }

[GitHub] [spark] GuoPhilipse commented on pull request #29009: [SPARK-32193][SQL] Update migrate guide docs on regexp function

2020-07-06 Thread GitBox
GuoPhilipse commented on pull request #29009: URL: https://github.com/apache/spark/pull/29009#issuecomment-654163788 > I think you don't need to file jira for this kind of minor doc fixes. Btw, any other systems supporting `REGEXP` for regular expressions other than Hive? I think we might

[GitHub] [spark] AngersZhuuuu commented on pull request #27983: [SPARK-32105][SQL]Refactor current ScriptTransformationExec code

2020-07-06 Thread GitBox
AngersZh commented on pull request #27983: URL: https://github.com/apache/spark/pull/27983#issuecomment-654167938 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #27983: [SPARK-32105][SQL]Refactor current ScriptTransformationExec code

2020-07-06 Thread GitBox
AngersZh commented on a change in pull request #27983: URL: https://github.com/apache/spark/pull/27983#discussion_r450149433 ## File path: sql/core/src/main/scala/org/apache/spark/sql/execution/script/ScriptTransformbase.scala ## @@ -0,0 +1,167 @@ +/* Review comment:

[GitHub] [spark] SparkQA commented on pull request #28912: [SPARK-32057][SQL][test-hive1.2][test-hadoop2.7] ExecuteStatement: cancel and close should not transiently ERROR

2020-07-06 Thread GitBox
SparkQA commented on pull request #28912: URL: https://github.com/apache/spark/pull/28912#issuecomment-654174667 **[Test build #125056 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125056/testReport)** for PR 28912 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #28912: [SPARK-32057][SQL][test-hive1.2][test-hadoop2.7] ExecuteStatement: cancel and close should not transiently ERROR

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28912: URL: https://github.com/apache/spark/pull/28912#issuecomment-654096687 **[Test build #125056 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125056/testReport)** for PR 28912 at commit

[GitHub] [spark] HeartSaVioR edited a comment on pull request #29011: [WIP][SPARK-XXXXX][SQL] Parallelize HashAggregationQueryWithControlledFallbackSuite

2020-07-06 Thread GitBox
HeartSaVioR edited a comment on pull request #29011: URL: https://github.com/apache/spark/pull/29011#issuecomment-654170747 Rationalization: it turns out that HashAggregationQueryWithFallbackSuite is more than 11x slower compared to HashAggregationQuerySuite. (It ran same test with 12

[GitHub] [spark] SparkQA commented on pull request #28683: [SPARK-31875][SQL] Provide a option to disable user supplied Hints

2020-07-06 Thread GitBox
SparkQA commented on pull request #28683: URL: https://github.com/apache/spark/pull/28683#issuecomment-654183958 **[Test build #125037 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125037/testReport)** for PR 28683 at commit

[GitHub] [spark] SparkQA commented on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
SparkQA commented on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654189907 **[Test build #125068 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125068/testReport)** for PR 28987 at commit

[GitHub] [spark] SparkQA removed a comment on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654089524 **[Test build #125049 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125049/testReport)** for PR 28986 at commit

[GitHub] [spark] SparkQA commented on pull request #28683: [SPARK-31875][SQL] Provide a option to disable user supplied Hints

2020-07-06 Thread GitBox
SparkQA commented on pull request #28683: URL: https://github.com/apache/spark/pull/28683#issuecomment-654196780 **[Test build #125075 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125075/testReport)** for PR 28683 at commit

[GitHub] [spark] Ngone51 edited a comment on pull request #28629: [SPARK-31769][CORE] Add MDC support for driver threads

2020-07-06 Thread GitBox
Ngone51 edited a comment on pull request #28629: URL: https://github.com/apache/spark/pull/28629#issuecomment-653385789 I'm fine to remove the prefix if we want to inherit the MDC properties directly since I agree API consistent is more important. And I think we need to document it

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28997: [SPARK-32172][CORE]Use createDirectory instead of mkdir

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28997: URL: https://github.com/apache/spark/pull/28997#issuecomment-654202805 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29011: [WIP][SPARK-XXXXX][SQL] Parallelize HashAggregationQueryWithControlledFallbackSuite

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #29011: URL: https://github.com/apache/spark/pull/29011#issuecomment-654202523 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28859: [SPARK-32024][WEBUI] Update ApplicationStoreInfo.size during HistoryServerDiskManager initializing

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28859: URL: https://github.com/apache/spark/pull/28859#issuecomment-654202458 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #29011: [WIP][SPARK-XXXXX][SQL] Parallelize HashAggregationQueryWithControlledFallbackSuite

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #29011: URL: https://github.com/apache/spark/pull/29011#issuecomment-654202523 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28859: [SPARK-32024][WEBUI] Update ApplicationStoreInfo.size during HistoryServerDiskManager initializing

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28859: URL: https://github.com/apache/spark/pull/28859#issuecomment-654202458 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #28997: [SPARK-32172][CORE]Use createDirectory instead of mkdir

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28997: URL: https://github.com/apache/spark/pull/28997#issuecomment-654202805 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] cloud-fan commented on pull request #28808: [SPARK-31975][SQL] Show AnalysisException when WindowFunction is used without WindowExpression

2020-07-06 Thread GitBox
cloud-fan commented on pull request #28808: URL: https://github.com/apache/spark/pull/28808#issuecomment-654207956 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] SparkQA commented on pull request #29012: [SPARK-31710][SQL][FOLLOWUP] Allow cast numeric to timestamp by default

2020-07-06 Thread GitBox
SparkQA commented on pull request #29012: URL: https://github.com/apache/spark/pull/29012#issuecomment-654208910 **[Test build #125080 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125080/testReport)** for PR 29012 at commit

[GitHub] [spark] SparkQA commented on pull request #28808: [SPARK-31975][SQL] Show AnalysisException when WindowFunction is used without WindowExpression

2020-07-06 Thread GitBox
SparkQA commented on pull request #28808: URL: https://github.com/apache/spark/pull/28808#issuecomment-654208965 **[Test build #125082 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125082/testReport)** for PR 28808 at commit

[GitHub] [spark] HyukjinKwon commented on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
HyukjinKwon commented on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654209030 Woah, finally This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [spark] HyukjinKwon commented on pull request #28957: [WIP][SPARK-32138] Drop Python 2.7, 3.4 and 3.5

2020-07-06 Thread GitBox
HyukjinKwon commented on pull request #28957: URL: https://github.com/apache/spark/pull/28957#issuecomment-654208955 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] maropu commented on pull request #29009: [SPARK-32193][SQL] Update migrate guide docs on regexp function

2020-07-06 Thread GitBox
maropu commented on pull request #29009: URL: https://github.com/apache/spark/pull/29009#issuecomment-654207971 I get a bit confiused and Hive really supports REGEX as a function? ``` // REGEXP case in hive (3.1.1) hive> SELECT 'abc' REGEXP '([a-z]+)'; OK true hive>

[GitHub] [spark] cloud-fan commented on pull request #29012: [SPARK-31710][SQL][FOLLOWUP] Allow cast numeric to timestamp by default

2020-07-06 Thread GitBox
cloud-fan commented on pull request #29012: URL: https://github.com/apache/spark/pull/29012#issuecomment-654208091 retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] SparkQA commented on pull request #28808: [SPARK-31975][SQL] Show AnalysisException when WindowFunction is used without WindowExpression

2020-07-06 Thread GitBox
SparkQA commented on pull request #28808: URL: https://github.com/apache/spark/pull/28808#issuecomment-654221405 **[Test build #125082 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125082/testReport)** for PR 28808 at commit

[GitHub] [spark] gengliangwang commented on pull request #28970: [SPARK-31723][CORE][TEST] Reenable one test case in HistoryServerSuite

2020-07-06 Thread GitBox
gengliangwang commented on pull request #28970: URL: https://github.com/apache/spark/pull/28970#issuecomment-654226200 Retest this please This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654226087 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] SparkQA removed a comment on pull request #28986: [SPARK-32160][CORE][PYSPARK] Disallow to create SparkContext in executors.

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28986: URL: https://github.com/apache/spark/pull/28986#issuecomment-654212573 **[Test build #125083 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125083/testReport)** for PR 28986 at commit

[GitHub] [spark] StefanXiepj commented on pull request #29010: [SPARK-32192][SQL] Print column name when throws ClassCastException

2020-07-06 Thread GitBox
StefanXiepj commented on pull request #29010: URL: https://github.com/apache/spark/pull/29010#issuecomment-654239759 > Thanks for your contribution, @StefanXiepj ! btw, which Spark version you used? I think the current master does not accept the alter command; > > ``` > scala>

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28965: [SPARK-32124][CORE][FOLLOW-UP] Use the invalid value Int.MinValue to fill the map index when the event logs from the old Spark

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28965: URL: https://github.com/apache/spark/pull/28965#issuecomment-654254388 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28965: [SPARK-32124][CORE][FOLLOW-UP] Use the invalid value Int.MinValue to fill the map index when the event logs from the old Spark

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28965: URL: https://github.com/apache/spark/pull/28965#issuecomment-654254379 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] wankunde commented on pull request #28944: [SPARK-32128][SQL]import SQLConf.PARTITION_OVERWRITE_VERIFY_PATH config

2020-07-06 Thread GitBox
wankunde commented on pull request #28944: URL: https://github.com/apache/spark/pull/28944#issuecomment-654255062 @holdenk Could you help review this PR ? This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28904: [SPARK-30462][SS] Streamline the logic on file stream source and sink to avoid memory issue

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28904: URL: https://github.com/apache/spark/pull/28904#issuecomment-654265205 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28965: [SPARK-32124][CORE][FOLLOW-UP] Use the invalid value Int.MinValue to fill the map index when the event logs from the old Spark

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28965: URL: https://github.com/apache/spark/pull/28965#issuecomment-654265147 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #29007: [SPARK-XXXXX][SQL][DOCS] consistency in argument naming for time functions

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #29007: URL: https://github.com/apache/spark/pull/29007#issuecomment-654265182 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28904: [SPARK-30462][SS] Streamline the logic on file stream source and sink to avoid memory issue

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28904: URL: https://github.com/apache/spark/pull/28904#issuecomment-654265205 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29007: [SPARK-XXXXX][SQL][DOCS] consistency in argument naming for time functions

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #29007: URL: https://github.com/apache/spark/pull/29007#issuecomment-654265182 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #27019: [SPARK-30027][SQL] Support codegen for aggregate filters in HashAggregateExec

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #27019: URL: https://github.com/apache/spark/pull/27019#issuecomment-654059548 **[Test build #125042 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125042/testReport)** for PR 27019 at commit

[GitHub] [spark] SparkQA commented on pull request #28961: [SPARK-32143][SQL] Prevent a skewed join from producing too many partition splits

2020-07-06 Thread GitBox
SparkQA commented on pull request #28961: URL: https://github.com/apache/spark/pull/28961#issuecomment-654276117 **[Test build #125045 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125045/testReport)** for PR 28961 at commit

[GitHub] [spark] SparkQA commented on pull request #27019: [SPARK-30027][SQL] Support codegen for aggregate filters in HashAggregateExec

2020-07-06 Thread GitBox
SparkQA commented on pull request #27019: URL: https://github.com/apache/spark/pull/27019#issuecomment-654275512 **[Test build #125042 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125042/testReport)** for PR 27019 at commit

[GitHub] [spark] Ngone51 commented on a change in pull request #28979: [SPARK-32154][SQL] Use ExpressionEncoder for the return type of ScalaUDF to convert to catalyst type

2020-07-06 Thread GitBox
Ngone51 commented on a change in pull request #28979: URL: https://github.com/apache/spark/pull/28979#discussion_r450269218 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala ## @@ -2819,13 +2819,12 @@ class Analyzer( case

[GitHub] [spark] SparkQA commented on pull request #28926: [SPARK-32133][SQL] Forbid time field steps for date start/end in Sequence

2020-07-06 Thread GitBox
SparkQA commented on pull request #28926: URL: https://github.com/apache/spark/pull/28926#issuecomment-654280343 **[Test build #125059 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125059/testReport)** for PR 28926 at commit

[GitHub] [spark] Ngone51 commented on a change in pull request #28979: [SPARK-32154][SQL] Use ExpressionEncoder for the return type of ScalaUDF to convert to catalyst type

2020-07-06 Thread GitBox
Ngone51 commented on a change in pull request #28979: URL: https://github.com/apache/spark/pull/28979#discussion_r450269700 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/ScalaUDF.scala ## @@ -36,6 +36,8 @@ import

[GitHub] [spark] AmplabJenkins commented on pull request #27019: [SPARK-30027][SQL] Support codegen for aggregate filters in HashAggregateExec

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #27019: URL: https://github.com/apache/spark/pull/27019#issuecomment-654280445 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #28979: [SPARK-32154][SQL] Use ExpressionEncoder for the return type of ScalaUDF to convert to catalyst type

2020-07-06 Thread GitBox
SparkQA commented on pull request #28979: URL: https://github.com/apache/spark/pull/28979#issuecomment-654280966 **[Test build #125093 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125093/testReport)** for PR 28979 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27019: [SPARK-30027][SQL] Support codegen for aggregate filters in HashAggregateExec

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #27019: URL: https://github.com/apache/spark/pull/27019#issuecomment-654280445 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA removed a comment on pull request #28926: [SPARK-32133][SQL] Forbid time field steps for date start/end in Sequence

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28926: URL: https://github.com/apache/spark/pull/28926#issuecomment-654141149 **[Test build #125059 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125059/testReport)** for PR 28926 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28961: [SPARK-32143][SQL] Prevent a skewed join from producing too many partition splits

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28961: URL: https://github.com/apache/spark/pull/28961#issuecomment-654279756 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] SparkQA commented on pull request #29013: [SPARK-32196][SQL] Extract In convertible part if it is not convertible

2020-07-06 Thread GitBox
SparkQA commented on pull request #29013: URL: https://github.com/apache/spark/pull/29013#issuecomment-654280952 **[Test build #125092 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125092/testReport)** for PR 29013 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28961: [SPARK-32143][SQL] Prevent a skewed join from producing too many partition splits

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28961: URL: https://github.com/apache/spark/pull/28961#issuecomment-654279739 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654161339 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654161345 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27366: [SPARK-30648][SQL] Support filters pushdown in JSON datasource

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #27366: URL: https://github.com/apache/spark/pull/27366#issuecomment-654164927 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] MaxGekk commented on pull request #27366: [SPARK-30648][SQL] Support filters pushdown in JSON datasource

2020-07-06 Thread GitBox
MaxGekk commented on pull request #27366: URL: https://github.com/apache/spark/pull/27366#issuecomment-654165766 jenkins, retest this, please This is an automated message from the Apache Git Service. To respond to the

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654169294 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins commented on pull request #27983: [SPARK-32105][SQL]Refactor current ScriptTransformationExec code

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #27983: URL: https://github.com/apache/spark/pull/27983#issuecomment-654169356 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA commented on pull request #27366: [SPARK-30648][SQL] Support filters pushdown in JSON datasource

2020-07-06 Thread GitBox
SparkQA commented on pull request #27366: URL: https://github.com/apache/spark/pull/27366#issuecomment-654169106 **[Test build #125071 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125071/testReport)** for PR 27366 at commit

[GitHub] [spark] SparkQA commented on pull request #28991: [SPARK-26533][SQL][test-hive1.2][test-hadoop2.7] Support query auto timeout cancel on thriftserver

2020-07-06 Thread GitBox
SparkQA commented on pull request #28991: URL: https://github.com/apache/spark/pull/28991#issuecomment-654169129 **[Test build #125063 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125063/testReport)** for PR 28991 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #29011: [WIP][SPARK-XXXXX][SQL] Parallelize HashAggregationQueryWithControlledFallbackSuite

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #29011: URL: https://github.com/apache/spark/pull/29011#issuecomment-654169224 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28992: [SPARK-32167][SQL] Fix GetArrayStructFields to respect inner field's nullability together

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28992: URL: https://github.com/apache/spark/pull/28992#issuecomment-654169336 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-654169362 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins commented on pull request #28987: [SPARK-32162][PYTHON][TESTS] Improve error message of Pandas grouped map test with window

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28987: URL: https://github.com/apache/spark/pull/28987#issuecomment-654169294 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] SparkQA removed a comment on pull request #28991: [SPARK-26533][SQL][test-hive1.2][test-hadoop2.7] Support query auto timeout cancel on thriftserver

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28991: URL: https://github.com/apache/spark/pull/28991#issuecomment-654151070 **[Test build #125063 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125063/testReport)** for PR 28991 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28991: [SPARK-26533][SQL][test-hive1.2][test-hadoop2.7] Support query auto timeout cancel on thriftserver

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28991: URL: https://github.com/apache/spark/pull/28991#issuecomment-654169570 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28969: [SPARK-32150][BUILD] Upgrade to ZStd 1.4.5-4

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28969: URL: https://github.com/apache/spark/pull/28969#issuecomment-654169362 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27983: [SPARK-32105][SQL]Refactor current ScriptTransformationExec code

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #27983: URL: https://github.com/apache/spark/pull/27983#issuecomment-654169356 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28992: [SPARK-32167][SQL] Fix GetArrayStructFields to respect inner field's nullability together

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28992: URL: https://github.com/apache/spark/pull/28992#issuecomment-654169336 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] AmplabJenkins removed a comment on pull request #29011: [WIP][SPARK-XXXXX][SQL] Parallelize HashAggregationQueryWithControlledFallbackSuite

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #29011: URL: https://github.com/apache/spark/pull/29011#issuecomment-654169224 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] SparkQA commented on pull request #28992: [SPARK-32167][SQL] Fix GetArrayStructFields to respect inner field's nullability together

2020-07-06 Thread GitBox
SparkQA commented on pull request #28992: URL: https://github.com/apache/spark/pull/28992#issuecomment-654168934 **[Test build #125066 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125066/testReport)** for PR 28992 at commit

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28912: [SPARK-32057][SQL][test-hive1.2][test-hadoop2.7] ExecuteStatement: cancel and close should not transiently ERROR

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28912: URL: https://github.com/apache/spark/pull/28912#issuecomment-654175665 Merged build finished. Test FAILed. This is an automated message from the Apache Git Service. To

[GitHub] [spark] cloud-fan commented on a change in pull request #27428: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-06 Thread GitBox
cloud-fan commented on a change in pull request #27428: URL: https://github.com/apache/spark/pull/27428#discussion_r450157642 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/RewriteDistinctAggregates.scala ## @@ -148,24 +204,105 @@ object

[GitHub] [spark] AmplabJenkins removed a comment on pull request #28912: [SPARK-32057][SQL][test-hive1.2][test-hadoop2.7] ExecuteStatement: cancel and close should not transiently ERROR

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #28912: URL: https://github.com/apache/spark/pull/28912#issuecomment-654175670 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] [spark] AmplabJenkins removed a comment on pull request #27066: [SPARK-31317][SQL] Add withField method to Column

2020-07-06 Thread GitBox
AmplabJenkins removed a comment on pull request #27066: URL: https://github.com/apache/spark/pull/27066#issuecomment-654175994 This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [spark] cloud-fan commented on a change in pull request #27428: [SPARK-30276][SQL] Support Filter expression allows simultaneous use of DISTINCT

2020-07-06 Thread GitBox
cloud-fan commented on a change in pull request #27428: URL: https://github.com/apache/spark/pull/27428#discussion_r450157305 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/RewriteDistinctAggregates.scala ## @@ -118,7 +118,63 @@ import

[GitHub] [spark] AmplabJenkins commented on pull request #27066: [SPARK-31317][SQL] Add withField method to Column

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #27066: URL: https://github.com/apache/spark/pull/27066#issuecomment-654175994 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [spark] bart-samwel commented on a change in pull request #28996: [SPARK-29358][SQL] Make unionByName optionally fill missing columns with nulls

2020-07-06 Thread GitBox
bart-samwel commented on a change in pull request #28996: URL: https://github.com/apache/spark/pull/28996#discussion_r450166216 ## File path: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala ## @@ -2656,6 +2656,14 @@ object SQLConf {

[GitHub] [spark] SparkQA removed a comment on pull request #28683: [SPARK-31875][SQL] Provide a option to disable user supplied Hints

2020-07-06 Thread GitBox
SparkQA removed a comment on pull request #28683: URL: https://github.com/apache/spark/pull/28683#issuecomment-654055707 **[Test build #125037 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125037/testReport)** for PR 28683 at commit

[GitHub] [spark] AmplabJenkins commented on pull request #28683: [SPARK-31875][SQL] Provide a option to disable user supplied Hints

2020-07-06 Thread GitBox
AmplabJenkins commented on pull request #28683: URL: https://github.com/apache/spark/pull/28683#issuecomment-654185225 This is an automated message from the Apache Git Service. To respond to the message, please log on to

<    1   2   3   4   5   6   7   8   9   10   >