[GitHub] [spark] AmplabJenkins commented on pull request #28360: [SPARK-31569][SQL][DOCS] Add links to subsections in SQL Reference main page

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28360:
URL: https://github.com/apache/spark/pull/28360#issuecomment-619742496







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28360: [SPARK-31569][SQL][DOCS] Add links to subsections in SQL Reference main page

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28360:
URL: https://github.com/apache/spark/pull/28360#issuecomment-619742496







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28360: [SPARK-31569][SQL][DOCS] Add links to subsections in SQL Reference main page

2020-04-26 Thread GitBox


SparkQA commented on pull request #28360:
URL: https://github.com/apache/spark/pull/28360#issuecomment-619741862


   **[Test build #121876 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121876/testReport)**
 for PR 28360 at commit 
[`ea6f546`](https://github.com/apache/spark/commit/ea6f546b47f702f1369f50226463cb34a483f6e9).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] huaxingao opened a new pull request #28360: [SPARK-31569][SQL][DOCS] Add links to subsections in SQL Reference main page

2020-04-26 Thread GitBox


huaxingao opened a new pull request #28360:
URL: https://github.com/apache/spark/pull/28360


   
   
   ### What changes were proposed in this pull request?
   Add links to subsections in SQL Reference main page
   
   
   ### Why are the changes needed?
   Make SQL Reference complete
   
   
   ### Does this PR introduce any user-facing change?
   Yes
   before:
   https://user-images.githubusercontent.com/13592258/80338238-a9551080-8810-11ea-8ae8-d6707fde2cac.png;>
   
   after:
   https://user-images.githubusercontent.com/13592258/80338241-ac500100-8810-11ea-8518-95c4f8c0a2eb.png;>
   
   
   
   ### How was this patch tested?
   Manually build and check.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn commented on pull request #28055: [SPARK-31289][TEST][test-hive1.2] Eliminate org.apache.spark.sql.hive.thriftserver.CliSuite flakiness

2020-04-26 Thread GitBox


yaooqinn commented on pull request #28055:
URL: https://github.com/apache/spark/pull/28055#issuecomment-619740866


   I will run some tests blindly to see whether it introduce a significant 
delay,  since I am not sure how to mock an ungrateful shutdown at the end of 
each test.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MichaelChirico edited a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


MichaelChirico edited a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619740538


   really opened a can of worms here huh.
   
   To me, it makes sense to leave the PR as is, then do the 
`dapply`/`dapplyCollect` centralization in another PR.
   
   Have filed this follow-up Jira in any case:
   
   https://issues.apache.org/jira/browse/SPARK-31570
   
   I could also try combining them in this PR, I don't think it would be 
substantially difficult, but I don't know the rules for closing multiple Jiras 
in the same PR.
   
   up to you.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MichaelChirico commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


MichaelChirico commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619740538


   really opened a can of worms here huh.
   
   To me, it makes sense to leave the PR as is, then do the 
`dapply`/`dapplyCollect` centralization in another PR.
   
   Have filed this follow-up Jira in any case:
   
   https://issues.apache.org/jira/browse/SPARK-31568
   
   I could also try combining them in this PR, I don't think it would be 
substantially difficult, but I don't know the rules for closing multiple Jiras 
in the same PR.
   
   up to you.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


HyukjinKwon commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619738372


   Merged to master and branch-3.0.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #28359: [SPARK-31534][WEBUI][3.0] Text for tooltip should be escaped

2020-04-26 Thread GitBox


dongjoon-hyun commented on pull request #28359:
URL: https://github.com/apache/spark/pull/28359#issuecomment-619738381


   Thank you, @sarutak .



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on a change in pull request #28215: [SPARK-31272][SQL] Support DB2 Kerberos login in JDBC connector

2020-04-26 Thread GitBox


maropu commented on a change in pull request #28215:
URL: https://github.com/apache/spark/pull/28215#discussion_r415526378



##
File path: 
external/docker-integration-tests/src/test/scala/org/apache/spark/sql/jdbc/DB2KrbIntegrationSuite.scala
##
@@ -0,0 +1,89 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.jdbc
+
+import java.security.PrivilegedExceptionAction
+import java.sql.Connection
+import javax.security.auth.login.Configuration
+
+import com.spotify.docker.client.messages.{ContainerConfig, HostConfig}
+import org.apache.hadoop.security.{SecurityUtil, UserGroupInformation}
+import 
org.apache.hadoop.security.UserGroupInformation.AuthenticationMethod.KERBEROS
+
+import org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions
+import 
org.apache.spark.sql.execution.datasources.jdbc.connection.{DB2ConnectionProvider,
 SecureConnectionProvider}
+import org.apache.spark.tags.DockerTest
+
+@DockerTest
+class DB2KrbIntegrationSuite extends DockerKrbJDBCIntegrationSuite {
+  override protected val userName = s"db2/$dockerIp"
+  override protected val keytabFileName = "db2.keytab"
+
+  override val db = new DatabaseOnDocker {
+override val imageName = "ibmcom/db2:11.5.0.0a"
+override val env = Map(
+  "DB2INST1_PASSWORD" -> "rootpass",
+  "LICENSE" -> "accept",
+  "DBNAME" -> "db2"
+)
+override val usesIpc = false
+override val jdbcPort = 5
+override val privileged = true
+override def getJdbcUrl(ip: String, port: Int): String = 
s"jdbc:db2://$ip:$port/db2"
+override def getJdbcProperties() = {
+  val options = new JDBCOptions(Map[String, String](
+JDBCOptions.JDBC_URL -> getJdbcUrl(dockerIp, externalPort),
+JDBCOptions.JDBC_TABLE_NAME -> "bar",
+JDBCOptions.JDBC_KEYTAB -> keytabFileName,
+JDBCOptions.JDBC_PRINCIPAL -> principal
+  ))
+  new DB2ConnectionProvider(null, options).getAdditionalProperties()
+}
+
+override def beforeContainerStart(
+hostConfigBuilder: HostConfig.Builder,
+containerConfigBuilder: ContainerConfig.Builder): Unit = {
+  copyExecutableResource("db2_krb_setup.sh", initDbDir, replaceIp)
+
+  hostConfigBuilder.appendBinds(
+HostConfig.Bind.from(initDbDir.getAbsolutePath)
+  .to("/var/custom").readOnly(true).build()
+  )
+}
+  }
+
+  override protected def setAuthentication(keytabFile: String, principal: 
String): Unit = {
+val config = new SecureConnectionProvider.JDBCConfiguration(
+  Configuration.getConfiguration, "JaasClient", keytabFile, principal)
+Configuration.setConfiguration(config)

Review comment:
   Is this safe when scanning tables in different secure databases ?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on a change in pull request #28215: [SPARK-31272][SQL] Support DB2 Kerberos login in JDBC connector

2020-04-26 Thread GitBox


maropu commented on a change in pull request #28215:
URL: https://github.com/apache/spark/pull/28215#discussion_r415526378



##
File path: 
external/docker-integration-tests/src/test/scala/org/apache/spark/sql/jdbc/DB2KrbIntegrationSuite.scala
##
@@ -0,0 +1,89 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.jdbc
+
+import java.security.PrivilegedExceptionAction
+import java.sql.Connection
+import javax.security.auth.login.Configuration
+
+import com.spotify.docker.client.messages.{ContainerConfig, HostConfig}
+import org.apache.hadoop.security.{SecurityUtil, UserGroupInformation}
+import 
org.apache.hadoop.security.UserGroupInformation.AuthenticationMethod.KERBEROS
+
+import org.apache.spark.sql.execution.datasources.jdbc.JDBCOptions
+import 
org.apache.spark.sql.execution.datasources.jdbc.connection.{DB2ConnectionProvider,
 SecureConnectionProvider}
+import org.apache.spark.tags.DockerTest
+
+@DockerTest
+class DB2KrbIntegrationSuite extends DockerKrbJDBCIntegrationSuite {
+  override protected val userName = s"db2/$dockerIp"
+  override protected val keytabFileName = "db2.keytab"
+
+  override val db = new DatabaseOnDocker {
+override val imageName = "ibmcom/db2:11.5.0.0a"
+override val env = Map(
+  "DB2INST1_PASSWORD" -> "rootpass",
+  "LICENSE" -> "accept",
+  "DBNAME" -> "db2"
+)
+override val usesIpc = false
+override val jdbcPort = 5
+override val privileged = true
+override def getJdbcUrl(ip: String, port: Int): String = 
s"jdbc:db2://$ip:$port/db2"
+override def getJdbcProperties() = {
+  val options = new JDBCOptions(Map[String, String](
+JDBCOptions.JDBC_URL -> getJdbcUrl(dockerIp, externalPort),
+JDBCOptions.JDBC_TABLE_NAME -> "bar",
+JDBCOptions.JDBC_KEYTAB -> keytabFileName,
+JDBCOptions.JDBC_PRINCIPAL -> principal
+  ))
+  new DB2ConnectionProvider(null, options).getAdditionalProperties()
+}
+
+override def beforeContainerStart(
+hostConfigBuilder: HostConfig.Builder,
+containerConfigBuilder: ContainerConfig.Builder): Unit = {
+  copyExecutableResource("db2_krb_setup.sh", initDbDir, replaceIp)
+
+  hostConfigBuilder.appendBinds(
+HostConfig.Bind.from(initDbDir.getAbsolutePath)
+  .to("/var/custom").readOnly(true).build()
+  )
+}
+  }
+
+  override protected def setAuthentication(keytabFile: String, principal: 
String): Unit = {
+val config = new SecureConnectionProvider.JDBCConfiguration(
+  Configuration.getConfiguration, "JaasClient", keytabFile, principal)
+Configuration.setConfiguration(config)

Review comment:
   Is this safe when simultaneously scanning tables in different secure 
databases ?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619735958







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619736195







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on a change in pull request #28328: [SPARK-31553][SQL] Fix isInCollection for collection sizes above the optimisation threshold

2020-04-26 Thread GitBox


MaxGekk commented on a change in pull request #28328:
URL: https://github.com/apache/spark/pull/28328#discussion_r415525400



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala
##
@@ -426,7 +426,10 @@ case class In(value: Expression, list: Seq[Expression]) 
extends Predicate {
  * Optimized version of In clause, when all filter values of In clause are
  * static.
  */
-case class InSet(child: Expression, hset: Set[Any]) extends UnaryExpression 
with Predicate {
+case class InSet(
+child: Expression,
+hset: Set[Any],
+hsetElemType: DataType) extends UnaryExpression with Predicate {

Review comment:
   And `InSet.sql()` is called from `Dataset.select` _.named:
   ```scala
   Project(untypedCols.map(_.named), logicalPlan)
   ```
   The `named` method calls `toPrettySQL(expr)`:
   ```scala
   case expr: Expression => Alias(expr, toPrettySQL(expr))()
   ```
   The `toPrettySQL` method calls `sql`:
   ```scala
   def toPrettySQL(e: Expression): String = usePrettyExpression(e).sql
   ```





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619736026


   **[Test build #121874 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121874/testReport)**
 for PR 28350 at commit 
[`f73b8a8`](https://github.com/apache/spark/commit/f73b8a83481aae569b504fcddfff43a13af5b844).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619717524


   **[Test build #121874 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121874/testReport)**
 for PR 28350 at commit 
[`f73b8a8`](https://github.com/apache/spark/commit/f73b8a83481aae569b504fcddfff43a13af5b844).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619736195







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619735958







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


SparkQA commented on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619735451


   **[Test build #121875 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121875/testReport)**
 for PR 28356 at commit 
[`0f9a186`](https://github.com/apache/spark/commit/0f9a18655b53630ad4690ce03a066f76f23f5812).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28326: [SPARK-27340][SS] Alias on TimeWindow expression cause watermark metadata lost

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28326:
URL: https://github.com/apache/spark/pull/28326#issuecomment-619735441


   > I'm wondering why Alias has an explicit metadata and hides the actual 
attribute's metadata
   
   Sometimes we have to, for example, `a + b as c`, what should be the metadata 
of `c`?
   
   I think the root cause is we rely on column metadata to store some important 
information, and we should make column metadata reliable as possible as we can. 
For example, `a as c`, if users do not specify the metadata of `c` explicitly, 
it should inherit metadata from `a`.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619733515


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28055: [SPARK-31289][TEST][test-hive1.2] Eliminate org.apache.spark.sql.hive.thriftserver.CliSuite flakiness

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28055:
URL: https://github.com/apache/spark/pull/28055#issuecomment-619733134


   I'm OK to have this patch to reduce test time, but we may need to update the 
PR description and do some experiments to prove this does reduce test time.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn commented on pull request #28055: [SPARK-31289][TEST][test-hive1.2] Eliminate org.apache.spark.sql.hive.thriftserver.CliSuite flakiness

2020-04-26 Thread GitBox


yaooqinn commented on pull request #28055:
URL: https://github.com/apache/spark/pull/28055#issuecomment-619731713


   In the latest master branch, I notice that we will wait for the suite 
process to be shutdown gracefully for 1 minute, which may reduce the flakiness 
of CliSuite but introduce more test time.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on a change in pull request #28328: [SPARK-31553][SQL] Fix isInCollection for collection sizes above the optimisation threshold

2020-04-26 Thread GitBox


MaxGekk commented on a change in pull request #28328:
URL: https://github.com/apache/spark/pull/28328#discussion_r415521493



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala
##
@@ -426,7 +426,10 @@ case class In(value: Expression, list: Seq[Expression]) 
extends Predicate {
  * Optimized version of In clause, when all filter values of In clause are
  * static.
  */
-case class InSet(child: Expression, hset: Set[Any]) extends UnaryExpression 
with Predicate {
+case class InSet(
+child: Expression,
+hset: Set[Any],
+hsetElemType: DataType) extends UnaryExpression with Predicate {

Review comment:
   When InSet is created from isInCollection, in that case `child.dataType` 
is `NullType`. For example, it is NullType in the test 
https://github.com/apache/spark/pull/28328/files#diff-aa655ba249e00d2591b21cf6a360cf82R886
 because child is PrettyAttribute when the `sql` method is called.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28310: [SPARK-31527][SQL] date add/subtract interval only allow those day precision in ansi mode

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28310:
URL: https://github.com/apache/spark/pull/28310#issuecomment-619730253


   thanks, merging to master/3.0!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #28109: [SPARK-31524][SQL][followup] Add metric to the split task number for skew optimization

2020-04-26 Thread GitBox


cloud-fan commented on a change in pull request #28109:
URL: https://github.com/apache/spark/pull/28109#discussion_r415519376



##
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/metric/SQLMetrics.scala
##
@@ -87,6 +87,7 @@ object SQLMetrics {
   private val TIMING_METRIC = "timing"
   private val NS_TIMING_METRIC = "nsTiming"
   private val AVERAGE_METRIC = "average"
+  private val NUMBER_METRIC = "num"

Review comment:
   This is no longer needed.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #28109: [SPARK-31524][SQL][followup] Add metric to the split task number for skew optimization

2020-04-26 Thread GitBox


cloud-fan commented on a change in pull request #28109:
URL: https://github.com/apache/spark/pull/28109#discussion_r415519482



##
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CustomShuffleReaderExec.scala
##
@@ -153,7 +170,9 @@ case class CustomShuffleReaderExec private(
   } ++ {
 if (hasSkewedPartition) {
   Map("numSkewedPartitions" ->
-SQLMetrics.createMetric(sparkContext, "number of skewed 
partitions"))
+SQLMetrics.createMetric(sparkContext, "number of skewed 
partitions"),
+"numSkewedSplits" ->
+  SQLMetrics.createNumMetric(sparkContext, "number of skewed 
partition splits"))

Review comment:
   We just set the value once. This can be simply `createMetric`





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #28109: [SPARK-31524][SQL][followup] Add metric to the split task number for skew optimization

2020-04-26 Thread GitBox


cloud-fan commented on a change in pull request #28109:
URL: https://github.com/apache/spark/pull/28109#discussion_r415519133



##
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CustomShuffleReaderExec.scala
##
@@ -112,6 +113,16 @@ case class CustomShuffleReaderExec private(
 }
   }
 
+  @transient private lazy val skewedSpecs: 
Option[Seq[PartialReducerPartitionSpec]] = {
+if (hasSkewedPartition) {
+  Some(partitionSpecs.collect {
+case p: PartialReducerPartitionSpec => p

Review comment:
   This is very short now. Can we just inline this lazy val?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn edited a comment on pull request #28055: [SPARK-31289][TEST][test-hive1.2] Eliminate org.apache.spark.sql.hive.thriftserver.CliSuite flakiness

2020-04-26 Thread GitBox


yaooqinn edited a comment on pull request #28055:
URL: https://github.com/apache/spark/pull/28055#issuecomment-619726953


   According to the error stack trace in the failed test, the test failed to 
instantiate a hive metastore client because of derby requirements.
   ```
   Caused by: ERROR XBM0A:
The database directory 
'/home/jenkins/workspace/SparkPullRequestBuilder/target/tmp/spark-9249ce52-0a06-42b6-a3df-e6295e880df0'
 exists. 
   However, it does not contain the expected 'service.properties' file.
   Perhaps Derby was brought down in the middle of creating this database.
   You may want to delete this directory and try creating the database again.
   ```
   
   The derby requires the metastore dir does not exist, but it does exist 
probably due to the test case before it failed to clear the metastore dir 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619726343







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #28328: [SPARK-31553][SQL] Fix isInCollection for collection sizes above the optimisation threshold

2020-04-26 Thread GitBox


cloud-fan commented on a change in pull request #28328:
URL: https://github.com/apache/spark/pull/28328#discussion_r415517516



##
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala
##
@@ -426,7 +426,10 @@ case class In(value: Expression, list: Seq[Expression]) 
extends Predicate {
  * Optimized version of In clause, when all filter values of In clause are
  * static.
  */
-case class InSet(child: Expression, hset: Set[Any]) extends UnaryExpression 
with Predicate {
+case class InSet(
+child: Expression,
+hset: Set[Any],
+hsetElemType: DataType) extends UnaryExpression with Predicate {

Review comment:
   when `hsetElemType` can be different from `child.dataType`?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619725756


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/121862/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #26624: [SPARK-8981][core] Add MDC support in Executor

2020-04-26 Thread GitBox


cloud-fan commented on pull request #26624:
URL: https://github.com/apache/spark/pull/26624#issuecomment-619727014


   cc @Ngone51 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] yaooqinn commented on pull request #28055: [SPARK-31289][TEST][test-hive1.2] Eliminate org.apache.spark.sql.hive.thriftserver.CliSuite flakiness

2020-04-26 Thread GitBox


yaooqinn commented on pull request #28055:
URL: https://github.com/apache/spark/pull/28055#issuecomment-619726953


   According to the error stack trace in the failed test, the test failed to 
instantiate a hive metastore client because of derby requirements.
   ```
   Caused by: ERROR XBM0A: The database directory 
'/home/jenkins/workspace/SparkPullRequestBuilder/target/tmp/spark-9249ce52-0a06-42b6-a3df-e6295e880df0'
 exists. However, it does not contain the expected 'service.properties' file. 
Perhaps Derby was brought down in the middle of creating this database. You may 
want to delete this directory and try creating the database again.
   ```
   
   The derby requires the metastore dir does not exist, but it does exist 
probably due to the test case before it failed to clear the metastore dir 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619726343







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619725747


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619709672


   **[Test build #121872 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121872/testReport)**
 for PR 28327 at commit 
[`3ec641d`](https://github.com/apache/spark/commit/3ec641dd49def557323160a1c6c31c945aca4842).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619725747







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


SparkQA commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619725648


   **[Test build #121872 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121872/testReport)**
 for PR 28327 at commit 
[`3ec641d`](https://github.com/apache/spark/commit/3ec641dd49def557323160a1c6c31c945aca4842).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619678050


   **[Test build #121862 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121862/testReport)**
 for PR 28356 at commit 
[`0f9a186`](https://github.com/apache/spark/commit/0f9a18655b53630ad4690ce03a066f76f23f5812).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28356: [SPARK-31485][CORE][3.0] Avoid application hang if only partial barrier tasks launched

2020-04-26 Thread GitBox


SparkQA commented on pull request #28356:
URL: https://github.com/apache/spark/pull/28356#issuecomment-619725253


   **[Test build #121862 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121862/testReport)**
 for PR 28356 at commit 
[`0f9a186`](https://github.com/apache/spark/commit/0f9a18655b53630ad4690ce03a066f76f23f5812).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HeartSaVioR commented on pull request #28326: [SPARK-27340][SS] Alias on TimeWindow expression cause watermark metadata lost

2020-04-26 Thread GitBox


HeartSaVioR commented on pull request #28326:
URL: https://github.com/apache/spark/pull/28326#issuecomment-619722391


   (Please consider that I'm not an expert of SQL area.)
   
   I've read through the code around Alias and played with the reproducer 
(test), and feel #24457 is the fix addressing root cause.
   
   Looking into the reproducer with debugger,
   
   ```
   val left = df1.select('key, window('leftTime, "10 second") as 'leftWindow, 
'leftValue)
   ```
   
   `'leftTime` in here is **unresolved** when applying `as` (hence `window` is 
also **unresolved** as well), and `as` sets the metadata with what original 
column has whereas the metadata of `'leftWindow` cannot be determined here, 
hence the problem arises.
   
   Even without the analysis, logically thinking, I'm wondering why Alias has 
an explicit metadata and hides the actual attribute's metadata, except the case 
of optimization which should be done without side-effects.
   
   Sorry for the dumb question, but is there any real case to do it, and even 
if it's valid, is it intentional to hide the metadata of actual attribute? 
Shouldn't we retain the metadata of actual attribute's metadata as well?
   
   If we concern about the performance about not having shortcut of Alias 
metadata, below fix may bring same effect with #24457, whereas it only changes 
the behavior when Alias renames Column which has unresolved expression:
   
   ```
   def name(alias: String): Column = withExpr {
   normalizedExpr() match {
 case ne: NamedExpression if ne.resolved =>
   Alias(expr, alias)(explicitMetadata = Some(ne.metadata))
 case other => Alias(other, alias)()
   }
 }
   ```



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun edited a comment on pull request #28358: [WIP][SPARK-31567][R][TESTS] Update AppVeyor R version to 4.0.0

2020-04-26 Thread GitBox


dongjoon-hyun edited a comment on pull request #28358:
URL: https://github.com/apache/spark/pull/28358#issuecomment-619711936


   AppVeyor seems to complain at `SparkR` installation.
   However, on Mac, installation is okay and JVM call is okay and some SQL 
queries (CREATE/INSERT/SELECT) works.
   ```
   R version 4.0.0 (2020-04-24) -- "Arbor Day"
   ...
   > head(sql("select version()"))
  version()
   1 3.1.0 1951fe1451ae7e145f83f97a961172590a7f4d77
   
   > sparkR.callJStatic("java.lang.System", "getProperty", "java.home")
   [1] "/Library/Java/JavaVirtualMachines/adoptopenjdk-8.jdk/Contents/Home/jre"
   ```
   The real situation seems that Spark is hitting another behavior change of R 
4.0. I'll investigate this later.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun edited a comment on pull request #28358: [WIP][SPARK-31567][R][TESTS] Update AppVeyor R version to 4.0.0

2020-04-26 Thread GitBox


dongjoon-hyun edited a comment on pull request #28358:
URL: https://github.com/apache/spark/pull/28358#issuecomment-619711936


   AppVeyor seems to complain at `SparkR` installation.
   However, on Mac, installation is okay and JVM call is okay and some SQL 
queries (CREATE/INSERT/SELECT) works.
   ```
   R version 4.0.0 (2020-04-24) -- "Arbor Day"
   ...
   > sparkR.callJStatic("java.lang.System", "getProperty", "java.home")
   [1] "/Library/Java/JavaVirtualMachines/adoptopenjdk-8.jdk/Contents/Home/jre"
   ```
   The real situation seems that Spark is hitting another behavior change of R 
4.0. I'll investigate this later.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28345: [SPARK-31557][SQL] Legacy time parser should return Gregorian days rather than Julian days

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28345:
URL: https://github.com/apache/spark/pull/28345#issuecomment-619721051


   @MaxGekk can you help to fix the timestamp part as well?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28345: [SPARK-31557][SQL] Legacy time parser should return Gregorian days rather than Julian days

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28345:
URL: https://github.com/apache/spark/pull/28345#issuecomment-619719490


   thanks, merging to master/3.0!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619718080







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619716830







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619712257


   **[Test build #121873 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121873/testReport)**
 for PR 28350 at commit 
[`6a04693`](https://github.com/apache/spark/commit/6a04693ed33da945a3b7b21e8871e48980ac9b36).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619717524


   **[Test build #121874 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121874/testReport)**
 for PR 28350 at commit 
[`f73b8a8`](https://github.com/apache/spark/commit/f73b8a83481aae569b504fcddfff43a13af5b844).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619716830







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619716814


   **[Test build #121873 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121873/testReport)**
 for PR 28350 at commit 
[`6a04693`](https://github.com/apache/spark/commit/6a04693ed33da945a3b7b21e8871e48980ac9b36).
* This patch **fails R style tests**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #26339: [SPARK-27194][SPARK-29302][SQL] Fix the issue that for dynamic partition overwrite a task would conflict with its speculative t

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #26339:
URL: https://github.com/apache/spark/pull/26339#issuecomment-619712667







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619712716







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #26339: [SPARK-27194][SPARK-29302][SQL] Fix the issue that for dynamic partition overwrite a task would conflict with its speculative task

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #26339:
URL: https://github.com/apache/spark/pull/26339#issuecomment-619712667







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619712716







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619712257


   **[Test build #121873 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121873/testReport)**
 for PR 28350 at commit 
[`6a04693`](https://github.com/apache/spark/commit/6a04693ed33da945a3b7b21e8871e48980ac9b36).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #26339: [SPARK-27194][SPARK-29302][SQL] Fix the issue that for dynamic partition overwrite a task would conflict with its speculative task

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #26339:
URL: https://github.com/apache/spark/pull/26339#issuecomment-619657326


   **[Test build #121859 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121859/testReport)**
 for PR 26339 at commit 
[`756`](https://github.com/apache/spark/commit/756c6d086542b9de0d991a6de6660fc86fca).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #28358: [WIP][SPARK-31567][R][TESTS] Update AppVeyor R version to 4.0.0

2020-04-26 Thread GitBox


dongjoon-hyun commented on pull request #28358:
URL: https://github.com/apache/spark/pull/28358#issuecomment-619711936


   AppVeyor seems to complain at `SparkR` installation.
   However, on Mac, installation is okay and some SQL queries 
(CREATE/INSERT/SELECT) works.
   The real situation seems that Spark is hitting another behavior change of R 
4.0. I'll investigate this later.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #26339: [SPARK-27194][SPARK-29302][SQL] Fix the issue that for dynamic partition overwrite a task would conflict with its speculative task

2020-04-26 Thread GitBox


SparkQA commented on pull request #26339:
URL: https://github.com/apache/spark/pull/26339#issuecomment-619711667


   **[Test build #121859 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121859/testReport)**
 for PR 26339 at commit 
[`756`](https://github.com/apache/spark/commit/756c6d086542b9de0d991a6de6660fc86fca).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MichaelChirico commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


MichaelChirico commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619711106


   @HyukjinKwon finished merging the Rds. Checked the output of 
`roxygen2::roxygenize('.')` & looks OK. Thanks for pushing in a better 
direction.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MichaelChirico commented on a change in pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


MichaelChirico commented on a change in pull request #28350:
URL: https://github.com/apache/spark/pull/28350#discussion_r415503595



##
File path: R/pkg/R/functions.R
##
@@ -3951,7 +3951,6 @@ setMethod("map_values",
 #' @rdname column_collection_functions
 #' @aliases map_zip_with 
map_zip_with,characterOrColumn,characterOrColumn,function-method
 #'
-#' @examples

Review comment:
   this was throwing a warning from `roxygen2` about empty `@examples`.
   
   should add an example, but just deleted for now





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619710075







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619710075







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


SparkQA commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619709672


   **[Test build #121872 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121872/testReport)**
 for PR 28327 at commit 
[`3ec641d`](https://github.com/apache/spark/commit/3ec641dd49def557323160a1c6c31c945aca4842).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28156: [SPARK-31388][SQL][TESTS] org.apache.spark.sql.hive.thriftserver.CliSuite doesn't match results correctly

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28156:
URL: https://github.com/apache/spark/pull/28156#issuecomment-619708264


   thanks, merging to master/3.0!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on pull request #28055: [SPARK-31289][TEST][test-hive1.2] Eliminate org.apache.spark.sql.hive.thriftserver.CliSuite flakiness

2020-04-26 Thread GitBox


cloud-fan commented on pull request #28055:
URL: https://github.com/apache/spark/pull/28055#issuecomment-619707782


   > CliSuite seems to be flaky while using metastoreDir per test
   
   Do you know the reason?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


HyukjinKwon commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619707526


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619707013


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/121868/
   Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619707004







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619698750


   **[Test build #121868 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121868/testReport)**
 for PR 28327 at commit 
[`3ec641d`](https://github.com/apache/spark/commit/3ec641dd49def557323160a1c6c31c945aca4842).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619707004


   Merged build finished. Test FAILed.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28327: [SPARK-29664][PYTHON][SQL][FOLLOW-UP] Add deprecation warnings for getItem instead

2020-04-26 Thread GitBox


SparkQA commented on pull request #28327:
URL: https://github.com/apache/spark/pull/28327#issuecomment-619706792


   **[Test build #121868 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121868/testReport)**
 for PR 28327 at commit 
[`3ec641d`](https://github.com/apache/spark/commit/3ec641dd49def557323160a1c6c31c945aca4842).
* This patch **fails PySpark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28353: [WIP] Use R 4.0.0 in AppVeyor for Apache Spark 3.1.0

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28353:
URL: https://github.com/apache/spark/pull/28353#issuecomment-619705683







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619705453







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619700668


   **[Test build #121869 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121869/testReport)**
 for PR 28354 at commit 
[`a082de8`](https://github.com/apache/spark/commit/a082de8654fe1dc3182e4147c59a6f7688ed93f5).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28353: [WIP] Use R 4.0.0 in AppVeyor for Apache Spark 3.1.0

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28353:
URL: https://github.com/apache/spark/pull/28353#issuecomment-619705683







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619705453







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619705351


   **[Test build #121869 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121869/testReport)**
 for PR 28354 at commit 
[`a082de8`](https://github.com/apache/spark/commit/a082de8654fe1dc3182e4147c59a6f7688ed93f5).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28353: [WIP] Use R 4.0.0 in AppVeyor for Apache Spark 3.1.0

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28353:
URL: https://github.com/apache/spark/pull/28353#issuecomment-619654357


   **[Test build #121856 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121856/testReport)**
 for PR 28353 at commit 
[`27642f6`](https://github.com/apache/spark/commit/27642f69b7f39f130da0c29f32f71d0b0677f650).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28353: [WIP] Use R 4.0.0 in AppVeyor for Apache Spark 3.1.0

2020-04-26 Thread GitBox


SparkQA commented on pull request #28353:
URL: https://github.com/apache/spark/pull/28353#issuecomment-619704843


   **[Test build #121856 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121856/testReport)**
 for PR 28353 at commit 
[`27642f6`](https://github.com/apache/spark/commit/27642f69b7f39f130da0c29f32f71d0b0677f650).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619704025







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619704025







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA removed a comment on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619692020


   **[Test build #121866 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121866/testReport)**
 for PR 28350 at commit 
[`c5725db`](https://github.com/apache/spark/commit/c5725dbaf4b63d9d5348d180eb4e43b88aeacb64).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


HyukjinKwon commented on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619704133


   cc @gengliangwang FYI



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28350: [SPARK-31568][R][DOCS] Add detail about func/key in gapply to documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28350:
URL: https://github.com/apache/spark/pull/28350#issuecomment-619703834


   **[Test build #121866 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121866/testReport)**
 for PR 28350 at commit 
[`c5725db`](https://github.com/apache/spark/commit/c5725dbaf4b63d9d5348d180eb4e43b88aeacb64).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28359: [SPARK-31534][WEBUI][3.0] Text for tooltip should be escaped

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28359:
URL: https://github.com/apache/spark/pull/28359#issuecomment-619702942







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28359: [SPARK-31534][WEBUI][3.0] Text for tooltip should be escaped

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28359:
URL: https://github.com/apache/spark/pull/28359#issuecomment-619702942







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] sarutak commented on pull request #28359: [SPARK-31534][WEBUI][3.0] Text for tooltip should be escaped

2020-04-26 Thread GitBox


sarutak commented on pull request #28359:
URL: https://github.com/apache/spark/pull/28359#issuecomment-619702971


   The error message mentioned 
[here](https://github.com/apache/spark/pull/28317#issuecomment-619438678) is 
not shown in this branch.
   I found it's because the version of `bootstrap.js` is different between 
`branch-3.0` and `master`.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] sarutak opened a new pull request #28359: [SPARK-31534][WEBUI][3.0] Text for tooltip should be escaped

2020-04-26 Thread GitBox


sarutak opened a new pull request #28359:
URL: https://github.com/apache/spark/pull/28359


   ### What changes were proposed in this pull request?
   This PR backports #28317 which escapes text for tooltip for DAG Viz and 
Timeline View.
   
   ### Why are the changes needed?
   
   This is a bug.
   Normally, DAG Viz and Timeline View show tooltip like as follows.
   
   https://user-images.githubusercontent.com/4736016/80127481-5a6c6880-85cf-11ea-8daf-cfd59aa3ba09.png;>
   https://user-images.githubusercontent.com/4736016/80127500-60624980-85cf-11ea-9b0f-cce301019e3a.png;>
   
   They contain a callsite properly.
   However, if a callsite contains characters which should be escaped for HTML 
without escaping , the corresponding tooltips wouldn't show the callsite and 
its following text properly.
   https://user-images.githubusercontent.com/4736016/80128480-b1267200-85d0-11ea-8035-ad68ae5fbcab.png;>
   https://user-images.githubusercontent.com/4736016/80128492-b5eb2600-85d0-11ea-9556-c48490110244.png;>
   
   The reason of this issue is that the source texts of the tooltip texts are 
not escaped.
   
   ### Does this PR introduce any user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   I tested manually.
   First, I ran a job `sc.parallelize(1 to 10).collect` in Spark Shell then, 
visited AllJobsPage and JobPage and confirmed tooltip texts.
   https://user-images.githubusercontent.com/4736016/80332616-41411180-8886-11ea-8d93-28e1c5265115.png;>
   https://user-images.githubusercontent.com/4736016/80332625-44d49880-8886-11ea-8f2e-de8df1369e62.png;>
   And I confirmed that the appearance of the label of DAG-viz in StagePage is 
not changed. 
   https://user-images.githubusercontent.com/4736016/80332725-9b41d700-8886-11ea-9adb-40d50ad29f86.png;>
   
   I also added a testcase.
   With this testcase, an error message related to JavaScript is shown.
   
   `TypeError: Cannot call method "indexOf" of undefined 
(http://192.168.1.209:59376/static/spark-dag-viz.js#378)`
   
   This is thrown from `interpretLineBreak` in `spark-dag-viz.js`.
   HtmlUnit seems not to support `innerHTML` for text content (I tried 
replacing it with `textContent` and the error message is not shown).
   But DOMs which is needed by the testcase added is already rendered before 
`interpretLineBreak` is called. So the testcase successfully passes.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28359: [SPARK-31534][WEBUI][3.0] Text for tooltip should be escaped

2020-04-26 Thread GitBox


SparkQA commented on pull request #28359:
URL: https://github.com/apache/spark/pull/28359#issuecomment-619702598


   **[Test build #121871 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121871/testReport)**
 for PR 28359 at commit 
[`90d3dbf`](https://github.com/apache/spark/commit/90d3dbf3aa671280b494a82ee25c0a13a5f532f1).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] erenavsarogullari commented on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


erenavsarogullari commented on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619701829


   Hi @HyukjinKwon,
   Thanks for the review. I removed the dependency so current documentation 
just points the existing functionality on `SqlResource`. So, this can go 
independently now.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] huaxingao commented on pull request #28348: [MINOR][SQL][DOCS] Remove two leading spaces from sql tables

2020-04-26 Thread GitBox


huaxingao commented on pull request #28348:
URL: https://github.com/apache/spark/pull/28348#issuecomment-619701454


   @gatorsmile 
   We actually highlight SQL keywords. We have 
   ```
   {% highlight sql %}
   ..
   -- Correlated Subquery in `WHERE` clause.
   SELECT * FROM person AS parent
   WHERE EXISTS (
   SELECT 1 FROM person AS child
   WHERE parent.id = child.id AND child.age IS NULL
   );
   ..
   {% endhighlight %}
   
   ```
   and it looks like this
   
   https://user-images.githubusercontent.com/13592258/80332710-74d95880-8800-11ea-9c8a-79f377cc4bb3.png;>
   
   The only exception I am aware of is the identifier doc I added recently. The 
backtick totally messed up the format. 
   https://user-images.githubusercontent.com/13592258/80332862-f6c98180-8800-11ea-95f6-09d4606b702d.png;>
   
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] mengxr commented on pull request #28279: [SPARK-31497][ML][PYSPARK] Fix Pyspark CrossValidator/TrainValidationSplit with pipeline estimator cannot save and load model

2020-04-26 Thread GitBox


mengxr commented on pull request #28279:
URL: https://github.com/apache/spark/pull/28279#issuecomment-619701317


   LGTM. Merging into master and spark-3.0.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619700962







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619700962







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #27006: [SPARK-30352][SQL] DataSourceV2: Add CURRENT_CATALOG function

2020-04-26 Thread GitBox


SparkQA commented on pull request #27006:
URL: https://github.com/apache/spark/pull/27006#issuecomment-619700713


   **[Test build #121870 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121870/testReport)**
 for PR 27006 at commit 
[`180fe0a`](https://github.com/apache/spark/commit/180fe0a14409043e22d2eb51c17b60a27f678eb7).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on pull request #28354: [SPARK-31566][SQL][DOCS] Add SQL Rest API Documentation

2020-04-26 Thread GitBox


SparkQA commented on pull request #28354:
URL: https://github.com/apache/spark/pull/28354#issuecomment-619700668


   **[Test build #121869 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/121869/testReport)**
 for PR 28354 at commit 
[`a082de8`](https://github.com/apache/spark/commit/a082de8654fe1dc3182e4147c59a6f7688ed93f5).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on pull request #28353: [WIP] Use R 4.0.0 in AppVeyor for Apache Spark 3.1.0

2020-04-26 Thread GitBox


AmplabJenkins commented on pull request #28353:
URL: https://github.com/apache/spark/pull/28353#issuecomment-619699789







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on pull request #28353: [WIP] Use R 4.0.0 in AppVeyor for Apache Spark 3.1.0

2020-04-26 Thread GitBox


AmplabJenkins removed a comment on pull request #28353:
URL: https://github.com/apache/spark/pull/28353#issuecomment-619699789







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



  1   2   3   4   5   >