[GitHub] dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] Set default precision and scale for elements of postgres numeric array

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] 
Set default precision and scale for elements of postgres numeric array
URL: https://github.com/apache/spark/pull/23456#discussion_r246661712
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala
 ##
 @@ -60,7 +60,12 @@ private object PostgresDialect extends JdbcDialect {
 case "bytea" => Some(BinaryType)
 case "timestamp" | "timestamptz" | "time" | "timetz" => Some(TimestampType)
 case "date" => Some(DateType)
-case "numeric" | "decimal" => Some(DecimalType.bounded(precision, scale))
+case "numeric" | "decimal" => if (precision > 0) {
+  Some(DecimalType.bounded(precision, scale))
+} else {
+  // SPARK-26538: handle numeric without explicit precision and scale.
+  Some(DecimalType. SYSTEM_DEFAULT)
+}
 
 Review comment:
   Hi, @a-shkarupin . Thank you for your first contribution.
   Could you follow the existing succinct style? What I mean is having two 
`case "numeric" | "decimal"`s.
   - 
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala#L215-L217


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] Set default precision and scale for elements of postgres numeric array

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] 
Set default precision and scale for elements of postgres numeric array
URL: https://github.com/apache/spark/pull/23456#discussion_r246661712
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala
 ##
 @@ -60,7 +60,12 @@ private object PostgresDialect extends JdbcDialect {
 case "bytea" => Some(BinaryType)
 case "timestamp" | "timestamptz" | "time" | "timetz" => Some(TimestampType)
 case "date" => Some(DateType)
-case "numeric" | "decimal" => Some(DecimalType.bounded(precision, scale))
+case "numeric" | "decimal" => if (precision > 0) {
+  Some(DecimalType.bounded(precision, scale))
+} else {
+  // SPARK-26538: handle numeric without explicit precision and scale.
+  Some(DecimalType. SYSTEM_DEFAULT)
+}
 
 Review comment:
   Hi, @a-shkarupin .
   Could you follow the existing succinct style? What I mean is having two 
`case "numeric" | "decimal"`s.
   - 
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala#L215-L217


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] Set default precision and scale for elements of postgres numeric array

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] 
Set default precision and scale for elements of postgres numeric array
URL: https://github.com/apache/spark/pull/23456#discussion_r246661712
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala
 ##
 @@ -60,7 +60,12 @@ private object PostgresDialect extends JdbcDialect {
 case "bytea" => Some(BinaryType)
 case "timestamp" | "timestamptz" | "time" | "timetz" => Some(TimestampType)
 case "date" => Some(DateType)
-case "numeric" | "decimal" => Some(DecimalType.bounded(precision, scale))
+case "numeric" | "decimal" => if (precision > 0) {
+  Some(DecimalType.bounded(precision, scale))
+} else {
+  // SPARK-26538: handle numeric without explicit precision and scale.
+  Some(DecimalType. SYSTEM_DEFAULT)
+}
 
 Review comment:
   Hi, @a-shkarupin .
   Could you follow the existing style? What I mean is having two `case 
"numeric" | "decimal"`s.
   - 
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala#L215-L217


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] Set default precision and scale for elements of postgres numeric array

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23456: [SPARK-26538][SQL] 
Set default precision and scale for elements of postgres numeric array
URL: https://github.com/apache/spark/pull/23456#discussion_r246661712
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/jdbc/PostgresDialect.scala
 ##
 @@ -60,7 +60,12 @@ private object PostgresDialect extends JdbcDialect {
 case "bytea" => Some(BinaryType)
 case "timestamp" | "timestamptz" | "time" | "timetz" => Some(TimestampType)
 case "date" => Some(DateType)
-case "numeric" | "decimal" => Some(DecimalType.bounded(precision, scale))
+case "numeric" | "decimal" => if (precision > 0) {
+  Some(DecimalType.bounded(precision, scale))
+} else {
+  // SPARK-26538: handle numeric without explicit precision and scale.
+  Some(DecimalType. SYSTEM_DEFAULT)
+}
 
 Review comment:
   Hi, @a-shkarupin .
   Could you follow the existing style?
   - 
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/jdbc/JdbcUtils.scala#L215-L217


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add 
additional integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453001362
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/6885/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453001355
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453001321
 
 
   Kubernetes integration test status failure
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/testing-k8s-prb-make-spark-distribution-unified/6885/
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add 
additional integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453001355
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453001362
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/6885/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #22093: [SPARK-25100][CORE] Fix no registering TaskCommitMessage bug

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #22093: [SPARK-25100][CORE] Fix no 
registering TaskCommitMessage bug
URL: https://github.com/apache/spark/pull/22093#issuecomment-412548219
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453000850
 
 
   Kubernetes integration test starting
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/testing-k8s-prb-make-spark-distribution-unified/6885/
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #22093: [SPARK-25100][CORE] Fix no registering TaskCommitMessage bug

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #22093: [SPARK-25100][CORE] Fix no registering 
TaskCommitMessage bug
URL: https://github.com/apache/spark/pull/22093#issuecomment-453000918
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453000313
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101008/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add 
additional integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453000313
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101008/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-452997827
 
 
   **[Test build #101008 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101008/testReport)**
 for PR 23504 at commit 
[`4400077`](https://github.com/apache/spark/commit/4400077ff8daf162cf7d1f7adcfc1bf5b77017f0).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23502: [SPARK-22128][CORE][BUILD] Add `paranamer` dependency to `core` module

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23502: 
[SPARK-22128][CORE][BUILD] Add `paranamer` dependency to `core` module
URL: https://github.com/apache/spark/pull/23502#discussion_r246659697
 
 

 ##
 File path: core/pom.xml
 ##
 @@ -33,6 +33,10 @@
   Spark Project Core
   http://spark.apache.org/
   
+
+  com.thoughtworks.paranamer
 
 Review comment:
   Thank you for review, @felixcheung .
   
   This is not a new dependency.
   - https://github.com/apache/spark/blob/master/pom.xml#L1948-L1952
   
   The license is here.
   - https://github.com/apache/spark/blob/master/LICENSE-binary#L414


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453000310
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23504: [SPARK-26585][K8S] Add 
additional integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453000310
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-453000250
 
 
   **[Test build #101008 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101008/testReport)**
 for PR 23504 at commit 
[`4400077`](https://github.com/apache/spark/commit/4400077ff8daf162cf7d1f7adcfc1bf5b77017f0).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23502: [SPARK-22128][CORE][BUILD] Add `paranamer` dependency to `core` module

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23502: 
[SPARK-22128][CORE][BUILD] Add `paranamer` dependency to `core` module
URL: https://github.com/apache/spark/pull/23502#discussion_r246659697
 
 

 ##
 File path: core/pom.xml
 ##
 @@ -33,6 +33,10 @@
   Spark Project Core
   http://spark.apache.org/
   
+
+  com.thoughtworks.paranamer
 
 Review comment:
   This is not a new dependency.
   - https://github.com/apache/spark/blob/master/pom.xml#L1948-L1952
   
   The license is here.
   - https://github.com/apache/spark/blob/master/LICENSE-binary#L414


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] LantaoJin commented on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
LantaoJin commented on issue #22874: [SPARK-25865][CORE] Add GC information to 
ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452999783
 
 
   retest this please


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun closed pull request #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
dongjoon-hyun closed pull request #23505: [SPARK-26491][K8S][FOLLOWUP] Fix 
compile failure
URL: https://github.com/apache/spark/pull/23505
 
 
   

This is a PR merged from a forked repository.
As GitHub hides the original diff on merge, it is displayed below for
the sake of provenance:

As this is a foreign pull request (from a fork), the diff is supplied
below (as it won't show otherwise due to GitHub magic):

diff --git 
a/resource-managers/kubernetes/integration-tests/src/test/scala/org/apache/spark/deploy/k8s/integrationtest/KubernetesTestComponents.scala
 
b/resource-managers/kubernetes/integration-tests/src/test/scala/org/apache/spark/deploy/k8s/integrationtest/KubernetesTestComponents.scala
index cc89683949010..250eaab9e9808 100644
--- 
a/resource-managers/kubernetes/integration-tests/src/test/scala/org/apache/spark/deploy/k8s/integrationtest/KubernetesTestComponents.scala
+++ 
b/resource-managers/kubernetes/integration-tests/src/test/scala/org/apache/spark/deploy/k8s/integrationtest/KubernetesTestComponents.scala
@@ -68,7 +68,7 @@ private[spark] class KubernetesTestComponents(defaultClient: 
DefaultKubernetesCl
   .set("spark.executors.instances", "1")
   .set("spark.app.name", "spark-test-app")
   .set("spark.ui.enabled", "true")
-  .set(IS_TESTING, false)
+  .set(IS_TESTING.key, "false")
   .set("spark.kubernetes.submission.waitAppCompletion", "false")
   .set("spark.kubernetes.authenticate.driver.serviceAccountName", 
serviceAccountName)
   }


 


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] LantaoJin commented on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
LantaoJin commented on issue #22874: [SPARK-25865][CORE] Add GC information to 
ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452999732
 
 
   Fails by [Flaky Test: 
HiveClientSuites](https://issues.apache.org/jira/browse/SPARK-23622)


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
dongjoon-hyun commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix 
compile failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452999024
 
 
   Thank you for review and approval, @felixcheung .
   Merged to master.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #19616: [SPARK-22404][YARN] Provide an option to use unmanaged AM in yarn-client mode

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #19616: [SPARK-22404][YARN] Provide an 
option to use unmanaged AM in yarn-client mode
URL: https://github.com/apache/spark/pull/19616#issuecomment-452997772
 
 
   Build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23327: [SPARK-26222][SQL] Track file listing time

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23327: [SPARK-26222][SQL] Track file 
listing time
URL: https://github.com/apache/spark/pull/23327#issuecomment-452997438
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101005/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #19616: [SPARK-22404][YARN] Provide an option to use unmanaged AM in yarn-client mode

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #19616: [SPARK-22404][YARN] Provide an 
option to use unmanaged AM in yarn-client mode
URL: https://github.com/apache/spark/pull/19616#issuecomment-452997774
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100997/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #19616: [SPARK-22404][YARN] Provide an option to use unmanaged AM in yarn-client mode

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #19616: [SPARK-22404][YARN] Provide an option 
to use unmanaged AM in yarn-client mode
URL: https://github.com/apache/spark/pull/19616#issuecomment-452997772
 
 
   Build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #23327: [SPARK-26222][SQL] Track file listing time

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #23327: [SPARK-26222][SQL] Track file 
listing time
URL: https://github.com/apache/spark/pull/23327#issuecomment-452966816
 
 
   **[Test build #101005 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101005/testReport)**
 for PR 23327 at commit 
[`16d80c5`](https://github.com/apache/spark/commit/16d80c504c807d3626da1ff808f3c4c447187ea3).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #19616: [SPARK-22404][YARN] Provide an option to use unmanaged AM in yarn-client mode

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #19616: [SPARK-22404][YARN] Provide an 
option to use unmanaged AM in yarn-client mode
URL: https://github.com/apache/spark/pull/19616#issuecomment-452944227
 
 
   **[Test build #100997 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100997/testReport)**
 for PR 19616 at commit 
[`dc31940`](https://github.com/apache/spark/commit/dc31940ab81e0f119303caf1dc9da99928d509a7).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
SparkQA commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-452997827
 
 
   **[Test build #101008 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101008/testReport)**
 for PR 23504 at commit 
[`4400077`](https://github.com/apache/spark/commit/4400077ff8daf162cf7d1f7adcfc1bf5b77017f0).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #19616: [SPARK-22404][YARN] Provide an option to use unmanaged AM in yarn-client mode

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #19616: [SPARK-22404][YARN] Provide an option 
to use unmanaged AM in yarn-client mode
URL: https://github.com/apache/spark/pull/19616#issuecomment-452997774
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100997/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] felixcheung commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
felixcheung commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-452997703
 
 
   Jenkins, retest this please


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23327: [SPARK-26222][SQL] Track file listing time

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23327: [SPARK-26222][SQL] Track file 
listing time
URL: https://github.com/apache/spark/pull/23327#issuecomment-452997434
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] felixcheung commented on a change in pull request #23502: [SPARK-22128][CORE][BUILD] Add `paranamer` dependency to `core` module

2019-01-09 Thread GitBox
felixcheung commented on a change in pull request #23502: 
[SPARK-22128][CORE][BUILD] Add `paranamer` dependency to `core` module
URL: https://github.com/apache/spark/pull/23502#discussion_r246657269
 
 

 ##
 File path: core/pom.xml
 ##
 @@ -33,6 +33,10 @@
   Spark Project Core
   http://spark.apache.org/
   
+
+  com.thoughtworks.paranamer
 
 Review comment:
   do we need to include this in license? looks like the project has 
non-standard license text


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #19616: [SPARK-22404][YARN] Provide an option to use unmanaged AM in yarn-client mode

2019-01-09 Thread GitBox
SparkQA commented on issue #19616: [SPARK-22404][YARN] Provide an option to use 
unmanaged AM in yarn-client mode
URL: https://github.com/apache/spark/pull/19616#issuecomment-452997362
 
 
   **[Test build #100997 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100997/testReport)**
 for PR 19616 at commit 
[`dc31940`](https://github.com/apache/spark/commit/dc31940ab81e0f119303caf1dc9da99928d509a7).
* This patch passes all tests.
* This patch **does not merge cleanly**.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23327: [SPARK-26222][SQL] Track file listing time

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23327: [SPARK-26222][SQL] Track file listing 
time
URL: https://github.com/apache/spark/pull/23327#issuecomment-452997438
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101005/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23327: [SPARK-26222][SQL] Track file listing time

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23327: [SPARK-26222][SQL] Track file listing 
time
URL: https://github.com/apache/spark/pull/23327#issuecomment-452997434
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23327: [SPARK-26222][SQL] Track file listing time

2019-01-09 Thread GitBox
SparkQA commented on issue #23327: [SPARK-26222][SQL] Track file listing time
URL: https://github.com/apache/spark/pull/23327#issuecomment-452997241
 
 
   **[Test build #101005 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101005/testReport)**
 for PR 23327 at commit 
[`16d80c5`](https://github.com/apache/spark/commit/16d80c504c807d3626da1ff808f3c4c447187ea3).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] felixcheung commented on a change in pull request #23498: [SPARK-26580][SQL] remove Scala 2.11 hack for Scala UDF

2019-01-09 Thread GitBox
felixcheung commented on a change in pull request #23498: [SPARK-26580][SQL] 
remove Scala 2.11 hack for Scala UDF
URL: https://github.com/apache/spark/pull/23498#discussion_r246656972
 
 

 ##
 File path: docs/sql-migration-guide-upgrade.md
 ##
 @@ -43,6 +43,8 @@ displayTitle: Spark SQL Upgrading Guide
 
   - Since Spark 3.0, JSON datasource and JSON function `schema_of_json` infer 
TimestampType from string values if they match to the pattern defined by the 
JSON option `timestampFormat`. Set JSON option `inferTimestamp` to `false` to 
disable such type inferring.
 
+  - In Spark version 2.4 and earlier, if 
`org.apache.spark.sql.functions.udf(Any, DataType)` gets a Scala closure with 
primitive-type argument, the returned UDF will return null if the input values 
is null. Since Spark 3.0, the UDF will return the default value of the Java 
type if the input value is null. For example, `val f = udf((x: Int) => x, 
IntegerType)`, `f($"x")` will return null in Spark 2.4 and earlier if column 
`x` is null, and return 0 in Spark 3.0.
 
 Review comment:
   should this say this is because of Scala 2.12?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] cloud-fan commented on a change in pull request #20303: [SPARK-23128][SQL] A new approach to do adaptive execution in Spark SQL

2019-01-09 Thread GitBox
cloud-fan commented on a change in pull request #20303: [SPARK-23128][SQL] A 
new approach to do adaptive execution in Spark SQL
URL: https://github.com/apache/spark/pull/20303#discussion_r246655398
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/QueryStage.scala
 ##
 @@ -0,0 +1,222 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.adaptive
+
+import scala.concurrent.{ExecutionContext, Future}
+import scala.concurrent.duration.Duration
+
+import org.apache.spark.MapOutputStatistics
+import org.apache.spark.broadcast
+import org.apache.spark.rdd.RDD
+import org.apache.spark.sql.catalyst.InternalRow
+import org.apache.spark.sql.catalyst.expressions._
+import org.apache.spark.sql.catalyst.plans.physical.Partitioning
+import org.apache.spark.sql.execution._
+import org.apache.spark.sql.execution.exchange._
+import 
org.apache.spark.sql.execution.ui.SparkListenerSQLAdaptiveExecutionUpdate
+import org.apache.spark.util.ThreadUtils
+
+/**
+ * In adaptive execution mode, an execution plan is divided into multiple 
QueryStages. Each
+ * QueryStage is a sub-tree that runs in a single stage.
+ */
+abstract class QueryStage extends UnaryExecNode {
+
+  var child: SparkPlan
+
+  // Ignore this wrapper for canonicalizing.
+  override def doCanonicalize(): SparkPlan = child.canonicalized
+
+  override def output: Seq[Attribute] = child.output
+
+  override def outputPartitioning: Partitioning = child.outputPartitioning
+
+  override def outputOrdering: Seq[SortOrder] = child.outputOrdering
+
+  /**
+   * Execute childStages and wait until all stages are completed. Use a thread 
pool to avoid
+   * blocking on one child stage.
+   */
+  def executeChildStages(): Unit = {
+// Handle broadcast stages
+val broadcastQueryStages: Seq[BroadcastQueryStage] = child.collect {
+  case bqs: BroadcastQueryStageInput => bqs.childStage
+}
+val broadcastFutures = broadcastQueryStages.map { queryStage =>
+  Future { queryStage.prepareBroadcast() }(QueryStage.executionContext)
+}
+
+// Submit shuffle stages
+val executionId = 
sqlContext.sparkContext.getLocalProperty(SQLExecution.EXECUTION_ID_KEY)
+val shuffleQueryStages: Seq[ShuffleQueryStage] = child.collect {
+  case sqs: ShuffleQueryStageInput => sqs.childStage
+}
+val shuffleStageFutures = shuffleQueryStages.map { queryStage =>
+  Future {
+SQLExecution.withExecutionId(sqlContext.sparkContext, executionId) {
+  queryStage.execute()
+}
+  }(QueryStage.executionContext)
+}
+
+ThreadUtils.awaitResult(
+  Future.sequence(broadcastFutures)(implicitly, 
QueryStage.executionContext), Duration.Inf)
+ThreadUtils.awaitResult(
+  Future.sequence(shuffleStageFutures)(implicitly, 
QueryStage.executionContext), Duration.Inf)
+  }
+
+  /**
+   * Before executing the plan in this query stage, we execute all child 
stages, optimize the plan
+   * in this stage and determine the reducer number based on the child stages' 
statistics. Finally
+   * we do a codegen for this query stage and update the UI with the new plan.
+   */
+  def prepareExecuteStage(): Unit = {
+// 1. Execute childStages
+executeChildStages()
+// It is possible to optimize this stage's plan here based on the child 
stages' statistics.
+
+// 2. Determine reducer number
+val queryStageInputs: Seq[ShuffleQueryStageInput] = child.collect {
+  case input: ShuffleQueryStageInput => input
+}
+val childMapOutputStatistics = 
queryStageInputs.map(_.childStage.mapOutputStatistics)
+  .filter(_ != null).toArray
+if (childMapOutputStatistics.length > 0) {
+  val exchangeCoordinator = new ExchangeCoordinator(
+conf.targetPostShuffleInputSize,
+conf.minNumPostShufflePartitions)
+
+  val partitionStartIndices =
+
exchangeCoordinator.estimatePartitionStartIndices(childMapOutputStatistics)
+  child = child.transform {
+case ShuffleQueryStageInput(childStage, output, _) =>
+  ShuffleQueryStageInput(childStage, output, 
Some(partitionStartIndices))
+  }
+}
+
+// 3. Codegen and update the UI
+

[GitHub] cloud-fan commented on a change in pull request #20303: [SPARK-23128][SQL] A new approach to do adaptive execution in Spark SQL

2019-01-09 Thread GitBox
cloud-fan commented on a change in pull request #20303: [SPARK-23128][SQL] A 
new approach to do adaptive execution in Spark SQL
URL: https://github.com/apache/spark/pull/20303#discussion_r246654084
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/PlanQueryStage.scala
 ##
 @@ -0,0 +1,79 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.adaptive
+
+import scala.collection.mutable
+import scala.collection.mutable.ArrayBuffer
+
+import org.apache.spark.sql.catalyst.rules.Rule
+import org.apache.spark.sql.execution.SparkPlan
+import org.apache.spark.sql.execution.command.ExecutedCommandExec
+import org.apache.spark.sql.execution.exchange.{BroadcastExchangeExec, 
Exchange, ShuffleExchangeExec}
+import org.apache.spark.sql.internal.SQLConf
+import org.apache.spark.sql.types.StructType
+
+/**
+ * Divide the spark plan into multiple QueryStages. For each Exchange in the 
plan, it adds a
+ * QueryStage and a QueryStageInput. If reusing Exchange is enabled, it finds 
duplicated exchanges
+ * and uses the same QueryStage for all the references.
 
 Review comment:
   add a comment that this rule must be run after `EnsureRequirements`.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] cloud-fan commented on a change in pull request #20303: [SPARK-23128][SQL] A new approach to do adaptive execution in Spark SQL

2019-01-09 Thread GitBox
cloud-fan commented on a change in pull request #20303: [SPARK-23128][SQL] A 
new approach to do adaptive execution in Spark SQL
URL: https://github.com/apache/spark/pull/20303#discussion_r246654954
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/QueryStage.scala
 ##
 @@ -0,0 +1,222 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.adaptive
+
+import scala.concurrent.{ExecutionContext, Future}
+import scala.concurrent.duration.Duration
+
+import org.apache.spark.MapOutputStatistics
+import org.apache.spark.broadcast
+import org.apache.spark.rdd.RDD
+import org.apache.spark.sql.catalyst.InternalRow
+import org.apache.spark.sql.catalyst.expressions._
+import org.apache.spark.sql.catalyst.plans.physical.Partitioning
+import org.apache.spark.sql.execution._
+import org.apache.spark.sql.execution.exchange._
+import 
org.apache.spark.sql.execution.ui.SparkListenerSQLAdaptiveExecutionUpdate
+import org.apache.spark.util.ThreadUtils
+
+/**
+ * In adaptive execution mode, an execution plan is divided into multiple 
QueryStages. Each
+ * QueryStage is a sub-tree that runs in a single stage.
+ */
+abstract class QueryStage extends UnaryExecNode {
+
+  var child: SparkPlan
+
+  // Ignore this wrapper for canonicalizing.
+  override def doCanonicalize(): SparkPlan = child.canonicalized
+
+  override def output: Seq[Attribute] = child.output
+
+  override def outputPartitioning: Partitioning = child.outputPartitioning
+
+  override def outputOrdering: Seq[SortOrder] = child.outputOrdering
+
+  /**
+   * Execute childStages and wait until all stages are completed. Use a thread 
pool to avoid
+   * blocking on one child stage.
+   */
+  def executeChildStages(): Unit = {
+// Handle broadcast stages
+val broadcastQueryStages: Seq[BroadcastQueryStage] = child.collect {
+  case bqs: BroadcastQueryStageInput => bqs.childStage
+}
+val broadcastFutures = broadcastQueryStages.map { queryStage =>
+  Future { queryStage.prepareBroadcast() }(QueryStage.executionContext)
+}
+
+// Submit shuffle stages
+val executionId = 
sqlContext.sparkContext.getLocalProperty(SQLExecution.EXECUTION_ID_KEY)
+val shuffleQueryStages: Seq[ShuffleQueryStage] = child.collect {
+  case sqs: ShuffleQueryStageInput => sqs.childStage
+}
+val shuffleStageFutures = shuffleQueryStages.map { queryStage =>
+  Future {
+SQLExecution.withExecutionId(sqlContext.sparkContext, executionId) {
+  queryStage.execute()
+}
+  }(QueryStage.executionContext)
+}
+
+ThreadUtils.awaitResult(
+  Future.sequence(broadcastFutures)(implicitly, 
QueryStage.executionContext), Duration.Inf)
+ThreadUtils.awaitResult(
+  Future.sequence(shuffleStageFutures)(implicitly, 
QueryStage.executionContext), Duration.Inf)
+  }
+
+  /**
+   * Before executing the plan in this query stage, we execute all child 
stages, optimize the plan
+   * in this stage and determine the reducer number based on the child stages' 
statistics. Finally
+   * we do a codegen for this query stage and update the UI with the new plan.
+   */
+  def prepareExecuteStage(): Unit = {
+// 1. Execute childStages
+executeChildStages()
+// It is possible to optimize this stage's plan here based on the child 
stages' statistics.
+
+// 2. Determine reducer number
+val queryStageInputs: Seq[ShuffleQueryStageInput] = child.collect {
+  case input: ShuffleQueryStageInput => input
+}
+val childMapOutputStatistics = 
queryStageInputs.map(_.childStage.mapOutputStatistics)
+  .filter(_ != null).toArray
 
 Review comment:
   when `mapOutputStatistics` can be null?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For 

[GitHub] dongjoon-hyun edited a comment on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
dongjoon-hyun edited a comment on issue #23504: [SPARK-26585][K8S] Add 
additional integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-452995851
 
 
   Hi, @ramaddepally . The Kubernetes integration test failure is irrelevant to 
this PR.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on issue #23504: [SPARK-26585][K8S] Add additional integration tests for K8s Scheduler Backend

2019-01-09 Thread GitBox
dongjoon-hyun commented on issue #23504: [SPARK-26585][K8S] Add additional 
integration tests for K8s Scheduler Backend
URL: https://github.com/apache/spark/pull/23504#issuecomment-452995851
 
 
   Hi, @ramaddepally . The failure is irrelevant to this PR.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun edited a comment on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
dongjoon-hyun edited a comment on issue #23506: [SPARK-26577][SQL] Add input 
optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452995469
 
 
   Hi, @Deegue . Thank you for your first contribution. Please update both PRs 
(this one and the previous one).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
dongjoon-hyun commented on issue #23506: [SPARK-26577][SQL] Add input optimizer 
when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452995469
 
 
   Hi, @Deegue . Thank you for your first contribution. Please update this PR.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #22874: [SPARK-25865][CORE] Add GC 
information to ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452995005
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101001/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23506: [SPARK-26577][SQL] 
Add input optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#discussion_r246655454
 
 

 ##
 File path: 
sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveTableScanSuite.scala
 ##
 @@ -192,4 +192,54 @@ class HiveTableScanSuite extends HiveComparisonTest with 
SQLTestUtils with TestH
   case p: HiveTableScanExec => p
 }.get
   }
-}
+
+  test("Test the InputFormat optimizer") {
+withTable("table_old", "table_pt_old", "table_new", "table_pt_new") {
+  sql("set spark.sql.hive.fileInputFormat.enabled=true")
+  sql("set spark.sql.hive.fileInputFormat.split.maxsize=134217728")
+  sql("set spark.sql.hive.fileInputFormat.split.minsize=134217728")
+  sql(
+s"""
+   |CREATE TABLE table_old (id int)
+   |STORED AS
+   |INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat'
+   |OUTPUTFORMAT 
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
+ """.stripMargin)
+  sql(
+s"""
+   |CREATE TABLE table_pt_old (id int)
+   |PARTITIONED BY (a int, b int)
+   |STORED AS
+   |INPUTFORMAT 'org.apache.hadoop.mapred.TextInputFormat'
+   |OUTPUTFORMAT 
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
+ """.stripMargin)
+  sql(
+s"""
+   |CREATE TABLE table_new (id int)
+   |STORED AS
+   |INPUTFORMAT 'org.apache.hadoop.mapreduce.lib.input.TextInputFormat'
+   |OUTPUTFORMAT 
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
+   """.stripMargin)
+  sql(
+s"""
+   |CREATE TABLE table_pt_new (id int)
+   |PARTITIONED BY (a int, b int)
+   |STORED AS
+   |INPUTFORMAT 'org.apache.hadoop.mapreduce.lib.input.TextInputFormat'
+   |OUTPUTFORMAT 
'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
+   """.stripMargin)
+  intercept[Exception] {
+sql("SELECT count(1) FROM table_old")
+  }
 
 Review comment:
   We had better check the specific exception (instead of `Exception`) and the 
error message. For example, the following pattern is used.
   ```scala
   val m1 = intercept[AnalysisException] {
   sql("SELECT count(1) FROM table_old")
   }.getMessage
   assert(m1.contains("Your error message"))
   ```


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #22874: [SPARK-25865][CORE] Add GC information 
to ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452995003
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #22874: [SPARK-25865][CORE] Add GC information 
to ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452995005
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101001/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #22874: [SPARK-25865][CORE] Add GC 
information to ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452995003
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
SparkQA commented on issue #22874: [SPARK-25865][CORE] Add GC information to 
ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452994781
 
 
   **[Test build #101001 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101001/testReport)**
 for PR 22874 at commit 
[`3d9a02c`](https://github.com/apache/spark/commit/3d9a02c8a9e1ceba3f097bd76399616c136892a9).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23208: [SPARK-25530][SQL] data source v2 API refactor (batch write)

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23208: [SPARK-25530][SQL] data source 
v2 API refactor (batch write)
URL: https://github.com/apache/spark/pull/23208#issuecomment-452994710
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23208: [SPARK-25530][SQL] data source v2 API refactor (batch write)

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23208: [SPARK-25530][SQL] data source 
v2 API refactor (batch write)
URL: https://github.com/apache/spark/pull/23208#issuecomment-452994716
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101000/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #22874: [SPARK-25865][CORE] Add GC information to ExecutorMetrics

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #22874: [SPARK-25865][CORE] Add GC 
information to ExecutorMetrics
URL: https://github.com/apache/spark/pull/22874#issuecomment-452957976
 
 
   **[Test build #101001 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101001/testReport)**
 for PR 22874 at commit 
[`3d9a02c`](https://github.com/apache/spark/commit/3d9a02c8a9e1ceba3f097bd76399616c136892a9).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23208: [SPARK-25530][SQL] data source v2 API refactor (batch write)

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23208: [SPARK-25530][SQL] data source v2 API 
refactor (batch write)
URL: https://github.com/apache/spark/pull/23208#issuecomment-452994710
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23208: [SPARK-25530][SQL] data source v2 API refactor (batch write)

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23208: [SPARK-25530][SQL] data source v2 API 
refactor (batch write)
URL: https://github.com/apache/spark/pull/23208#issuecomment-452994716
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/101000/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23506: [SPARK-26577][SQL] 
Add input optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#discussion_r246654460
 
 

 ##
 File path: sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala
 ##
 @@ -311,6 +309,35 @@ class HadoopTableReader(
 // Only take the value (skip the key) because Hive works only with values.
 rdd.map(_._2)
   }
+
+  /**
+* If `spark.sql.hive.fileInputFormat.enabled` is true, this function will 
optimize the input
 
 Review comment:
   nit. indentation?
   Also, [line 
318](https://github.com/apache/spark/pull/23506/files#diff-8887a877bd52611df9aea06ccfe3a2d7R318)?
 Please check the style. It seems to be broken due to copy


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #23208: [SPARK-25530][SQL] data source v2 API refactor (batch write)

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #23208: [SPARK-25530][SQL] data source v2 
API refactor (batch write)
URL: https://github.com/apache/spark/pull/23208#issuecomment-452957052
 
 
   **[Test build #101000 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101000/testReport)**
 for PR 23208 at commit 
[`ee7acbc`](https://github.com/apache/spark/commit/ee7acbced67d3d084b1479b8303fe815701893ef).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on a change in pull request #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
dongjoon-hyun commented on a change in pull request #23506: [SPARK-26577][SQL] 
Add input optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#discussion_r246654460
 
 

 ##
 File path: sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala
 ##
 @@ -311,6 +309,35 @@ class HadoopTableReader(
 // Only take the value (skip the key) because Hive works only with values.
 rdd.map(_._2)
   }
+
+  /**
+* If `spark.sql.hive.fileInputFormat.enabled` is true, this function will 
optimize the input
 
 Review comment:
   nit. indentation?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23208: [SPARK-25530][SQL] data source v2 API refactor (batch write)

2019-01-09 Thread GitBox
SparkQA commented on issue #23208: [SPARK-25530][SQL] data source v2 API 
refactor (batch write)
URL: https://github.com/apache/spark/pull/23208#issuecomment-452994268
 
 
   **[Test build #101000 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/101000/testReport)**
 for PR 23208 at commit 
[`ee7acbc`](https://github.com/apache/spark/commit/ee7acbced67d3d084b1479b8303fe815701893ef).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23390: [SPARK-26459][SQL] replace UpdateNullabilityInAttributeReferences with FixNullability

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23390: [SPARK-26459][SQL] replace 
UpdateNullabilityInAttributeReferences with FixNullability
URL: https://github.com/apache/spark/pull/23390#issuecomment-452989298
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23390: [SPARK-26459][SQL] replace UpdateNullabilityInAttributeReferences with FixNullability

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23390: [SPARK-26459][SQL] replace 
UpdateNullabilityInAttributeReferences with FixNullability
URL: https://github.com/apache/spark/pull/23390#issuecomment-452989298
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23390: [SPARK-26459][SQL] replace UpdateNullabilityInAttributeReferences with FixNullability

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23390: [SPARK-26459][SQL] replace 
UpdateNullabilityInAttributeReferences with FixNullability
URL: https://github.com/apache/spark/pull/23390#issuecomment-452989301
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100999/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23390: [SPARK-26459][SQL] replace UpdateNullabilityInAttributeReferences with FixNullability

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23390: [SPARK-26459][SQL] replace 
UpdateNullabilityInAttributeReferences with FixNullability
URL: https://github.com/apache/spark/pull/23390#issuecomment-452989301
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100999/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #23390: [SPARK-26459][SQL] replace UpdateNullabilityInAttributeReferences with FixNullability

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #23390: [SPARK-26459][SQL] replace 
UpdateNullabilityInAttributeReferences with FixNullability
URL: https://github.com/apache/spark/pull/23390#issuecomment-452950845
 
 
   **[Test build #100999 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100999/testReport)**
 for PR 23390 at commit 
[`bcb5667`](https://github.com/apache/spark/commit/bcb56670c4d35df880ff7b4dbe1d29d1359e7791).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23390: [SPARK-26459][SQL] replace UpdateNullabilityInAttributeReferences with FixNullability

2019-01-09 Thread GitBox
SparkQA commented on issue #23390: [SPARK-26459][SQL] replace 
UpdateNullabilityInAttributeReferences with FixNullability
URL: https://github.com/apache/spark/pull/23390#issuecomment-452988931
 
 
   **[Test build #100999 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100999/testReport)**
 for PR 23390 at commit 
[`bcb5667`](https://github.com/apache/spark/commit/bcb56670c4d35df880ff7b4dbe1d29d1359e7791).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] Deegue commented on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
Deegue commented on issue #23506: [SPARK-26577][SQL] Add input optimizer when 
reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452988195
 
 
   Thanks @gatorsmile and I wonder if it would be better to change the conf 
name to `spark.sql.hive.input.optimizer.enable` ,
   `spark.sql.hive.input.split.maxsize` and 
   `spark.sql.hive.input.split.minsize` ?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23506: [SPARK-26577][SQL] Add input 
optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452986459
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23506: [SPARK-26577][SQL] Add input optimizer 
when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452986740
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23506: [SPARK-26577][SQL] Add input 
optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452986402
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] Deegue commented on issue #23496: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
Deegue commented on issue #23496: [SPARK-26577][SQL] Add input optimizer when 
reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23496#issuecomment-452986433
 
 
   > Let me think about the conf naming. Could you first rebase it to master 
branch? Now it is targeting to 2.4.
   
   Thank you, I've picked the changes to master branch and create a new PR.
   #23506 


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23506: [SPARK-26577][SQL] Add input optimizer 
when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452986459
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23506: [SPARK-26577][SQL] Add input optimizer 
when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506#issuecomment-452986402
 
 
   Can one of the admins verify this patch?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] Deegue opened a new pull request #23506: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
Deegue opened a new pull request #23506: [SPARK-26577][SQL] Add input optimizer 
when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23506
 
 
   ## What changes were proposed in this pull request?
   
   When using SparkSQL, for example the ThriftServer, if we set
   
   `spark.sql.hive.fileInputFormat.enabled=true`
   
   we can optimize the InputFormat to CombineTextInputFormat automatically if 
it's TextInputFormat before. And we can also change the max/min size of input 
splits by setting, for example 
   
   `spark.sql.hive.fileInputFormat.split.maxsize=268435456`
   `spark.sql.hive.fileInputFormat.split.minsize=134217728`
   
   Otherwise, we have to modify Hive Configs and structure of tables.
   
   And we made a test by using a Hive table with a lot of small files in HDFS 
and haven't combined :
   
   Before improved:
   
![image](https://user-images.githubusercontent.com/25916266/50877374-85e43780-140c-11e9-9724-31d367739552.png)
   
   
   After improved:
   
![image](https://user-images.githubusercontent.com/25916266/50877387-9694ad80-140c-11e9-99e2-f55a3c7285e0.png)
   
   
   
   ## How was this patch tested?
   
   Added a test.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] Deegue commented on a change in pull request #23496: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
Deegue commented on a change in pull request #23496: [SPARK-26577][SQL] Add 
input optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23496#discussion_r246647183
 
 

 ##
 File path: sql/hive/src/main/scala/org/apache/spark/sql/hive/TableReader.scala
 ##
 @@ -123,8 +123,26 @@ class HadoopTableReader(
 val inputPathStr = applyFilterIfNeeded(tablePath, filterOpt)
 
 // logDebug("Table input: %s".format(tablePath))
-val ifc = hiveTable.getInputFormatClass
+var ifc = hiveTable.getInputFormatClass
   .asInstanceOf[java.lang.Class[InputFormat[Writable, Writable]]]
+if (conf.fileInputFormatEnabled) {
+  hadoopConf.set("mapreduce.input.fileinputformat.split.maxsize",
 
 Review comment:
   sorry for that, done


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23503: [SPARK-26584][SQL] Remove `spark.sql.orc.copyBatchToSpark` internal conf

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23503: [SPARK-26584][SQL] Remove 
`spark.sql.orc.copyBatchToSpark` internal conf
URL: https://github.com/apache/spark/pull/23503#issuecomment-452984841
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23503: [SPARK-26584][SQL] Remove `spark.sql.orc.copyBatchToSpark` internal conf

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23503: [SPARK-26584][SQL] Remove 
`spark.sql.orc.copyBatchToSpark` internal conf
URL: https://github.com/apache/spark/pull/23503#issuecomment-452984846
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100998/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23503: [SPARK-26584][SQL] Remove `spark.sql.orc.copyBatchToSpark` internal conf

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23503: [SPARK-26584][SQL] Remove 
`spark.sql.orc.copyBatchToSpark` internal conf
URL: https://github.com/apache/spark/pull/23503#issuecomment-452984846
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100998/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] Deegue commented on a change in pull request #23496: [SPARK-26577][SQL] Add input optimizer when reading Hive table by SparkSQL

2019-01-09 Thread GitBox
Deegue commented on a change in pull request #23496: [SPARK-26577][SQL] Add 
input optimizer when reading Hive table by SparkSQL
URL: https://github.com/apache/spark/pull/23496#discussion_r246647067
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
 ##
 @@ -561,6 +561,23 @@ object SQLConf {
 .checkValues(HiveCaseSensitiveInferenceMode.values.map(_.toString))
 .createWithDefault(HiveCaseSensitiveInferenceMode.INFER_AND_SAVE.toString)
 
+  val HIVE_FILE_INPUT_FORMAT_ENABLED = 
buildConf("spark.sql.hive.fileInputFormat.enabled")
 
 Review comment:
   yes I see, done!


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23503: [SPARK-26584][SQL] Remove `spark.sql.orc.copyBatchToSpark` internal conf

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23503: [SPARK-26584][SQL] Remove 
`spark.sql.orc.copyBatchToSpark` internal conf
URL: https://github.com/apache/spark/pull/23503#issuecomment-452984841
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #23503: [SPARK-26584][SQL] Remove `spark.sql.orc.copyBatchToSpark` internal conf

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #23503: [SPARK-26584][SQL] Remove 
`spark.sql.orc.copyBatchToSpark` internal conf
URL: https://github.com/apache/spark/pull/23503#issuecomment-452945229
 
 
   **[Test build #100998 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100998/testReport)**
 for PR 23503 at commit 
[`e2574c7`](https://github.com/apache/spark/commit/e2574c77b399a8787182c214b53cbf054a7b97f1).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23503: [SPARK-26584][SQL] Remove `spark.sql.orc.copyBatchToSpark` internal conf

2019-01-09 Thread GitBox
SparkQA commented on issue #23503: [SPARK-26584][SQL] Remove 
`spark.sql.orc.copyBatchToSpark` internal conf
URL: https://github.com/apache/spark/pull/23503#issuecomment-452984520
 
 
   **[Test build #100998 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100998/testReport)**
 for PR 23503 at commit 
[`e2574c7`](https://github.com/apache/spark/commit/e2574c77b399a8787182c214b53cbf054a7b97f1).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] jzhuge commented on a change in pull request #21978: [SPARK-25006][SQL] Add CatalogTableIdentifier.

2019-01-09 Thread GitBox
jzhuge commented on a change in pull request #21978: [SPARK-25006][SQL] Add 
CatalogTableIdentifier.
URL: https://github.com/apache/spark/pull/21978#discussion_r246645835
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/identifiers.scala
 ##
 @@ -18,30 +18,32 @@
 package org.apache.spark.sql.catalyst
 
 /**
- * An identifier that optionally specifies a database.
+ * An identifier that optionally specifies a database and catalog.
  *
  * Format (unquoted): "name" or "db.name"
  * Format (quoted): "`name`" or "`db`.`name`"
  */
-sealed trait IdentifierWithDatabase {
+sealed trait IdentifierWithOptionalDatabaseAndCatalog {
   val identifier: String
 
   def database: Option[String]
 
+  def catalog: Option[String]
 
 Review comment:
   Default to `None`?


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] dongjoon-hyun commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
dongjoon-hyun commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix 
compile failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452983429
 
 
   K8S integration test passed.
   
   cc @vanzin and @mgaido91 .


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] HeartSaVioR commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
HeartSaVioR commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for 
hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452976138
 
 
   @kiszk @vanzin 
   I guess I addressed review comments and the build is green. 
   Could you please take a look again? Thanks in advance!


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] HeartSaVioR edited a comment on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
HeartSaVioR edited a comment on issue #23423: [SPARK-26482][CORE] Use 
ConfigEntry for hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452976138
 
 
   @vanzin @kiszk 
   I guess I addressed review comments and the build is green. 
   Could you please take a look again? Thanks in advance!


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23423: [SPARK-26482][CORE] Use 
ConfigEntry for hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452976044
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100994/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry 
for hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452976044
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/100994/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry 
for hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452976040
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23423: [SPARK-26482][CORE] Use 
ConfigEntry for hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452976040
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA removed a comment on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
SparkQA removed a comment on issue #23423: [SPARK-26482][CORE] Use ConfigEntry 
for hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452927381
 
 
   **[Test build #100994 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100994/testReport)**
 for PR 23423 at commit 
[`8d61242`](https://github.com/apache/spark/commit/8d61242aa85661592fc530a7cd5a8956f6f75bb8).


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for hardcoded configs for ui categories

2019-01-09 Thread GitBox
SparkQA commented on issue #23423: [SPARK-26482][CORE] Use ConfigEntry for 
hardcoded configs for ui categories
URL: https://github.com/apache/spark/pull/23423#issuecomment-452975754
 
 
   **[Test build #100994 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/100994/testReport)**
 for PR 23423 at commit 
[`8d61242`](https://github.com/apache/spark/commit/8d61242aa85661592fc530a7cd5a8956f6f75bb8).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] felixcheung commented on issue #23492: [SPARK-26565][BUILD] skip gpg signing/svn publish for jenkins package builds

2019-01-09 Thread GitBox
felixcheung commented on issue #23492: [SPARK-26565][BUILD] skip gpg 
signing/svn publish for jenkins package builds
URL: https://github.com/apache/spark/pull/23492#issuecomment-452975088
 
 
   I’d defer to you and others to decide whether keeping the output artifacts 
can be useful...
   
   
   
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix 
compile failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452974845
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23505: [SPARK-26491][K8S][FOLLOWUP] 
Fix compile failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452974845
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
AmplabJenkins commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix 
compile failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452974847
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/6884/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] AmplabJenkins removed a comment on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
AmplabJenkins removed a comment on issue #23505: [SPARK-26491][K8S][FOLLOWUP] 
Fix compile failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452974847
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/6884/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] SparkQA commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile failure

2019-01-09 Thread GitBox
SparkQA commented on issue #23505: [SPARK-26491][K8S][FOLLOWUP] Fix compile 
failure
URL: https://github.com/apache/spark/pull/23505#issuecomment-452974834
 
 
   Kubernetes integration test status success
   URL: 
https://amplab.cs.berkeley.edu/jenkins/job/testing-k8s-prb-make-spark-distribution-unified/6884/
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



  1   2   3   4   5   6   7   8   9   >