[GitHub] [spark] SparkQA commented on issue #27644: [SPARK-30868][SQL] Throw Exception if runHive(sql) failed

2020-02-20 Thread GitBox
SparkQA commented on issue #27644: [SPARK-30868][SQL] Throw Exception if 
runHive(sql) failed
URL: https://github.com/apache/spark/pull/27644#issuecomment-589541503
 
 
   **[Test build #118756 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118756/testReport)**
 for PR 27644 at commit 
[`6d7435e`](https://github.com/apache/spark/commit/6d7435ed11e502c8ae517f33eaf6fb2ea12aae29).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on issue #27644: [SPARK-30868][SQL] Throw Exception if runHive(sql) failed

2020-02-20 Thread GitBox
SparkQA removed a comment on issue #27644: [SPARK-30868][SQL] Throw Exception 
if runHive(sql) failed
URL: https://github.com/apache/spark/pull/27644#issuecomment-589510776
 
 
   **[Test build #118756 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118756/testReport)**
 for PR 27644 at commit 
[`6d7435e`](https://github.com/apache/spark/commit/6d7435ed11e502c8ae517f33eaf6fb2ea12aae29).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27644: [SPARK-30868][SQL] Throw Exception if runHive(sql) failed

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27644: [SPARK-30868][SQL] Throw Exception if 
runHive(sql) failed
URL: https://github.com/apache/spark/pull/27644#issuecomment-589541605
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27644: [SPARK-30868][SQL] Throw Exception if runHive(sql) failed

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27644: [SPARK-30868][SQL] Throw 
Exception if runHive(sql) failed
URL: https://github.com/apache/spark/pull/27644#issuecomment-589541605
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27644: [SPARK-30868][SQL] Throw Exception if runHive(sql) failed

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27644: [SPARK-30868][SQL] Throw Exception if 
runHive(sql) failed
URL: https://github.com/apache/spark/pull/27644#issuecomment-589541612
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118756/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on issue #27495: [SPARK-28880][SQL] Support ANSI nested bracketed comments

2020-02-20 Thread GitBox
cloud-fan commented on issue #27495: [SPARK-28880][SQL] Support ANSI nested 
bracketed comments
URL: https://github.com/apache/spark/pull/27495#issuecomment-589540986
 
 
   I have a new idea. Without considering hint, the rule to match comment is 
very simple:
   ```
   '/*' (BRACKETED_COMMENT|.)*? '*/' -> channel(HIDDEN)
   ```
   
   And there is a better way to check if it's hint
   ```
   '/*' (BRACKETED_COMMENT|.)*? '*/' {isHint()}? -> channel(HIDDEN)
   ```
   
   We can follow `isValidDecimal()` in `Sqlbase.g4` to implement `isHint()`, 
which checks the first 3 chars and see if they are "/*-"


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27560: [SPARK-30809][SQL] Review and fix issues in SQL API docs

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27560: [SPARK-30809][SQL] Review and fix 
issues in SQL API docs
URL: https://github.com/apache/spark/pull/27560#issuecomment-589539592
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27560: [SPARK-30809][SQL] Review and fix issues in SQL API docs

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27560: [SPARK-30809][SQL] Review and 
fix issues in SQL API docs
URL: https://github.com/apache/spark/pull/27560#issuecomment-589539598
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118749/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27560: [SPARK-30809][SQL] Review and fix issues in SQL API docs

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27560: [SPARK-30809][SQL] Review and fix 
issues in SQL API docs
URL: https://github.com/apache/spark/pull/27560#issuecomment-589539598
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118749/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27560: [SPARK-30809][SQL] Review and fix issues in SQL API docs

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27560: [SPARK-30809][SQL] Review and 
fix issues in SQL API docs
URL: https://github.com/apache/spark/pull/27560#issuecomment-589539592
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on issue #27560: [SPARK-30809][SQL] Review and fix issues in SQL API docs

2020-02-20 Thread GitBox
SparkQA removed a comment on issue #27560: [SPARK-30809][SQL] Review and fix 
issues in SQL API docs
URL: https://github.com/apache/spark/pull/27560#issuecomment-589493045
 
 
   **[Test build #118749 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118749/testReport)**
 for PR 27560 at commit 
[`14b121e`](https://github.com/apache/spark/commit/14b121e28ee9695ff1d8ac1c2731f3250e6590a8).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27560: [SPARK-30809][SQL] Review and fix issues in SQL API docs

2020-02-20 Thread GitBox
SparkQA commented on issue #27560: [SPARK-30809][SQL] Review and fix issues in 
SQL API docs
URL: https://github.com/apache/spark/pull/27560#issuecomment-589538924
 
 
   **[Test build #118749 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118749/testReport)**
 for PR 27560 at commit 
[`14b121e`](https://github.com/apache/spark/commit/14b121e28ee9695ff1d8ac1c2731f3250e6590a8).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27655: [SPARK-30897][SQL] The behavior of ArrayExists should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27655: [SPARK-30897][SQL] The behavior of 
ArrayExists should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27655#issuecomment-589538601
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23514/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27655: [SPARK-30897][SQL] The behavior of ArrayExists should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27655: [SPARK-30897][SQL] The 
behavior of ArrayExists should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27655#issuecomment-589538601
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23514/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27655: [SPARK-30897][SQL] The behavior of ArrayExists should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27655: [SPARK-30897][SQL] The 
behavior of ArrayExists should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27655#issuecomment-589538595
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27655: [SPARK-30897][SQL] The behavior of ArrayExists should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27655: [SPARK-30897][SQL] The behavior of 
ArrayExists should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27655#issuecomment-589538595
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27655: [SPARK-30897][SQL] The behavior of ArrayExists should not depend on SQLConf.get

2020-02-20 Thread GitBox
SparkQA commented on issue #27655: [SPARK-30897][SQL] The behavior of 
ArrayExists should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27655#issuecomment-589538273
 
 
   **[Test build #118763 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118763/testReport)**
 for PR 27655 at commit 
[`87aac55`](https://github.com/apache/spark/commit/87aac55538108f2007aabc0286a463104231f74e).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon closed pull request #27661: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
HyukjinKwon closed pull request #27661: [SPARK-30869][SQL] Convert dates 
to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27661
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] peter-toth commented on a change in pull request #27655: [SPARK-30897][SQL] The behavior of ArrayExists should not depend on SQLConf.get

2020-02-20 Thread GitBox
peter-toth commented on a change in pull request #27655: [SPARK-30897][SQL] The 
behavior of ArrayExists should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27655#discussion_r382435704
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/higherOrderFunctions.scala
 ##
 @@ -522,11 +522,14 @@ case class ArrayFilter(
   since = "2.4.0")
 case class ArrayExists(
 argument: Expression,
-function: Expression)
+function: Expression,
+followThreeValuedLogic: Boolean)
   extends ArrayBasedSimpleHigherOrderFunction with CodegenFallback {
 
-  private val followThreeValuedLogic =
-SQLConf.get.getConf(SQLConf.LEGACY_ARRAY_EXISTS_FOLLOWS_THREE_VALUED_LOGIC)
+  def this(argument: Expression, function: Expression) = {
+this(argument, function,
+  
SQLConf.get.getConf(SQLConf.LEGACY_ARRAY_EXISTS_FOLLOWS_THREE_VALUED_LOGIC))
 
 Review comment:
   Thanks, fixed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27597: 
[SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy 
when insert into table
URL: https://github.com/apache/spark/pull/27597#discussion_r382433518
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
 ##
 @@ -104,7 +105,17 @@ case class DataSourceAnalysis(conf: SQLConf) extends 
Rule[LogicalPlan] with Cast
 None
   } else if (potentialSpecs.size == 1) {
 val partValue = potentialSpecs.head._2
-Some(Alias(cast(Literal(partValue), field.dataType), field.name)())
+conf.storeAssignmentPolicy match {
+  // SPARK-30844: try our best to follow StoreAssignmentPolicy for 
static partition
+  // values but not completely follow because we can't 
use`DataType.canWrite` due to
 
 Review comment:
   nit: be more general like `... because we can't do static type checking due 
to ...`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27597: [SPARK-30844][SQL]Static partition 
should also follow StoreAssignmentPolicy when insert into table
URL: https://github.com/apache/spark/pull/27597#issuecomment-589534209
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23513/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27597: [SPARK-30844][SQL]Static partition 
should also follow StoreAssignmentPolicy when insert into table
URL: https://github.com/apache/spark/pull/27597#issuecomment-589534205
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27597: [SPARK-30844][SQL]Static 
partition should also follow StoreAssignmentPolicy when insert into table
URL: https://github.com/apache/spark/pull/27597#issuecomment-589534205
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27597: [SPARK-30844][SQL]Static 
partition should also follow StoreAssignmentPolicy when insert into table
URL: https://github.com/apache/spark/pull/27597#issuecomment-589534209
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23513/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan closed pull request #27658: [SPARK-30894][SQL] Make Size's nullable independent from SQL config changes

2020-02-20 Thread GitBox
cloud-fan closed pull request #27658: [SPARK-30894][SQL] Make Size's nullable 
independent from SQL config changes
URL: https://github.com/apache/spark/pull/27658
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on issue #27658: [SPARK-30894][SQL] Make Size's nullable independent from SQL config changes

2020-02-20 Thread GitBox
cloud-fan commented on issue #27658: [SPARK-30894][SQL] Make Size's nullable 
independent from SQL config changes
URL: https://github.com/apache/spark/pull/27658#issuecomment-589534103
 
 
   thanks, merging to master/3.0!


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
SparkQA commented on issue #27597: [SPARK-30844][SQL]Static partition should 
also follow StoreAssignmentPolicy when insert into table
URL: https://github.com/apache/spark/pull/27597#issuecomment-589533846
 
 
   **[Test build #118762 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118762/testReport)**
 for PR 27597 at commit 
[`a70c89c`](https://github.com/apache/spark/commit/a70c89c1138bd1e643fa95de8cad7c0c152ff76b).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on a change in pull request #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
Ngone51 commented on a change in pull request #27597: [SPARK-30844][SQL]Static 
partition should also follow StoreAssignmentPolicy when insert into table
URL: https://github.com/apache/spark/pull/27597#discussion_r382432246
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
 ##
 @@ -104,7 +105,13 @@ case class DataSourceAnalysis(conf: SQLConf) extends 
Rule[LogicalPlan] with Cast
 None
   } else if (potentialSpecs.size == 1) {
 val partValue = potentialSpecs.head._2
-Some(Alias(cast(Literal(partValue), field.dataType), field.name)())
+conf.storeAssignmentPolicy match {
+  case StoreAssignmentPolicy.ANSI | StoreAssignmentPolicy.STRICT =>
 
 Review comment:
   Added, thanks.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on issue #27627: [WIP][SPARK-28067][SQL] Fix incorrect results for decimal aggregate sum by returning null on decimal overflow

2020-02-20 Thread GitBox
cloud-fan commented on issue #27627: [WIP][SPARK-28067][SQL] Fix incorrect 
results for decimal aggregate sum by returning null on decimal overflow
URL: https://github.com/apache/spark/pull/27627#issuecomment-589530921
 
 
   > Sum does not take care of possibility of overflow for the intermediate 
steps. ie the updateExpressions and mergeExpressions.
   
   I'm a little confused. These expressions are used in non-whole-stage-codegen 
as well, why only whole-stage-codegen has the problem? 


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27662: [MINOR][SQL] Fix error position of 
NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589529896
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27662: [MINOR][SQL] Fix error 
position of NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589529900
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118743/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27662: [MINOR][SQL] Fix error position of 
NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589529900
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118743/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27662: [MINOR][SQL] Fix error 
position of NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589529896
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
SparkQA removed a comment on issue #27662: [MINOR][SQL] Fix error position of 
NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589468236
 
 
   **[Test build #118743 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118743/testReport)**
 for PR 27662 at commit 
[`d467c7d`](https://github.com/apache/spark/commit/d467c7d27a448742b8738c99d59d8f337b63852a).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
SparkQA commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589529295
 
 
   **[Test build #118743 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118743/testReport)**
 for PR 27662 at commit 
[`d467c7d`](https://github.com/apache/spark/commit/d467c7d27a448742b8738c99d59d8f337b63852a).
* This patch passes all tests.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589526227
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118761/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's 
data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589526227
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118761/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589526219
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's 
data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589526219
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
SparkQA removed a comment on issue #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589521877
 
 
   **[Test build #118761 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118761/testReport)**
 for PR 27657 at commit 
[`7bbf40e`](https://github.com/apache/spark/commit/7bbf40e1451f96eb97a479d407735c81f6742279).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
SparkQA commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data 
type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589526122
 
 
   **[Test build #118761 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118761/testReport)**
 for PR 27657 at commit 
[`7bbf40e`](https://github.com/apache/spark/commit/7bbf40e1451f96eb97a479d407735c81f6742279).
* This patch **fails to generate documentation**.
* This patch merges cleanly.
* This patch adds the following public classes _(experimental)_:
 * `case class CreateArray(children: Seq[Expression], 
useStringTypeWhenEmpty: Boolean)`
 * `case class CreateMap(children: Seq[Expression], useStringTypeWhenEmpty: 
Boolean)`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on issue #27488: [SPARK-26580][SQL][ML][FOLLOW-UP] Throw exception when use untyped UDF by default

2020-02-20 Thread GitBox
Ngone51 commented on issue #27488: [SPARK-26580][SQL][ML][FOLLOW-UP] Throw 
exception when use untyped UDF by default
URL: https://github.com/apache/spark/pull/27488#issuecomment-589525950
 
 
   thanks all!


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
Ngone51 commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589526023
 
 
   thanks all!


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on issue #27540: Revert "[SPARK-28093][SQL] Fix TRIM/LTRIM/RTRIM function parameter order issue"

2020-02-20 Thread GitBox
Ngone51 commented on issue #27540: Revert "[SPARK-28093][SQL] Fix 
TRIM/LTRIM/RTRIM function parameter order issue"
URL: https://github.com/apache/spark/pull/27540#issuecomment-589525865
 
 
   thanks all!!


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#discussion_r382423049
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala
 ##
 @@ -606,7 +610,9 @@ object TypeCoercion {
   case None => m.values
 }
 
-CreateMap(newKeys.zip(newValues).flatMap { case (k, v) => Seq(k, v) })
+CreateMap(
 
 Review comment:
   ditto


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#discussion_r382422992
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala
 ##
 @@ -553,10 +553,14 @@ object TypeCoercion {
   // Skip nodes who's children have not been resolved yet.
   case e if !e.childrenResolved => e
 
-  case a @ CreateArray(children) if 
!haveSameType(children.map(_.dataType)) =>
+  case a @ CreateArray(children, useStringTypeWhenEmpty)
+if !haveSameType(children.map(_.dataType)) =>
 val types = children.map(_.dataType)
 findWiderCommonType(types) match {
-  case Some(finalDataType) => 
CreateArray(children.map(castIfNotSameType(_, finalDataType)))
+  case Some(finalDataType) => CreateArray(
 
 Review comment:
   nit: `a.copy(children = ...)`, then we don't need the 
`useStringTypeWhenEmpty` parameter and can just write `case a @ 
CreateArray(children, _)`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on issue #27661: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
HyukjinKwon commented on issue #27661: [SPARK-30869][SQL] Convert dates to/from 
timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27661#issuecomment-589523563
 
 
   Let's close and review #27618 


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on issue #27488: [SPARK-26580][SQL][ML][FOLLOW-UP] Throw exception when use untyped UDF by default

2020-02-20 Thread GitBox
cloud-fan commented on issue #27488: [SPARK-26580][SQL][ML][FOLLOW-UP] Throw 
exception when use untyped UDF by default
URL: https://github.com/apache/spark/pull/27488#issuecomment-589522387
 
 
   thanks, merging to master/3.0!


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan closed pull request #27488: [SPARK-26580][SQL][ML][FOLLOW-UP] Throw exception when use untyped UDF by default

2020-02-20 Thread GitBox
cloud-fan closed pull request #27488: [SPARK-26580][SQL][ML][FOLLOW-UP] Throw 
exception when use untyped UDF by default
URL: https://github.com/apache/spark/pull/27488
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
SparkQA commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data 
type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589521877
 
 
   **[Test build #118761 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118761/testReport)**
 for PR 27657 at commit 
[`7bbf40e`](https://github.com/apache/spark/commit/7bbf40e1451f96eb97a479d407735c81f6742279).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on a change in pull request #27604: [SPARK-30849][CORE][SHUFFLE]Fix application failed due to failed to get MapStatuses broadcast block

2020-02-20 Thread GitBox
Ngone51 commented on a change in pull request #27604: 
[SPARK-30849][CORE][SHUFFLE]Fix application failed due to failed to get 
MapStatuses broadcast block
URL: https://github.com/apache/spark/pull/27604#discussion_r382411144
 
 

 ##
 File path: core/src/main/scala/org/apache/spark/MapOutputTracker.scala
 ##
 @@ -824,11 +825,15 @@ private[spark] class MapOutputTrackerWorker(conf: 
SparkConf) extends MapOutputTr
   endPartition: Int): Iterator[(BlockManagerId, Seq[(BlockId, Long, 
Int)])] = {
 logDebug(s"Fetching outputs for shuffle $shuffleId, mappers 
$startMapIndex-$endMapIndex" +
   s"partitions $startPartition-$endPartition")
-val statuses = getStatuses(shuffleId, conf)
 try {
+  val statuses = getStatuses(shuffleId, conf)
   MapOutputTracker.convertMapStatuses(
 shuffleId, startPartition, endPartition, statuses, startMapIndex, 
endMapIndex)
 } catch {
+  case e: IOException if
+
Throwables.getCausalChain(e).asScala.exists(_.isInstanceOf[BlockNotFoundException])
 =>
+mapStatuses.clear()
+throw new MetadataFetchFailedException(shuffleId, -1, 
Throwables.getStackTraceAsString(e))
 
 Review comment:
   I'm wondering that we may could add a new type of `FetchFailedException` to 
indicate the whole `MapStatus` lost rather than a piece of it.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on a change in pull request #27604: [SPARK-30849][CORE][SHUFFLE]Fix application failed due to failed to get MapStatuses broadcast block

2020-02-20 Thread GitBox
Ngone51 commented on a change in pull request #27604: 
[SPARK-30849][CORE][SHUFFLE]Fix application failed due to failed to get 
MapStatuses broadcast block
URL: https://github.com/apache/spark/pull/27604#discussion_r382412090
 
 

 ##
 File path: core/src/main/scala/org/apache/spark/MapOutputTracker.scala
 ##
 @@ -824,11 +825,15 @@ private[spark] class MapOutputTrackerWorker(conf: 
SparkConf) extends MapOutputTr
   endPartition: Int): Iterator[(BlockManagerId, Seq[(BlockId, Long, 
Int)])] = {
 logDebug(s"Fetching outputs for shuffle $shuffleId, mappers 
$startMapIndex-$endMapIndex" +
   s"partitions $startPartition-$endPartition")
-val statuses = getStatuses(shuffleId, conf)
 try {
+  val statuses = getStatuses(shuffleId, conf)
 
 Review comment:
   I'd prefer to add another try catch when we call `broadcast.vaule` at:
   
   
https://github.com/apache/spark/blob/a2aa966ef64bc06f65a646777568427d360605e9/core/src/main/scala/org/apache/spark/MapOutputTracker.scala#L969
   
   and throw `FetchFailedException` when we catch `SparkException` from 
broadcast.
   
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's 
data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589520239
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23512/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27657: [SPARK-30899]: CreateArray/CreateMap's 
data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589520236
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] Ngone51 commented on a change in pull request #27604: [SPARK-30849][CORE][SHUFFLE]Fix application failed due to failed to get MapStatuses broadcast block

2020-02-20 Thread GitBox
Ngone51 commented on a change in pull request #27604: 
[SPARK-30849][CORE][SHUFFLE]Fix application failed due to failed to get 
MapStatuses broadcast block
URL: https://github.com/apache/spark/pull/27604#discussion_r382410631
 
 

 ##
 File path: 
core/src/main/scala/org/apache/spark/broadcast/TorrentBroadcast.scala
 ##
 @@ -186,7 +186,7 @@ private[spark] class TorrentBroadcast[T: ClassTag](obj: T, 
id: Long)
   }
   blocks(pid) = new ByteBufferBlockData(b, true)
 case None =>
-  throw new SparkException(s"Failed to get $pieceId of 
$broadcastId")
 
 Review comment:
   Why not just catch `SparkException` outside?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589520236
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: CreateArray/CreateMap's data type should not depend on SQLConf.get

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27657: [SPARK-30899]: 
CreateArray/CreateMap's data type should not depend on SQLConf.get
URL: https://github.com/apache/spark/pull/27657#issuecomment-589520239
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23512/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
SparkQA commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589519875
 
 
   **[Test build #118760 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118760/testReport)**
 for PR 27659 at commit 
[`95ddfab`](https://github.com/apache/spark/commit/95ddfab6e953692cf4f1fcc6d2c523209c6ba2a3).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off 
AQE in CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589518224
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589518226
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23511/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589518224
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off 
AQE in CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589518226
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23511/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
cloud-fan commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589517992
 
 
   retest this please


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27618: [SPARK-30869][SQL] 
Convert dates to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#discussion_r382416037
 
 

 ##
 File path: 
sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/util/DateTimeUtilsSuite.scala
 ##
 @@ -608,16 +608,16 @@ class DateTimeUtilsSuite extends SparkFunSuite with 
Matchers with SQLHelper {
   val skipped = skipped_days.getOrElse(tz.getID, Set.empty)
   (-2 to 2).foreach { d =>
 if (!skipped.contains(d)) {
-  assert(millisToDays(daysToMillis(d, tz.toZoneId), tz.toZoneId) === d,
+  assert(microsToDays(daysToMicros(d, tz.toZoneId), tz.toZoneId) === d,
 s"Round trip of ${d} did not work in tz ${tz}")
 }
   }
 }
   }
 
   test("toMillis") {
 
 Review comment:
   `microsToMillis`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off 
AQE in CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589516919
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118753/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27659: [SPARK-30906][SQL] Turning off 
AQE in CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589516913
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27618: [SPARK-30869][SQL] 
Convert dates to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#discussion_r382415458
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala
 ##
 @@ -88,14 +86,14 @@ object DateTimeUtils {
* Returns the number of days since epoch from java.sql.Date.
*/
   def fromJavaDate(date: Date): SQLDate = {
-millisToDays(date.getTime)
+microsToDays(Math.multiplyExact(date.getTime, MICROS_PER_MILLIS))
   }
 
   /**
* Returns a java.sql.Date from number of days since epoch.
*/
   def toJavaDate(daysSinceEpoch: SQLDate): Date = {
-new Date(daysToMillis(daysSinceEpoch))
+new Date(daysToMicros(daysSinceEpoch) / MICROS_PER_MILLIS)
 
 Review comment:
   ditto, `microsToMillis(daysToMicros(daysSinceEpoch))`?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
SparkQA commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589516807
 
 
   **[Test build #118753 has 
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118753/testReport)**
 for PR 27659 at commit 
[`95ddfab`](https://github.com/apache/spark/commit/95ddfab6e953692cf4f1fcc6d2c523209c6ba2a3).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589516919
 
 
   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/118753/
   Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27618: [SPARK-30869][SQL] 
Convert dates to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#discussion_r382415344
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala
 ##
 @@ -88,14 +86,14 @@ object DateTimeUtils {
* Returns the number of days since epoch from java.sql.Date.
*/
   def fromJavaDate(date: Date): SQLDate = {
-millisToDays(date.getTime)
+microsToDays(Math.multiplyExact(date.getTime, MICROS_PER_MILLIS))
 
 Review comment:
   shall we call `millisToMicros(date.getTime)` instead of 
`Math.multiplyExact(date.getTime, MICROS_PER_MILLIS)`?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA removed a comment on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
SparkQA removed a comment on issue #27659: [SPARK-30906][SQL] Turning off AQE 
in CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589499680
 
 
   **[Test build #118753 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118753/testReport)**
 for PR 27659 at commit 
[`95ddfab`](https://github.com/apache/spark/commit/95ddfab6e953692cf4f1fcc6d2c523209c6ba2a3).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in CacheManager is not thread-safe

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27659: [SPARK-30906][SQL] Turning off AQE in 
CacheManager is not thread-safe
URL: https://github.com/apache/spark/pull/27659#issuecomment-589516913
 
 
   Merged build finished. Test FAILed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
maropu commented on issue #27662: [MINOR][SQL] Fix error position of NOSCAN
URL: https://github.com/apache/spark/pull/27662#issuecomment-589516586
 
 
   Thanks! Merged to master/3.0.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589516320
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589516325
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23510/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27618: [SPARK-30869][SQL] Convert dates 
to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#issuecomment-589516297
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23509/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27618: [SPARK-30869][SQL] Convert dates 
to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#issuecomment-589516294
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589516325
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23510/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27618: [SPARK-30869][SQL] Convert 
dates to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#issuecomment-589516294
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu closed pull request #27662: [MINOR][SQL] Fix error position of NOSCAN

2020-02-20 Thread GitBox
maropu closed pull request #27662: [MINOR][SQL] Fix error position of NOSCAN
URL: https://github.com/apache/spark/pull/27662
 
 
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27618: [SPARK-30869][SQL] Convert 
dates to/from timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#issuecomment-589516297
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23509/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589516320
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] 
Refactor DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#discussion_r382414618
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/TimestampFormatter.scala
 ##
 @@ -144,7 +144,7 @@ class LegacyFastTimestampFormatter(
 cal.getTimeInMillis * MICROS_PER_MILLIS + micros
   }
 
-  def format(timestamp: SQLTimestamp): String = {
+  def format(timestamp: Long): String = {
 
 Review comment:
   `microsec`?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27618: [SPARK-30869][SQL] Convert dates to/from timestamps in microseconds precision

2020-02-20 Thread GitBox
SparkQA commented on issue #27618: [SPARK-30869][SQL] Convert dates to/from 
timestamps in microseconds precision
URL: https://github.com/apache/spark/pull/27618#issuecomment-589516028
 
 
   **[Test build #118758 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118758/testReport)**
 for PR 27618 at commit 
[`f78d773`](https://github.com/apache/spark/commit/f78d773c41b7843c1979f7a462384b9339f30b51).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
SparkQA commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589516013
 
 
   **[Test build #118759 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118759/testReport)**
 for PR 27617 at commit 
[`c475f27`](https://github.com/apache/spark/commit/c475f271b24f8d9ef7bbbc0f93a238600d70a717).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] 
Refactor DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#discussion_r382414377
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala
 ##
 @@ -407,53 +398,53 @@ object DateTimeUtils {
 }
   }
 
-  private def localTimestamp(microsec: SQLTimestamp, zoneId: ZoneId): 
LocalDateTime = {
+  private def localTimestamp(microsec: Long, zoneId: ZoneId): LocalDateTime = {
 microsToInstant(microsec).atZone(zoneId).toLocalDateTime
   }
 
   /**
* Returns the hour value of a given timestamp value. The timestamp is 
expressed in microseconds.
*/
-  def getHours(microsec: SQLTimestamp, zoneId: ZoneId): Int = {
+  def getHours(microsec: Long, zoneId: ZoneId): Int = {
 localTimestamp(microsec, zoneId).getHour
   }
 
   /**
* Returns the minute value of a given timestamp value. The timestamp is 
expressed in
* microseconds.
*/
-  def getMinutes(microsec: SQLTimestamp, zoneId: ZoneId): Int = {
+  def getMinutes(microsec: Long, zoneId: ZoneId): Int = {
 localTimestamp(microsec, zoneId).getMinute
   }
 
   /**
* Returns the second value of a given timestamp value. The timestamp is 
expressed in
* microseconds.
*/
-  def getSeconds(microsec: SQLTimestamp, zoneId: ZoneId): Int = {
+  def getSeconds(microsec: Long, zoneId: ZoneId): Int = {
 localTimestamp(microsec, zoneId).getSecond
   }
 
   /**
* Returns the seconds part and its fractional part with microseconds.
*/
-  def getSecondsWithFraction(microsec: SQLTimestamp, zoneId: ZoneId): Decimal 
= {
+  def getSecondsWithFraction(microsec: Long, zoneId: ZoneId): Decimal = {
 Decimal(getMicroseconds(microsec, zoneId), 8, 6)
   }
 
   /**
* Returns seconds, including fractional parts, multiplied by 1000. The 
timestamp
* is expressed in microseconds since the epoch.
*/
-  def getMilliseconds(timestamp: SQLTimestamp, zoneId: ZoneId): Decimal = {
+  def getMilliseconds(timestamp: Long, zoneId: ZoneId): Decimal = {
 
 Review comment:
   shall we be consistent and use parameter name "microsec"?


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on a change in pull request #27651: [SPARK-30903][SQL] Fail fast on duplicate columns when analyze columns

2020-02-20 Thread GitBox
maropu commented on a change in pull request #27651: [SPARK-30903][SQL] Fail 
fast on duplicate columns when analyze columns
URL: https://github.com/apache/spark/pull/27651#discussion_r382414058
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala
 ##
 @@ -3202,8 +3202,16 @@ class AstBuilder(conf: SQLConf) extends 
SqlBaseBaseVisitor[AnyRef] with Logging
   AnalyzeTableStatement(tableName, partitionSpec, noScan = ctx.identifier 
!= null)
 } else {
   checkPartitionSpec()
-  AnalyzeColumnStatement(
-tableName, Option(visitIdentifierSeq(ctx.identifierSeq())), allColumns 
= false)
+  val colsOpt = Option(visitIdentifierSeq(ctx.identifierSeq()))
+  colsOpt.foreach { cols =>
+val duplicates = cols.groupBy(identity).filter(_._2.size > 1).keys
+if (duplicates.nonEmpty) {
+  throw new ParseException(
 
 Review comment:
   +1


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on a change in pull request #27651: [SPARK-30903][SQL] Fail fast on duplicate columns when analyze columns

2020-02-20 Thread GitBox
maropu commented on a change in pull request #27651: [SPARK-30903][SQL] Fail 
fast on duplicate columns when analyze columns
URL: https://github.com/apache/spark/pull/27651#discussion_r382414020
 
 

 ##
 File path: 
sql/core/src/test/scala/org/apache/spark/sql/StatisticsCollectionSuite.scala
 ##
 @@ -650,4 +651,17 @@ class StatisticsCollectionSuite extends 
StatisticsCollectionTestBase with Shared
   }
 }
   }
+
+  test("SPARK-30903: Fail fast on duplicate columns when analyze columns") {
+val table = "test_table"
+withTable(table) {
+  sql(s"""
+   |CREATE TABLE $table (value string, name string)
+   |USING PARQUET""".stripMargin)
+  val errorMsg = intercept[ParseException] {
+sql(s"ANALYZE TABLE $table COMPUTE STATISTICS FOR COLUMNS value, name, 
name, value")
 
 Review comment:
   Yea, we need some refactoring for this use... the refactoring itself looks 
ok to me.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] 
Refactor DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#discussion_r382413997
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala
 ##
 @@ -138,25 +129,25 @@ object DateTimeUtils {
* Converts the timestamp to milliseconds since epoch. In spark timestamp 
values have microseconds
* precision, so this conversion is lossy.
*/
-  def toMillis(us: SQLTimestamp): Long = {
+  def toMillis(us: Long): Long = {
 // When the timestamp is negative i.e before 1970, we need to adjust the 
millseconds portion.
 // Example - 1965-01-01 10:11:12.123456 is represented as 
(-157700927876544) in micro precision.
 // In millis precision the above needs to be represented as 
(-157700927877).
 Math.floorDiv(us, MICROS_PER_MILLIS)
   }
 
   /*
-   * Converts milliseconds since epoch to SQLTimestamp.
+   * Converts milliseconds since the epoch to microseconds.
*/
-  def fromMillis(millis: Long): SQLTimestamp = {
+  def fromMillis(millis: Long): Long = {
 
 Review comment:
   ditto


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] iRakson commented on issue #27580: [SPARK-27619][SQL]MapType should be prohibited in hash expressions

2020-02-20 Thread GitBox
iRakson commented on issue #27580: [SPARK-27619][SQL]MapType should be 
prohibited in hash expressions
URL: https://github.com/apache/spark/pull/27580#issuecomment-589515559
 
 
   retest this please


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27617: [SPARK-30865][SQL] 
Refactor DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#discussion_r382413974
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala
 ##
 @@ -138,25 +129,25 @@ object DateTimeUtils {
* Converts the timestamp to milliseconds since epoch. In spark timestamp 
values have microseconds
* precision, so this conversion is lossy.
*/
-  def toMillis(us: SQLTimestamp): Long = {
+  def toMillis(us: Long): Long = {
 
 Review comment:
   nit: shall we make the name clearer? `microsToMillis`


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] iRakson commented on issue #27648: [SPARK-24994][SQL] : Support filter pushdown for short and byte without explicit casting

2020-02-20 Thread GitBox
iRakson commented on issue #27648: [SPARK-24994][SQL] : Support filter pushdown 
for short and byte without explicit casting
URL: https://github.com/apache/spark/pull/27648#issuecomment-589515318
 
 
   
   > Is that pushdown legal? if `id` is short, `id = 2` is parsed as `cast(id 
as int) = 2`. So, I feel that the semantics are different between them.
   
   Naive users might not cast `literal` to `shortType` when passing a value 
within short range.
   Also, if there are large number of queries which deals with short and byte 
fields, it is cumbersome to always explicitly cast them. 
   In those cases users can't get push-down benefits.
   
   In my opinion we should push the filter if `literals` are in `short`/`byte` 
range.
   


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589512791
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589512797
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23508/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins removed a comment on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589512791
 
 
   Merged build finished. Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27651: [SPARK-30903][SQL] Fail fast on duplicate columns when analyze columns

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27651: [SPARK-30903][SQL] Fail 
fast on duplicate columns when analyze columns
URL: https://github.com/apache/spark/pull/27651#discussion_r382411322
 
 

 ##
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/AstBuilder.scala
 ##
 @@ -3202,8 +3202,16 @@ class AstBuilder(conf: SQLConf) extends 
SqlBaseBaseVisitor[AnyRef] with Logging
   AnalyzeTableStatement(tableName, partitionSpec, noScan = ctx.identifier 
!= null)
 } else {
   checkPartitionSpec()
-  AnalyzeColumnStatement(
-tableName, Option(visitIdentifierSeq(ctx.identifierSeq())), allColumns 
= false)
+  val colsOpt = Option(visitIdentifierSeq(ctx.identifierSeq()))
+  colsOpt.foreach { cols =>
+val duplicates = cols.groupBy(identity).filter(_._2.size > 1).keys
+if (duplicates.nonEmpty) {
+  throw new ParseException(
 
 Review comment:
   shall we check it in `CheckAnalysis`? It's not a parser error really


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
AmplabJenkins commented on issue #27617: [SPARK-30865][SQL] Refactor 
DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589512797
 
 
   Test PASSed.
   Refer to this link for build results (access rights to CI server needed): 
   
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/23508/
   Test PASSed.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] SparkQA commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils

2020-02-20 Thread GitBox
SparkQA commented on issue #27617: [SPARK-30865][SQL] Refactor DateTimeUtils
URL: https://github.com/apache/spark/pull/27617#issuecomment-589512485
 
 
   **[Test build #118757 has 
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/118757/testReport)**
 for PR 27617 at commit 
[`9ba2d3d`](https://github.com/apache/spark/commit/9ba2d3d4ceb8f7b863ef0887f4e2c879a4ef05d4).


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] cloud-fan commented on a change in pull request #27597: [SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy when insert into table

2020-02-20 Thread GitBox
cloud-fan commented on a change in pull request #27597: 
[SPARK-30844][SQL]Static partition should also follow StoreAssignmentPolicy 
when insert into table
URL: https://github.com/apache/spark/pull/27597#discussion_r382410965
 
 

 ##
 File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala
 ##
 @@ -104,7 +105,13 @@ case class DataSourceAnalysis(conf: SQLConf) extends 
Rule[LogicalPlan] with Cast
 None
   } else if (potentialSpecs.size == 1) {
 val partValue = potentialSpecs.head._2
-Some(Alias(cast(Literal(partValue), field.dataType), field.name)())
+conf.storeAssignmentPolicy match {
+  case StoreAssignmentPolicy.ANSI | StoreAssignmentPolicy.STRICT =>
 
 Review comment:
   yea let's add some comments


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] maropu commented on issue #27648: [SPARK-24994][SQL] : Support filter pushdown for short and byte without explicit casting

2020-02-20 Thread GitBox
maropu commented on issue #27648: [SPARK-24994][SQL] : Support filter pushdown 
for short and byte without explicit casting
URL: https://github.com/apache/spark/pull/27648#issuecomment-589512300
 
 
   > 2 is within short range still filter is not pushed down.
   > But it pushes filter if we cast the literal to smallint:
   > select * from t where id = cast(2 as smallint) ; 
   
   Is that pushdown legal? if `id` is short, `id = 2` is parsed as `cast(id as 
int) = 2`. So, I feel that the semantics  are different between them.


This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



  1   2   3   4   5   6   7   8   9   10   >