[jira] [Commented] (SPARK-30440) Flaky test: org.apache.spark.scheduler.TaskSetManagerSuite.reset

2020-01-06 Thread Ajith S (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009433#comment-17009433 ] Ajith S commented on SPARK-30440: - Found a race between reviveOffers in

[jira] [Commented] (SPARK-30440) Flaky test: org.apache.spark.scheduler.TaskSetManagerSuite.reset

2020-01-06 Thread wuyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009427#comment-17009427 ] wuyi commented on SPARK-30440: -- [~kabhwan] thanks for reporting it. Let me take a look. > Flaky test:

[jira] [Commented] (SPARK-30429) WideSchemaBenchmark fails with OOM

2020-01-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009420#comment-17009420 ] Dongjoon Hyun commented on SPARK-30429: --- Thank you so much for the investigation! cc [~smilegator]

[jira] [Commented] (SPARK-30440) Flaky test: org.apache.spark.scheduler.TaskSetManagerSuite.reset

2020-01-06 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009407#comment-17009407 ] Jungtaek Lim commented on SPARK-30440: -- cc. [~Ngone51] since the failing test is added from

[jira] [Commented] (SPARK-30429) WideSchemaBenchmark fails with OOM

2020-01-06 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009398#comment-17009398 ] Maxim Gekk commented on SPARK-30429: Bisect have found the first bad commit. I specified the recent

[jira] [Comment Edited] (SPARK-30411) saveAsTable does not honor spark.hadoop.hive.warehouse.subdir.inherit.perms

2020-01-06 Thread Sanket Reddy (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009381#comment-17009381 ] Sanket Reddy edited comment on SPARK-30411 at 1/7/20 5:39 AM: -- [~yumwang]  

[jira] [Updated] (SPARK-30432) reduce degree recomputation in StronglyConnectedComponents

2020-01-06 Thread li xiaosen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] li xiaosen updated SPARK-30432: --- Shepherd: (was: Saisai Shao) > reduce degree recomputation in StronglyConnectedComponents >

[jira] [Commented] (SPARK-30411) saveAsTable does not honor spark.hadoop.hive.warehouse.subdir.inherit.perms

2020-01-06 Thread Sanket Reddy (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009381#comment-17009381 ] Sanket Reddy commented on SPARK-30411: -- [~yumwang]  

[jira] [Resolved] (SPARK-30414) Optimizations for arrays and maps in ParquetRowConverter

2020-01-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30414. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 27089

[jira] [Resolved] (SPARK-30433) Make conflict attributes resolution more scalable in ResolveReferences

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-30433. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 27105

[jira] [Assigned] (SPARK-30433) Make conflict attributes resolution more scalable in ResolveReferences

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-30433: --- Assignee: wuyi > Make conflict attributes resolution more scalable in ResolveReferences >

[jira] [Commented] (SPARK-30444) The same job will be computated for many times when using Dataset.show()

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009370#comment-17009370 ] Aman Omer commented on SPARK-30444: --- I am looking into this one. > The same job will be computated

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:47 AM:

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:25 AM:

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:17 AM:

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:16 AM: Hi Steve

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:14 AM:

[jira] [Updated] (SPARK-30432) reduce degree recomputation in StronglyConnectedComponents

2020-01-06 Thread li xiaosen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] li xiaosen updated SPARK-30432: --- Description:   So the computation happens every time in the do-while loop, the first time the

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:10 AM: Hi Steve

[jira] [Comment Edited] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale edited comment on SPARK-2984 at 1/7/20 4:09 AM: Hi Steve

[jira] [Commented] (SPARK-2984) FileNotFoundException on _temporary directory

2020-01-06 Thread Andrew F Vitale (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009361#comment-17009361 ] Andrew F Vitale commented on SPARK-2984: Hi Steve Is your recommended solution for HDFS still

[jira] [Updated] (SPARK-30432) reduce degree recomputation in StronglyConnectedComponents

2020-01-06 Thread li xiaosen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] li xiaosen updated SPARK-30432: --- Fix Version/s: (was: 2.4.4) Target Version/s: 2.4.5, 3.0.0 (was: 2.4.4, 2.4.5)

[jira] [Resolved] (SPARK-25403) Broadcast join is changing to sort merge join , after spark-beeline session restarts.

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-25403. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 22721

[jira] [Assigned] (SPARK-25403) Broadcast join is changing to sort merge join , after spark-beeline session restarts.

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-25403: --- Assignee: Yuming Wang > Broadcast join is changing to sort merge join , after

[jira] [Assigned] (SPARK-19784) refresh datasource table after alter the location

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-19784: --- Assignee: Yuming Wang > refresh datasource table after alter the location >

[jira] [Resolved] (SPARK-19784) refresh datasource table after alter the location

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19784. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 22721

[jira] [Resolved] (SPARK-30260) Spark-Shell throw ClassNotFoundException exception for more than one statement to use UDF jar

2020-01-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-30260. --- Resolution: Duplicate > Spark-Shell throw ClassNotFoundException exception for more than

[jira] [Updated] (SPARK-30260) Spark-Shell throw ClassNotFoundException exception for more than one statement to use UDF jar

2020-01-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-30260: -- Fix Version/s: (was: 2.4.3) (was: 2.3.0) > Spark-Shell throw

[jira] [Updated] (SPARK-30260) Spark-Shell throw ClassNotFoundException exception for more than one statement to use UDF jar

2020-01-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-30260: -- Target Version/s: (was: 2.3.0, 2.4.3) > Spark-Shell throw ClassNotFoundException exception

[jira] [Commented] (SPARK-30411) saveAsTable does not honor spark.hadoop.hive.warehouse.subdir.inherit.perms

2020-01-06 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009293#comment-17009293 ] Yuming Wang commented on SPARK-30411: - Please see

[jira] [Updated] (SPARK-30444) The same job will be computated for many times when using Dataset.show()

2020-01-06 Thread Dong Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Wang updated SPARK-30444: -- Description: When I run the example sql.SparkSQLExample, df.show() at line 60 would trigger an

[jira] [Updated] (SPARK-30444) The same job will be computated for many times when using Dataset.show()

2020-01-06 Thread Dong Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Wang updated SPARK-30444: -- Description: When I run the example sql.SparkSQLExample, df.show() at line 60 would trigger an

[jira] [Created] (SPARK-30444) The same job will be computated for many times when using Dataset.show()

2020-01-06 Thread Dong Wang (Jira)
Dong Wang created SPARK-30444: - Summary: The same job will be computated for many times when using Dataset.show() Key: SPARK-30444 URL: https://issues.apache.org/jira/browse/SPARK-30444 Project: Spark

[jira] [Comment Edited] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix()

2020-01-06 Thread Ping Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009273#comment-17009273 ] Ping Liu edited comment on SPARK-27842 at 1/7/20 1:21 AM: -- Hi Peter, I just

[jira] [Comment Edited] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix()

2020-01-06 Thread Ping Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009273#comment-17009273 ] Ping Liu edited comment on SPARK-27842 at 1/7/20 1:19 AM: -- Hi Peter, I just

[jira] [Comment Edited] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix()

2020-01-06 Thread Ping Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009273#comment-17009273 ] Ping Liu edited comment on SPARK-27842 at 1/7/20 1:20 AM: -- Hi Peter, I just

[jira] [Comment Edited] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix()

2020-01-06 Thread Ping Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009273#comment-17009273 ] Ping Liu edited comment on SPARK-27842 at 1/7/20 1:17 AM: -- Hi Peter, I just

[jira] [Commented] (SPARK-27842) Inconsistent results of Statistics.corr() and PearsonCorrelation.computeCorrelationMatrix()

2020-01-06 Thread Ping Liu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009273#comment-17009273 ] Ping Liu commented on SPARK-27842: -- Hi Peter, I just did some investigation.  I found this is probably

[jira] [Resolved] (SPARK-30430) Add a note that UserDefinedFunction's constructor is private

2020-01-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-30430. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 27101

[jira] [Assigned] (SPARK-30430) Add a note that UserDefinedFunction's constructor is private

2020-01-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-30430: Assignee: Hyukjin Kwon > Add a note that UserDefinedFunction's constructor is private >

[jira] [Resolved] (SPARK-30154) PySpark UDF to convert MLlib vectors to dense arrays

2020-01-06 Thread Xiangrui Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-30154. --- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26910

[jira] [Created] (SPARK-30443) "Managed memory leak detected" even with no calls to take() or limit()

2020-01-06 Thread Luke Richter (Jira)
Luke Richter created SPARK-30443: Summary: "Managed memory leak detected" even with no calls to take() or limit() Key: SPARK-30443 URL: https://issues.apache.org/jira/browse/SPARK-30443 Project:

[jira] [Commented] (SPARK-30429) WideSchemaBenchmark fails with OOM

2020-01-06 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009189#comment-17009189 ] Maxim Gekk commented on SPARK-30429: [~dongjoon] I ran git bisect. Let see what it will find during

[jira] [Created] (SPARK-30442) Write mode ignored when using CodecStreams

2020-01-06 Thread Jesse Collins (Jira)
Jesse Collins created SPARK-30442: - Summary: Write mode ignored when using CodecStreams Key: SPARK-30442 URL: https://issues.apache.org/jira/browse/SPARK-30442 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-30429) WideSchemaBenchmark fails with OOM

2020-01-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009118#comment-17009118 ] Dongjoon Hyun commented on SPARK-30429: --- Thank you for reporting, [~maxgekk]. Do you have any idea

[jira] [Assigned] (SPARK-30313) Flaky test: MasterSuite.master/worker web ui available with reverseProxy

2020-01-06 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin reassigned SPARK-30313: -- Assignee: Jungtaek Lim > Flaky test: MasterSuite.master/worker web

[jira] [Resolved] (SPARK-30313) Flaky test: MasterSuite.master/worker web ui available with reverseProxy

2020-01-06 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-30313. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Updated] (SPARK-30441) Improve the memory usage in StronglyConnectedComponents

2020-01-06 Thread jiamuzhou (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiamuzhou updated SPARK-30441: -- Affects Version/s: 2.1.0 2.3.0 2.4.0 > Improve the

[jira] [Updated] (SPARK-30441) Improve the memory usage in StronglyConnectedComponents

2020-01-06 Thread jiamuzhou (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiamuzhou updated SPARK-30441: -- Description: This is very consume memory when It use StronglyConnectedComponents(see figure1.png). 

[jira] [Updated] (SPARK-30441) Improve the memory usage in StronglyConnectedComponents

2020-01-06 Thread jiamuzhou (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiamuzhou updated SPARK-30441: -- Fix Version/s: (was: 3.1.0) (was: 2.4.5) Target Version/s: 3.0.0

[jira] [Created] (SPARK-30441) Improve the memory usage in StronglyConnectedComponents

2020-01-06 Thread jiamuzhou (Jira)
jiamuzhou created SPARK-30441: - Summary: Improve the memory usage in StronglyConnectedComponents Key: SPARK-30441 URL: https://issues.apache.org/jira/browse/SPARK-30441 Project: Spark Issue

[jira] [Updated] (SPARK-30441) Improve the memory usage in StronglyConnectedComponents

2020-01-06 Thread jiamuzhou (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiamuzhou updated SPARK-30441: -- Description: This is very consume memory when It use StronglyConnectedComponents(see figure1.png). 

[jira] [Updated] (SPARK-30441) Improve the memory usage in StronglyConnectedComponents

2020-01-06 Thread jiamuzhou (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiamuzhou updated SPARK-30441: -- Attachment: figure2.png figure1.png > Improve the memory usage in

[jira] [Resolved] (SPARK-30226) Remove withXXX functions in WriteBuilder

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-30226. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26678

[jira] [Assigned] (SPARK-30226) Remove withXXX functions in WriteBuilder

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-30226: --- Assignee: Ximo Guanter > Remove withXXX functions in WriteBuilder >

[jira] [Assigned] (SPARK-29800) Rewrite non-correlated EXISTS subquery use ScalaSubquery to optimize perf

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-29800: --- Assignee: angerszhu > Rewrite non-correlated EXISTS subquery use ScalaSubquery to optimize

[jira] [Resolved] (SPARK-29800) Rewrite non-correlated EXISTS subquery use ScalaSubquery to optimize perf

2020-01-06 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-29800. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26437

[jira] [Created] (SPARK-30439) support NOT NULL in column data type

2020-01-06 Thread Wenchen Fan (Jira)
Wenchen Fan created SPARK-30439: --- Summary: support NOT NULL in column data type Key: SPARK-30439 URL: https://issues.apache.org/jira/browse/SPARK-30439 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-30440) Flaky test: org.apache.spark.scheduler.TaskSetManagerSuite.reset

2020-01-06 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-30440: Summary: Flaky test: org.apache.spark.scheduler.TaskSetManagerSuite.reset Key: SPARK-30440 URL: https://issues.apache.org/jira/browse/SPARK-30440 Project: Spark

[jira] [Created] (SPARK-30438) Flaky test: org.apache.spark.deploy.StandaloneDynamicAllocationSuite."dynamic allocation default behavior"

2020-01-06 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-30438: Summary: Flaky test: org.apache.spark.deploy.StandaloneDynamicAllocationSuite."dynamic allocation default behavior" Key: SPARK-30438 URL:

[jira] [Updated] (SPARK-30437) Uneven spaces for some fields in EXPLAIN FORMATTED

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aman Omer updated SPARK-30437: -- Summary: Uneven spaces for some fields in EXPLAIN FORMATTED (was: Uneven spaces for some fields for

[jira] [Created] (SPARK-30437) Uneven spaces for some fields for EXPLAIN FORMATTED

2020-01-06 Thread Aman Omer (Jira)
Aman Omer created SPARK-30437: - Summary: Uneven spaces for some fields for EXPLAIN FORMATTED Key: SPARK-30437 URL: https://issues.apache.org/jira/browse/SPARK-30437 Project: Spark Issue Type:

[jira] [Commented] (SPARK-30436) CREATE EXTERNAL TABLE doesn't work without STORED AS

2020-01-06 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008793#comment-17008793 ] Jungtaek Lim commented on SPARK-30436: -- Working on this. Given create external table is only valid

[jira] [Created] (SPARK-30436) CREATE EXTERNAL TABLE doesn't work without STORED AS

2020-01-06 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-30436: Summary: CREATE EXTERNAL TABLE doesn't work without STORED AS Key: SPARK-30436 URL: https://issues.apache.org/jira/browse/SPARK-30436 Project: Spark Issue

[jira] [Updated] (SPARK-30400) Test failure in SQL module on ppc64le

2020-01-06 Thread AK97 (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] AK97 updated SPARK-30400: - Component/s: (was: Build) Tests SQL > Test failure in SQL module on

[jira] [Updated] (SPARK-30400) Test failure in SQL module on ppc64le

2020-01-06 Thread AK97 (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] AK97 updated SPARK-30400: - Issue Type: Bug (was: Test) > Test failure in SQL module on ppc64le > - >

[jira] [Commented] (SPARK-30421) Dropped columns still available for filtering

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008708#comment-17008708 ] Aman Omer commented on SPARK-30421: --- I am panning to add a marker to Project when it is introduced via

[jira] [Created] (SPARK-30435) update Spark SQL guide of Supported Hive Features

2020-01-06 Thread angerszhu (Jira)
angerszhu created SPARK-30435: - Summary: update Spark SQL guide of Supported Hive Features Key: SPARK-30435 URL: https://issues.apache.org/jira/browse/SPARK-30435 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-30421) Dropped columns still available for filtering

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008705#comment-17008705 ] Aman Omer commented on SPARK-30421: --- In Analysis planning phase, rule _*ResolveMissingReferences*_ is

[jira] [Created] (SPARK-30434) Move pandas related functionalities into 'pandas' sub-package

2020-01-06 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-30434: Summary: Move pandas related functionalities into 'pandas' sub-package Key: SPARK-30434 URL: https://issues.apache.org/jira/browse/SPARK-30434 Project: Spark

[jira] [Commented] (SPARK-30421) Dropped columns still available for filtering

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008699#comment-17008699 ] Aman Omer commented on SPARK-30421: --- Plans for {code:java} df.drop("bar").where($"bar" ===

[jira] [Comment Edited] (SPARK-30421) Dropped columns still available for filtering

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008699#comment-17008699 ] Aman Omer edited comment on SPARK-30421 at 1/6/20 10:07 AM: Plans for

[jira] [Updated] (SPARK-30433) Make conflict attributes resolution more scalable in ResolveReferences

2020-01-06 Thread wuyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wuyi updated SPARK-30433: - Summary: Make conflict attributes resolution more scalable in ResolveReferences (was: Make conflict attributes

[jira] [Created] (SPARK-30433) Make conflict attributes resolution more scalable

2020-01-06 Thread wuyi (Jira)
wuyi created SPARK-30433: Summary: Make conflict attributes resolution more scalable Key: SPARK-30433 URL: https://issues.apache.org/jira/browse/SPARK-30433 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-30432) reduce degree recomputation in StronglyConnectedComponents

2020-01-06 Thread li xiaosen (Jira)
li xiaosen created SPARK-30432: -- Summary: reduce degree recomputation in StronglyConnectedComponents Key: SPARK-30432 URL: https://issues.apache.org/jira/browse/SPARK-30432 Project: Spark Issue

[jira] [Commented] (SPARK-30421) Dropped columns still available for filtering

2020-01-06 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008640#comment-17008640 ] Aman Omer commented on SPARK-30421: --- Thanks [~tobias_hermann] for reporting this issue. It seems,

[jira] [Updated] (SPARK-30381) GBT reuse treePoints for all trees

2020-01-06 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-30381: - Summary: GBT reuse treePoints for all trees (was: GBT reuse splits for all trees) > GBT reuse

[jira] [Created] (SPARK-30431) Update SqlBase.g4 to create commentSpec pattern as same as locationSpec

2020-01-06 Thread Kent Yao (Jira)
Kent Yao created SPARK-30431: Summary: Update SqlBase.g4 to create commentSpec pattern as same as locationSpec Key: SPARK-30431 URL: https://issues.apache.org/jira/browse/SPARK-30431 Project: Spark

[jira] [Updated] (SPARK-30430) Add a note that UserDefinedFunction's constructor is private

2020-01-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-30430: - Priority: Trivial (was: Major) > Add a note that UserDefinedFunction's constructor is private

[jira] [Updated] (SPARK-30430) Add a note that UserDefinedFunction's constructor is private

2020-01-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-30430: - Issue Type: Documentation (was: New Feature) > Add a note that UserDefinedFunction's

[jira] [Created] (SPARK-30430) Add a note that UserDefinedFunction's constructor is private

2020-01-06 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-30430: Summary: Add a note that UserDefinedFunction's constructor is private Key: SPARK-30430 URL: https://issues.apache.org/jira/browse/SPARK-30430 Project: Spark