[jira] [Resolved] (SPARK-27916) SparkThreatServer memory leak when 'spark.sql.hive.thriftServer.singleSession' enabled

2019-08-07 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu resolved SPARK-27916. --- Resolution: Invalid > SparkThreatServer memory leak when >

[jira] [Updated] (SPARK-28169) Spark can’t push down partition predicate for OR expression

2019-08-07 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28169: -- Summary: Spark can’t push down partition predicate for OR expression (was: Spark can’t push down

[jira] [Resolved] (SPARK-28618) SparkSQL Plan CollectLimitExec may cause OOM

2019-08-05 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu resolved SPARK-28618. --- Resolution: Auto Closed > SparkSQL Plan CollectLimitExec may cause OOM >

[jira] [Updated] (SPARK-28618) SparkSQL Plan CollectLimitExec may cause OOM

2019-08-05 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28618: -- Description: When we call a limit in SQL or DataFrame, it will execute like below,if we call

[jira] [Updated] (SPARK-28618) SparkSQL Plan CollectLimitExec may cause OOM

2019-08-05 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28618: -- Description: When we call a limit in SQL or DataFrame, it will execute like below,if we call

[jira] [Updated] (SPARK-28618) SparkSQL Plan CollectLimitExec may cause OOM

2019-08-05 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28618: -- Attachment: image-2019-08-05-14-03-16-112.png > SparkSQL Plan CollectLimitExec may cause OOM >

[jira] [Updated] (SPARK-28618) SparkSQL Plan CollectLimitExec may cause OOM

2019-08-05 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28618: -- Attachment: image-2019-08-05-14-02-51-577.png > SparkSQL Plan CollectLimitExec may cause OOM >

[jira] [Created] (SPARK-28618) SparkSQL Plan CollectLimitExec may cause OOM

2019-08-05 Thread angerszhu (JIRA)
angerszhu created SPARK-28618: - Summary: SparkSQL Plan CollectLimitExec may cause OOM Key: SPARK-28618 URL: https://issues.apache.org/jira/browse/SPARK-28618 Project: Spark Issue Type: Wish

[jira] [Updated] (SPARK-28613) Spark SQL action collect just judge size of compressed RDD's size, not accurate enough

2019-08-04 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28613: -- Description: When we run action DataFrame.collect() , for the configuration 

[jira] [Updated] (SPARK-28613) Spark SQL action collect just judge size of compressed RDD's size, not accurate enough

2019-08-04 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28613: -- Description: When we run action DataFrame.collect() , for the configuration 

[jira] [Created] (SPARK-28613) Spark SQL action collect just judge size of compressed RDD's size, not accurate enough

2019-08-04 Thread angerszhu (JIRA)
angerszhu created SPARK-28613: - Summary: Spark SQL action collect just judge size of compressed RDD's size, not accurate enough Key: SPARK-28613 URL: https://issues.apache.org/jira/browse/SPARK-28613

[jira] [Commented] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-07-29 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16895737#comment-16895737 ] angerszhu commented on SPARK-27602: --- Seems in hive 1.2.1, we can't get true partition level stats too.

[jira] [Commented] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-07-29 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16895736#comment-16895736 ] angerszhu commented on SPARK-27602: --- [~lishuming] It's hard to combine several  partition's stats

[jira] [Updated] (SPARK-28551) CTAS SQL with LOCATION properties won't check location path, cause data under this location lose

2019-07-29 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28551: -- Description: When we run SQL like  {code:java} CRETE TABLE TBL LOCATION 'PATH_URI' AS SELECT

[jira] [Updated] (SPARK-28551) CTAS SQL with LOCATION properties won't check location path, cause data under this location lose

2019-07-29 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28551: -- Labels: SQL (was: ) > CTAS SQL with LOCATION properties won't check location path, cause data under

[jira] [Created] (SPARK-28551) CTAS SQL with LOCATION properties won't check location path, cause data under this location lose

2019-07-29 Thread angerszhu (JIRA)
angerszhu created SPARK-28551: - Summary: CTAS SQL with LOCATION properties won't check location path, cause data under this location lose Key: SPARK-28551 URL: https://issues.apache.org/jira/browse/SPARK-28551

[jira] [Commented] (SPARK-27442) ParquetFileFormat fails to read column named with invalid characters

2019-07-24 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16892366#comment-16892366 ] angerszhu commented on SPARK-27442: --- auto writing data, change format to orc. orc can support it  >

[jira] [Commented] (SPARK-27442) ParquetFileFormat fails to read column named with invalid characters

2019-07-24 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16892348#comment-16892348 ] angerszhu commented on SPARK-27442: --- Meet same problem, can't write parquet file with name such as

[jira] [Commented] (SPARK-28450) When scan hive data of a not existed partition, it return an error

2019-07-23 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16890713#comment-16890713 ] angerszhu commented on SPARK-28450: --- [~shivuson...@gmail.com] Just select a partition table with an

[jira] [Updated] (SPARK-28466) FileSystem closed error when to call Hive.moveFile

2019-07-21 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28466: -- Description: When we close a session of STS, if this session has done some SQL about insert, then

[jira] [Updated] (SPARK-28466) FileSystem closed error when to call Hive.moveFile

2019-07-21 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28466: -- Attachment: image-2019-07-22-09-58-55-107.png > FileSystem closed error when to call Hive.moveFile >

[jira] [Updated] (SPARK-28466) FileSystem closed error when to call Hive.moveFile

2019-07-21 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28466: -- Attachment: image-2019-07-22-09-58-19-023.png > FileSystem closed error when to call Hive.moveFile >

[jira] [Updated] (SPARK-28466) FileSystem closed error when to call Hive.moveFile

2019-07-21 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28466: -- Description: When we close a session of STS, if this session has done some SQL about insert, then

[jira] [Created] (SPARK-28466) FileSystem closed error when to call Hive.moveFile

2019-07-21 Thread angerszhu (JIRA)
angerszhu created SPARK-28466: - Summary: FileSystem closed error when to call Hive.moveFile Key: SPARK-28466 URL: https://issues.apache.org/jira/browse/SPARK-28466 Project: Spark Issue Type:

[jira] [Updated] (SPARK-28450) When scan hive data of a not existed partition, it return an error

2019-07-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28450: -- Attachment: image-2019-07-19-20-51-12-861.png > When scan hive data of a not existed partition, it

[jira] [Updated] (SPARK-28450) When scan hive data of a not existed partition, it return an error

2019-07-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28450: -- Description: When we select data of a un-existed hive partition table's partition, it will return

[jira] [Created] (SPARK-28450) When scan hive data of a not existed partition, it return an error

2019-07-19 Thread angerszhu (JIRA)
angerszhu created SPARK-28450: - Summary: When scan hive data of a not existed partition, it return an error Key: SPARK-28450 URL: https://issues.apache.org/jira/browse/SPARK-28450 Project: Spark

[jira] [Created] (SPARK-28419) A patch for SparkThriftServer support multi-tenant authentication

2019-07-17 Thread angerszhu (JIRA)
angerszhu created SPARK-28419: - Summary: A patch for SparkThriftServer support multi-tenant authentication Key: SPARK-28419 URL: https://issues.apache.org/jira/browse/SPARK-28419 Project: Spark

[jira] [Commented] (SPARK-28327) Spark SQL can't support union with left query have queryOrganization

2019-07-10 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16882581#comment-16882581 ] angerszhu commented on SPARK-28327: --- [~yumwang]Thank you for you Seems current SparkSQL's SQL

[jira] [Updated] (SPARK-28327) Spark SQL can't support union with left query have queryOrganization

2019-07-10 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28327: -- Description: Spark SQL can't support SQL like  {code:java} SELECT A FROM TABLE_1 LIMIT 1 UNION

[jira] [Created] (SPARK-28327) Spark SQL can't support union with left query have queryOrganization

2019-07-10 Thread angerszhu (JIRA)
angerszhu created SPARK-28327: - Summary: Spark SQL can't support union with left query have queryOrganization Key: SPARK-28327 URL: https://issues.apache.org/jira/browse/SPARK-28327 Project: Spark

[jira] [Updated] (SPARK-28311) Spark Thrift Server protocol version compatibility setup too late

2019-07-08 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28311: -- Description: {code:java} SessionHandle getSessionHandle(TOpenSessionReq req, TOpenSessionResp res)

[jira] [Created] (SPARK-28311) Spark Thrift Server protocol version compatibility setup too late

2019-07-08 Thread angerszhu (JIRA)
angerszhu created SPARK-28311: - Summary: Spark Thrift Server protocol version compatibility setup too late Key: SPARK-28311 URL: https://issues.apache.org/jira/browse/SPARK-28311 Project: Spark

[jira] [Created] (SPARK-28292) Enable inject user-defined Hint

2019-07-08 Thread angerszhu (JIRA)
angerszhu created SPARK-28292: - Summary: Enable inject user-defined Hint Key: SPARK-28292 URL: https://issues.apache.org/jira/browse/SPARK-28292 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-28227) Spark can’t support TRANSFORM with aggregation

2019-07-01 Thread angerszhu (JIRA)
angerszhu created SPARK-28227: - Summary: Spark can’t support TRANSFORM with aggregation Key: SPARK-28227 URL: https://issues.apache.org/jira/browse/SPARK-28227 Project: Spark Issue Type:

[jira] [Updated] (SPARK-28169) Spark can’t push down predicate for OR expression

2019-06-26 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28169: -- Description: Spark can't push down filter condition of Or: Such as if I have a table

[jira] [Created] (SPARK-28169) Spark can’t push down predicate for OR expression

2019-06-26 Thread angerszhu (JIRA)
angerszhu created SPARK-28169: - Summary: Spark can’t push down predicate for OR expression Key: SPARK-28169 URL: https://issues.apache.org/jira/browse/SPARK-28169 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Attachment: image-2019-06-20-11-51-06-889.png > Spark SQL add jar with wrong hdfs path, SparkContext

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Description: When we use SparkSQL, about add jar command, if we add a wrong path of HDFS such as

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Attachment: image-2019-06-20-11-50-36-418.png > Spark SQL add jar with wrong hdfs path, SparkContext

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Attachment: image-2019-06-20-11-49-13-691.png > Spark SQL add jar with wrong hdfs path, SparkContext

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Description: When we use SparkSQL, about add jar command, if we add a wrong path of HDFS such as

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Description: When we use SparkSQL, about add jar command, if we add a wrong path of HDFS such as

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Description: When we use SparkSQL, about add jar command, if we add a wrong path of HDFS such as

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Attachment: image-2019-06-19-21-23-22-061.png > Spark SQL add jar with wrong hdfs path, SparkContext

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Description: When we use SparkSQL, about add jar command, if we add a wrong path of HDFS such as

[jira] [Updated] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-28106: -- Description: When we use SparkSQL, about add jar command, if we add a wrong path of HDFS such as

[jira] [Created] (SPARK-28106) Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed

2019-06-19 Thread angerszhu (JIRA)
angerszhu created SPARK-28106: - Summary: Spark SQL add jar with wrong hdfs path, SparkContext still add it to jar path ,and cause Task Failed Key: SPARK-28106 URL: https://issues.apache.org/jira/browse/SPARK-28106

[jira] [Created] (SPARK-27916) SparkThreatServer memory leak

2019-06-01 Thread angerszhu (JIRA)
angerszhu created SPARK-27916: - Summary: SparkThreatServer memory leak Key: SPARK-27916 URL: https://issues.apache.org/jira/browse/SPARK-27916 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-05-04 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16833209#comment-16833209 ] angerszhu commented on SPARK-27602: --- [~hyukjin.kwon] The first step result is just like this.  The

[jira] [Updated] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-05-04 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-27602: -- Attachment: image-2019-05-05-11-46-41-240.png > SparkSQL CBO can't get true size of partition table

[jira] [Commented] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-05-04 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16833194#comment-16833194 ] angerszhu commented on SPARK-27602: --- [~hyukjin.kwon] Extract the framework of CBO how to calculate

[jira] [Commented] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-04-30 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16830836#comment-16830836 ] angerszhu commented on SPARK-27602: --- Want to do this need to change the calculate model. I am tring to

[jira] [Created] (SPARK-27602) SparkSQL CBO can't get true size of partition table after partition pruning

2019-04-29 Thread angerszhu (JIRA)
angerszhu created SPARK-27602: - Summary: SparkSQL CBO can't get true size of partition table after partition pruning Key: SPARK-27602 URL: https://issues.apache.org/jira/browse/SPARK-27602 Project: Spark

[jira] [Commented] (SPARK-13263) SQL generation support for tablesample

2019-04-22 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16822891#comment-16822891 ] angerszhu commented on SPARK-13263: --- [~Tagar]  I make some change in Spark SQL's ASTBuild, can

[jira] [Created] (SPARK-27185) mapPartition to replace map to speedUp Dataset's toLocalIterator process

2019-03-18 Thread angerszhu (JIRA)
angerszhu created SPARK-27185: - Summary: mapPartition to replace map to speedUp Dataset's toLocalIterator process Key: SPARK-27185 URL: https://issues.apache.org/jira/browse/SPARK-27185 Project: Spark

[jira] [Commented] (SPARK-21918) HiveClient shouldn't share Hive object between different thread

2019-02-21 Thread angerszhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16773890#comment-16773890 ] angerszhu commented on SPARK-21918: --- Come back boy, :([~huLiu] > HiveClient shouldn't share Hive

<    5   6   7   8   9   10