[jira] [Commented] (SPARK-34540) Add convert_dtypes to the DataFrameLike protocol

2021-02-25 Thread Rafal Wojdyla (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290893#comment-17290893 ] Rafal Wojdyla commented on SPARK-34540: --- This is related to https://issues.apache.

[jira] [Commented] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290892#comment-17290892 ] Apache Spark commented on SPARK-34538: -- User 'ulysses-you' has created a pull reque

[jira] [Assigned] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34538: Assignee: (was: Apache Spark) > Hive Metastore support filter by not-in > ---

[jira] [Assigned] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34538: Assignee: Apache Spark > Hive Metastore support filter by not-in > --

[jira] [Created] (SPARK-34540) Add convert_dtypes to the DataFrameLike protocol

2021-02-25 Thread Rafal Wojdyla (Jira)
Rafal Wojdyla created SPARK-34540: - Summary: Add convert_dtypes to the DataFrameLike protocol Key: SPARK-34540 URL: https://issues.apache.org/jira/browse/SPARK-34540 Project: Spark Issue Type

[jira] [Commented] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290890#comment-17290890 ] Apache Spark commented on SPARK-34538: -- User 'ulysses-you' has created a pull reque

[jira] [Created] (SPARK-34539) Zinc standalone server is useless after scala-maven-plugin 4.x

2021-02-25 Thread Yikun Jiang (Jira)
Yikun Jiang created SPARK-34539: --- Summary: Zinc standalone server is useless after scala-maven-plugin 4.x Key: SPARK-34539 URL: https://issues.apache.org/jira/browse/SPARK-34539 Project: Spark

[jira] [Assigned] (SPARK-33537) Hive Metastore filter pushdown improvement

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33537: Assignee: Apache Spark (was: Yuming Wang) > Hive Metastore filter pushdown improvement >

[jira] [Assigned] (SPARK-33537) Hive Metastore filter pushdown improvement

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-33537: Assignee: Yuming Wang (was: Apache Spark) > Hive Metastore filter pushdown improvement >

[jira] [Commented] (SPARK-33537) Hive Metastore filter pushdown improvement

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290888#comment-17290888 ] Apache Spark commented on SPARK-33537: -- User 'ulysses-you' has created a pull reque

[jira] [Updated] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread ulysses you (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ulysses you updated SPARK-34538: Description: `NOT IN` is a useful condition to prune partition, it would be better to support it.

[jira] [Updated] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread ulysses you (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ulysses you updated SPARK-34538: Description: `NOT IN` is a useful condition to prune partition, it would be better to support it.

[jira] [Updated] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread ulysses you (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ulysses you updated SPARK-34538: Description: `NOT IN` is a useful condition to prune partition, it would be better to push it to H

[jira] [Updated] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread ulysses you (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ulysses you updated SPARK-34538: Parent: SPARK-33537 Issue Type: Sub-task (was: Improvement) > Hive Metastore support filt

[jira] [Created] (SPARK-34538) Hive Metastore support filter by not-in

2021-02-25 Thread ulysses you (Jira)
ulysses you created SPARK-34538: --- Summary: Hive Metastore support filter by not-in Key: SPARK-34538 URL: https://issues.apache.org/jira/browse/SPARK-34538 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-34537) Repartition miss/duplicated data

2021-02-25 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-34537: -- Description: We have a SQL {code:java} INSERT OVERWRITE TABLE t1 SELECT /*+ repartition(300) */ * fro

[jira] [Updated] (SPARK-34537) Repartition miss/duplicated data

2021-02-25 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-34537: -- Attachment: image-2021-02-25-19-47-10-005.png > Repartition miss/duplicated data > ---

[jira] [Updated] (SPARK-34537) Repartition miss/duplicated data

2021-02-25 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-34537: -- Attachment: image-2021-02-25-19-46-52-809.png > Repartition miss/duplicated data > ---

[jira] [Created] (SPARK-34537) Repartition miss/duplicated data

2021-02-25 Thread angerszhu (Jira)
angerszhu created SPARK-34537: - Summary: Repartition miss/duplicated data Key: SPARK-34537 URL: https://issues.apache.org/jira/browse/SPARK-34537 Project: Spark Issue Type: Bug Componen

[jira] [Updated] (SPARK-34537) Repartition miss/duplicated data

2021-02-25 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-34537: -- Attachment: image-2021-02-25-19-43-49-687.png > Repartition miss/duplicated data > ---

[jira] [Updated] (SPARK-34537) Repartition miss/duplicated data

2021-02-25 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-34537: -- Description: We have a SQL {code:java} INSERT OVERWRITE TABLE t1 SELECT /*+ repartition(300) */ * fro

[jira] [Commented] (SPARK-30228) Update zstd-jni to 1.4.4-3

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290860#comment-17290860 ] Apache Spark commented on SPARK-30228: -- User 'seayoun' has created a pull request f

[jira] [Commented] (SPARK-30228) Update zstd-jni to 1.4.4-3

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290859#comment-17290859 ] Apache Spark commented on SPARK-30228: -- User 'seayoun' has created a pull request f

[jira] [Resolved] (SPARK-34536) zstd-jni lead to read less shuffle data

2021-02-25 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-34536. -- Resolution: Duplicate > zstd-jni lead to read less shuffle data >

[jira] [Updated] (SPARK-34528) View result are not consistent after a modification inside a struct of the table

2021-02-25 Thread Thomas Prelle (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Prelle updated SPARK-34528: -- Affects Version/s: 3.1.2 3.0.2 > View result are not consistent after a

[jira] [Updated] (SPARK-34536) zstd-jni lead to read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Summary: zstd-jni lead to read less shuffle data (was: zstd-jni lead read less shuffle data) > zstd-

[jira] [Assigned] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34536: Assignee: (was: Apache Spark) > zstd-jni lead read less shuffle data > --

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Description: h2. BackGround I find a rare case which lead some partitions read less data when use zst

[jira] [Assigned] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34536: Assignee: Apache Spark > zstd-jni lead read less shuffle data > -

[jira] [Commented] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290820#comment-17290820 ] Apache Spark commented on SPARK-34536: -- User 'seayoun' has created a pull request f

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Description: h2. BackGround I find a rare case which lead some partitions read less data when use zst

[jira] [Assigned] (SPARK-34436) DPP support LIKE ANY/ALL

2021-02-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-34436: --- Assignee: Yuming Wang > DPP support LIKE ANY/ALL > > >

[jira] [Resolved] (SPARK-34436) DPP support LIKE ANY/ALL

2021-02-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-34436. - Fix Version/s: 3.1.2 3.2.0 Resolution: Fixed Issue resolved by pull re

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Description: h2. BackGround I find a rare case which lead some partitions read less data when use zst

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Description: h2. BackGround I find a rare case which lead some partitions read less data when use zst

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Description: h2. BackGround I find a rare case which lead some partitions read less data when use zst

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Description: h2. BackGround I find a rare case which lead some partitions read less data when use zst

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Attachment: image-2021-02-25-17-51-49-998.png > zstd-jni lead read less shuffle data > ---

[jira] [Updated] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] haiyangyu updated SPARK-34536: -- Attachment: image-2021-02-25-17-50-49-427.png > zstd-jni lead read less shuffle data > ---

[jira] [Created] (SPARK-34536) zstd-jni lead read less shuffle data

2021-02-25 Thread haiyangyu (Jira)
haiyangyu created SPARK-34536: - Summary: zstd-jni lead read less shuffle data Key: SPARK-34536 URL: https://issues.apache.org/jira/browse/SPARK-34536 Project: Spark Issue Type: Bug Comp

[jira] [Assigned] (SPARK-34518) Rename `AlterTableRecoverPartitionsCommand` to `RepairTableCommand`

2021-02-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-34518: --- Assignee: Maxim Gekk > Rename `AlterTableRecoverPartitionsCommand` to `RepairTableCommand`

[jira] [Resolved] (SPARK-34518) Rename `AlterTableRecoverPartitionsCommand` to `RepairTableCommand`

2021-02-25 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-34518. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31635 [https://gith

[jira] [Commented] (SPARK-34198) Add RocksDB StateStore as external module

2021-02-25 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290771#comment-17290771 ] L. C. Hsieh commented on SPARK-34198: - FYI, I ran a benchmark against two open sourc

[jira] [Commented] (SPARK-34535) Cleanup unused symbol in Orc related code

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290760#comment-17290760 ] Apache Spark commented on SPARK-34535: -- User 'LuciferYang' has created a pull reque

[jira] [Assigned] (SPARK-34535) Cleanup unused symbol in Orc related code

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34535: Assignee: Apache Spark > Cleanup unused symbol in Orc related code >

[jira] [Commented] (SPARK-34535) Cleanup unused symbol in Orc related code

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17290759#comment-17290759 ] Apache Spark commented on SPARK-34535: -- User 'LuciferYang' has created a pull reque

[jira] [Assigned] (SPARK-34535) Cleanup unused symbol in Orc related code

2021-02-25 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-34535: Assignee: (was: Apache Spark) > Cleanup unused symbol in Orc related code > -

<    1   2