[jira] [Commented] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187506#comment-17187506 ] Apache Spark commented on SPARK-32753: -- User 'manuzhang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32753: Assignee: (was: Apache Spark) > Deduplicating and repartitioning the same column

[jira] [Assigned] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32753: Assignee: Apache Spark > Deduplicating and repartitioning the same column create

[jira] [Commented] (SPARK-32750) Add code-gen for SortAggregateExec

2020-08-31 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187495#comment-17187495 ] Takeshi Yamamuro commented on SPARK-32750: -- Just FYI: 

[jira] [Updated] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-08-31 Thread Manu Zhang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manu Zhang updated SPARK-32753: --- Description: To reproduce: {code:java}

[jira] [Created] (SPARK-32753) Deduplicating and repartitioning the same column create duplicate rows with AQE

2020-08-31 Thread Manu Zhang (Jira)
Manu Zhang created SPARK-32753: -- Summary: Deduplicating and repartitioning the same column create duplicate rows with AQE Key: SPARK-32753 URL: https://issues.apache.org/jira/browse/SPARK-32753 Project:

[jira] [Resolved] (SPARK-32740) Refactor common partitioning/distribution logic to BaseAggregateExec

2020-08-31 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-32740. -- Fix Version/s: 3.1.0 Assignee: Cheng Su Resolution: Fixed Resolved by 

[jira] [Commented] (SPARK-32714) Port pyspark-stubs

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187486#comment-17187486 ] Apache Spark commented on SPARK-32714: -- User 'zero323' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32714) Port pyspark-stubs

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32714: Assignee: Maciej Szymkiewicz (was: Apache Spark) > Port pyspark-stubs >

[jira] [Assigned] (SPARK-32714) Port pyspark-stubs

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32714: Assignee: Apache Spark (was: Maciej Szymkiewicz) > Port pyspark-stubs >

[jira] [Assigned] (SPARK-32714) Port pyspark-stubs

2020-08-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-32714: Assignee: Maciej Szymkiewicz > Port pyspark-stubs > -- > >

[jira] [Issue Comment Deleted] (SPARK-32681) PySpark type hints support

2020-08-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-32681: - Comment: was deleted (was: User 'zero323' has created a pull request for this issue:

[jira] [Commented] (SPARK-32714) Port pyspark-stubs

2020-08-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187485#comment-17187485 ] Hyukjin Kwon commented on SPARK-32714: -- https://github.com/apache/spark/pull/29591 > Port

[jira] [Commented] (SPARK-32691) Test org.apache.spark.DistributedSuite failed on arm64 jenkins

2020-08-31 Thread huangtianhua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187478#comment-17187478 ] huangtianhua commented on SPARK-32691: -- [~dongjoon] Seems it doesn't with 'with replication as

[jira] [Commented] (SPARK-32747) Deduplicate configuration set/unset in test_sparkSQL_arrow.R

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187473#comment-17187473 ] Apache Spark commented on SPARK-32747: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-32747) Deduplicate configuration set/unset in test_sparkSQL_arrow.R

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32747: Assignee: (was: Apache Spark) > Deduplicate configuration set/unset in

[jira] [Assigned] (SPARK-32747) Deduplicate configuration set/unset in test_sparkSQL_arrow.R

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32747: Assignee: Apache Spark > Deduplicate configuration set/unset in test_sparkSQL_arrow.R >

[jira] [Commented] (SPARK-32747) Deduplicate configuration set/unset in test_sparkSQL_arrow.R

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187475#comment-17187475 ] Apache Spark commented on SPARK-32747: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-32752) Alias breaks for interval typed literals

2020-08-31 Thread Kent Yao (Jira)
Kent Yao created SPARK-32752: Summary: Alias breaks for interval typed literals Key: SPARK-32752 URL: https://issues.apache.org/jira/browse/SPARK-32752 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-32681) PySpark type hints support

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32681: Assignee: Maciej Szymkiewicz (was: Apache Spark) > PySpark type hints support >

[jira] [Commented] (SPARK-32681) PySpark type hints support

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187464#comment-17187464 ] Apache Spark commented on SPARK-32681: -- User 'zero323' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32681) PySpark type hints support

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32681: Assignee: Apache Spark (was: Maciej Szymkiewicz) > PySpark type hints support >

[jira] [Commented] (SPARK-32138) Drop Python 2, 3.4 and 3.5 in codes and documentation

2020-08-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187458#comment-17187458 ] Apache Spark commented on SPARK-32138: -- User 'zero323' has created a pull request for this issue:

[jira] [Commented] (SPARK-32744) request executor cores with decimal when spark on k8s

2020-08-31 Thread Yu Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187457#comment-17187457 ] Yu Wang commented on SPARK-32744: - !screenshot-2.png! [~hyukjin.kwon] Thank you for your answer!, I

[jira] [Updated] (SPARK-32744) request executor cores with decimal when spark on k8s

2020-08-31 Thread Yu Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-32744: Attachment: screenshot-3.png > request executor cores with decimal when spark on k8s >

[jira] [Updated] (SPARK-32744) request executor cores with decimal when spark on k8s

2020-08-31 Thread Yu Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-32744: Attachment: screenshot-2.png > request executor cores with decimal when spark on k8s >

[jira] [Resolved] (SPARK-32495) Update jackson-databind versions to fix various vulnerabilities.

2020-08-31 Thread Prashant Sharma (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prashant Sharma resolved SPARK-32495. - Resolution: Won't Fix Resolving it as won't fix for now, as most of us feel the

[jira] [Resolved] (SPARK-32720) Spark 3 Fails to Cast DateType to StringType when comparing result of Max

2020-08-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32720. -- Resolution: Cannot Reproduce > Spark 3 Fails to Cast DateType to StringType when comparing

[jira] [Resolved] (SPARK-32724) java.io.IOException: Stream is corrupted when tried to inner join 4 huge tables. Currently using pyspark version 2.4.0-cdh6.3.1

2020-08-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32724. -- Resolution: Invalid > java.io.IOException: Stream is corrupted when tried to inner join 4

[jira] [Commented] (SPARK-32724) java.io.IOException: Stream is corrupted when tried to inner join 4 huge tables. Currently using pyspark version 2.4.0-cdh6.3.1

2020-08-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17187449#comment-17187449 ] Hyukjin Kwon commented on SPARK-32724: -- Let's ask questions to the mailing list
