[jira] [Commented] (SPARK-42346) distinct(count colname) with UNION ALL causes query analyzer bug

2023-02-06 Thread Peter Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685124#comment-17685124 ] Peter Toth commented on SPARK-42346: [~ritikam], please use the Pyspark repro in description or add

[jira] [Updated] (SPARK-42017) df["bad_key"] does not raise AnalysisException

2023-02-06 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin updated SPARK-42017: -- Summary: df["bad_key"] does not raise AnalysisException (was: Different error type

[jira] [Assigned] (SPARK-42368) Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42368: - Assignee: Dongjoon Hyun > Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

[jira] [Resolved] (SPARK-42368) Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42368. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39921

[jira] [Commented] (SPARK-41708) Pull v1write information to WriteFiles

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685103#comment-17685103 ] Apache Spark commented on SPARK-41708: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-39851) Improve join stats estimation if one side can keep uniqueness

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-39851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685104#comment-17685104 ] Apache Spark commented on SPARK-39851: -- User 'wankunde' has created a pull request for this issue:

[jira] [Commented] (SPARK-41708) Pull v1write information to WriteFiles

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685102#comment-17685102 ] Apache Spark commented on SPARK-41708: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-42368) Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42368: Assignee: Apache Spark > Ignore SparkRemoteFileTest K8s IT test case in GitHub Action >

[jira] [Commented] (SPARK-42368) Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685101#comment-17685101 ] Apache Spark commented on SPARK-42368: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-42368) Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42368: Assignee: (was: Apache Spark) > Ignore SparkRemoteFileTest K8s IT test case in

[jira] [Resolved] (SPARK-41962) Update the import order of scala package in class SpecificParquetRecordReaderBase

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-41962. -- Fix Version/s: 3.2.4 3.3.2 Resolution: Fixed Issue resolved by pull

[jira] [Assigned] (SPARK-41962) Update the import order of scala package in class SpecificParquetRecordReaderBase

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-41962: Assignee: shuyouZZ > Update the import order of scala package in class >

[jira] [Resolved] (SPARK-42306) Assign name to _LEGACY_ERROR_TEMP_1317

2023-02-06 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-42306. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39877

[jira] [Assigned] (SPARK-42306) Assign name to _LEGACY_ERROR_TEMP_1317

2023-02-06 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk reassigned SPARK-42306: Assignee: Haejoon Lee > Assign name to _LEGACY_ERROR_TEMP_1317 >

[jira] [Updated] (SPARK-42352) Upgrade maven to 3.8.7

2023-02-06 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-42352: - Description: [https://maven.apache.org/docs/3.8.7/release-notes.html]   was:

[jira] [Updated] (SPARK-42352) Upgrade maven to 3.8.7

2023-02-06 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-42352: - Summary: Upgrade maven to 3.8.7 (was: Upgrade maven to 3.9.0) > Upgrade maven to 3.8.7 >

[jira] [Created] (SPARK-42368) Ignore SparkRemoteFileTest K8s IT test case in GitHub Action

2023-02-06 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-42368: - Summary: Ignore SparkRemoteFileTest K8s IT test case in GitHub Action Key: SPARK-42368 URL: https://issues.apache.org/jira/browse/SPARK-42368 Project: Spark

[jira] [Resolved] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-41612. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39919

[jira] [Assigned] (SPARK-41600) Support Catalog.cacheTable

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-41600: Assignee: Hyukjin Kwon > Support Catalog.cacheTable > -- > >

[jira] [Assigned] (SPARK-41623) Support Catalog.uncacheTable

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-41623: Assignee: Hyukjin Kwon > Support Catalog.uncacheTable > > >

[jira] [Assigned] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-41612: Assignee: Hyukjin Kwon > Support Catalog.isCached > > >

[jira] [Resolved] (SPARK-41623) Support Catalog.uncacheTable

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-41623. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39919

[jira] [Resolved] (SPARK-41600) Support Catalog.cacheTable

2023-02-06 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-41600. -- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39919

[jira] [Created] (SPARK-42367) DataFrame.drop could handle duplicated columns

2023-02-06 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-42367: - Summary: DataFrame.drop could handle duplicated columns Key: SPARK-42367 URL: https://issues.apache.org/jira/browse/SPARK-42367 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-42364) Split 'pyspark.pandas.tests.test_dataframe'

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42364. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39915

[jira] [Assigned] (SPARK-42364) Split 'pyspark.pandas.tests.test_dataframe'

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42364: - Assignee: Ruifeng Zheng > Split 'pyspark.pandas.tests.test_dataframe' >

[jira] [Assigned] (SPARK-42363) Remove session.register_udf

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42363: - Assignee: Hyukjin Kwon > Remove session.register_udf > --- > >

[jira] [Resolved] (SPARK-42363) Remove session.register_udf

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42363. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39916

[jira] [Resolved] (SPARK-42038) SPJ: Support partially clustered distribution

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42038. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39633

[jira] [Assigned] (SPARK-42038) SPJ: Support partially clustered distribution

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42038: - Assignee: Chao Sun > SPJ: Support partially clustered distribution >

[jira] [Updated] (SPARK-42335) Pass the comment option through to univocity if users set it explicitly in CSV dataSource

2023-02-06 Thread Wei Guo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei Guo updated SPARK-42335: Description: In PR [https://github.com/apache/spark/pull/29516], in order to fix some bugs,

[jira] [Updated] (SPARK-42335) Pass the comment option through to univocity if users set it explicitly in CSV dataSource

2023-02-06 Thread Wei Guo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei Guo updated SPARK-42335: Description: In PR [https://github.com/apache/spark/pull/29516], in order to fix some bugs,

[jira] [Updated] (SPARK-42335) Pass the comment option through to univocity if users set it explicitly in CSV dataSource

2023-02-06 Thread Wei Guo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei Guo updated SPARK-42335: Description: In PR [https://github.com/apache/spark/pull/29516], in order to fix some bugs,

[jira] [Updated] (SPARK-42335) Pass the comment option through to univocity if users set it explicitly in CSV dataSource

2023-02-06 Thread Wei Guo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wei Guo updated SPARK-42335: Description: In PR [https://github.com/apache/spark/pull/29516], in order to fix some bugs,

[jira] [Resolved] (SPARK-42354) Upgrade Jackson to 2.14.2

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42354. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39898

[jira] [Assigned] (SPARK-42354) Upgrade Jackson to 2.14.2

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42354: - Assignee: Yang Jie > Upgrade Jackson to 2.14.2 > - > >

[jira] [Assigned] (SPARK-41716) Factor pyspark.sql.connect.Catalog._catalog_to_pandas to client.py

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41716: Assignee: (was: Apache Spark) > Factor

[jira] [Assigned] (SPARK-41716) Factor pyspark.sql.connect.Catalog._catalog_to_pandas to client.py

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41716: Assignee: Apache Spark > Factor pyspark.sql.connect.Catalog._catalog_to_pandas to

[jira] [Commented] (SPARK-41716) Factor pyspark.sql.connect.Catalog._catalog_to_pandas to client.py

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685048#comment-17685048 ] Apache Spark commented on SPARK-41716: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-42362) Upgrade kubernetes-client from 6.4.0 to 6.4.1

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42362: - Assignee: Bjørn Jørgensen > Upgrade kubernetes-client from 6.4.0 to 6.4.1 >

[jira] [Resolved] (SPARK-42362) Upgrade kubernetes-client from 6.4.0 to 6.4.1

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42362. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39912

[jira] [Commented] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685032#comment-17685032 ] Apache Spark commented on SPARK-41612: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-41623) Support Catalog.uncacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41623: Assignee: (was: Apache Spark) > Support Catalog.uncacheTable >

[jira] [Assigned] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41612: Assignee: (was: Apache Spark) > Support Catalog.isCached >

[jira] [Commented] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685030#comment-17685030 ] Apache Spark commented on SPARK-41612: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-41623) Support Catalog.uncacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685028#comment-17685028 ] Apache Spark commented on SPARK-41623: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685031#comment-17685031 ] Apache Spark commented on SPARK-41612: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-41612) Support Catalog.isCached

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41612: Assignee: Apache Spark > Support Catalog.isCached > > >

[jira] [Assigned] (SPARK-41623) Support Catalog.uncacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41623: Assignee: Apache Spark > Support Catalog.uncacheTable > > >

[jira] [Commented] (SPARK-41623) Support Catalog.uncacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685027#comment-17685027 ] Apache Spark commented on SPARK-41623: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-41600) Support Catalog.cacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685024#comment-17685024 ] Apache Spark commented on SPARK-41600: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-41600) Support Catalog.cacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41600: Assignee: Apache Spark > Support Catalog.cacheTable > -- > >

[jira] [Commented] (SPARK-42366) Log shuffle data corruption diagnose cause

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685025#comment-17685025 ] Apache Spark commented on SPARK-42366: -- User 'cxzl25' has created a pull request for this issue:

[jira] [Assigned] (SPARK-42366) Log shuffle data corruption diagnose cause

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42366: Assignee: (was: Apache Spark) > Log shuffle data corruption diagnose cause >

[jira] [Assigned] (SPARK-42366) Log shuffle data corruption diagnose cause

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42366: Assignee: Apache Spark > Log shuffle data corruption diagnose cause >

[jira] [Commented] (SPARK-41600) Support Catalog.cacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685026#comment-17685026 ] Apache Spark commented on SPARK-41600: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-41600) Support Catalog.cacheTable

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-41600: Assignee: (was: Apache Spark) > Support Catalog.cacheTable >

[jira] [Updated] (SPARK-42366) Log shuffle data corruption diagnose cause

2023-02-06 Thread dzcxzl (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dzcxzl updated SPARK-42366: --- Summary: Log shuffle data corruption diagnose cause (was: Log output shuffle data corruption diagnose

[jira] [Updated] (SPARK-42366) Log output shuffle data corruption diagnose cause

2023-02-06 Thread dzcxzl (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dzcxzl updated SPARK-42366: --- Summary: Log output shuffle data corruption diagnose cause (was: Log output shuffle data corruption

[jira] [Created] (SPARK-42366) Log output shuffle data corruption diagnose causes

2023-02-06 Thread dzcxzl (Jira)
dzcxzl created SPARK-42366: -- Summary: Log output shuffle data corruption diagnose causes Key: SPARK-42366 URL: https://issues.apache.org/jira/browse/SPARK-42366 Project: Spark Issue Type:

[jira] [Updated] (SPARK-42352) Upgrade maven to 3.9.0

2023-02-06 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-42352: - Description: [https://maven.apache.org/docs/3.8.7/release-notes.html]   change to upgrade 3.9.0   

[jira] [Updated] (SPARK-42352) Upgrade maven to 3.9.0

2023-02-06 Thread Yang Jie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yang Jie updated SPARK-42352: - Summary: Upgrade maven to 3.9.0 (was: Upgrade maven to 3.8.7) > Upgrade maven to 3.9.0 >

[jira] [Assigned] (SPARK-42365) Split 'pyspark.pandas.tests.test_ops_on_diff_frames'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42365: Assignee: Apache Spark > Split 'pyspark.pandas.tests.test_ops_on_diff_frames' >

[jira] [Commented] (SPARK-42365) Split 'pyspark.pandas.tests.test_ops_on_diff_frames'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685016#comment-17685016 ] Apache Spark commented on SPARK-42365: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-42365) Split 'pyspark.pandas.tests.test_ops_on_diff_frames'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42365: Assignee: (was: Apache Spark) > Split 'pyspark.pandas.tests.test_ops_on_diff_frames'

[jira] [Commented] (SPARK-42365) Split 'pyspark.pandas.tests.test_ops_on_diff_frames'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685015#comment-17685015 ] Apache Spark commented on SPARK-42365: -- User 'zhengruifeng' has created a pull request for this

[jira] [Created] (SPARK-42365) Split 'pyspark.pandas.tests.test_ops_on_diff_frames'

2023-02-06 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-42365: - Summary: Split 'pyspark.pandas.tests.test_ops_on_diff_frames' Key: SPARK-42365 URL: https://issues.apache.org/jira/browse/SPARK-42365 Project: Spark Issue

[jira] [Assigned] (SPARK-42363) Remove session.register_udf

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42363: Assignee: (was: Apache Spark) > Remove session.register_udf >

[jira] [Assigned] (SPARK-40532) Python version for UDF should follow the servers version

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40532: Assignee: (was: Apache Spark) > Python version for UDF should follow the servers

[jira] [Commented] (SPARK-42363) Remove session.register_udf

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685005#comment-17685005 ] Apache Spark commented on SPARK-42363: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-42363) Remove session.register_udf

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42363: Assignee: Apache Spark > Remove session.register_udf > --- > >

[jira] [Commented] (SPARK-40532) Python version for UDF should follow the servers version

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685004#comment-17685004 ] Apache Spark commented on SPARK-40532: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-40532) Python version for UDF should follow the servers version

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-40532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-40532: Assignee: Apache Spark > Python version for UDF should follow the servers version >

[jira] [Assigned] (SPARK-42364) Split 'pyspark.pandas.tests.test_dataframe'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42364: Assignee: Apache Spark > Split 'pyspark.pandas.tests.test_dataframe' >

[jira] [Commented] (SPARK-42364) Split 'pyspark.pandas.tests.test_dataframe'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685003#comment-17685003 ] Apache Spark commented on SPARK-42364: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-42364) Split 'pyspark.pandas.tests.test_dataframe'

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42364: Assignee: (was: Apache Spark) > Split 'pyspark.pandas.tests.test_dataframe' >

[jira] [Created] (SPARK-42364) Split 'pyspark.pandas.tests.test_dataframe'

2023-02-06 Thread Ruifeng Zheng (Jira)
Ruifeng Zheng created SPARK-42364: - Summary: Split 'pyspark.pandas.tests.test_dataframe' Key: SPARK-42364 URL: https://issues.apache.org/jira/browse/SPARK-42364 Project: Spark Issue Type:

[jira] [Created] (SPARK-42363) Remove session.register_udf

2023-02-06 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-42363: Summary: Remove session.register_udf Key: SPARK-42363 URL: https://issues.apache.org/jira/browse/SPARK-42363 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-42346) distinct(count colname) with UNION ALL causes query analyzer bug

2023-02-06 Thread Ritika Maheshwari (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684992#comment-17684992 ] Ritika Maheshwari commented on SPARK-42346: --- I have Spark 3.3.0 and I do not have 39887 fix .

[jira] [Commented] (SPARK-42268) Add UserDefinedType in protos

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684990#comment-17684990 ] Apache Spark commented on SPARK-42268: -- User 'zhengruifeng' has created a pull request for this

[jira] [Commented] (SPARK-42268) Add UserDefinedType in protos

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684989#comment-17684989 ] Apache Spark commented on SPARK-42268: -- User 'zhengruifeng' has created a pull request for this

[jira] [Commented] (SPARK-42362) Upgrade kubernetes-client from 6.4.0 to 6.4.1

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684952#comment-17684952 ] Apache Spark commented on SPARK-42362: -- User 'bjornjorgensen' has created a pull request for this

[jira] [Assigned] (SPARK-42362) Upgrade kubernetes-client from 6.4.0 to 6.4.1

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42362: Assignee: (was: Apache Spark) > Upgrade kubernetes-client from 6.4.0 to 6.4.1 >

[jira] [Assigned] (SPARK-42362) Upgrade kubernetes-client from 6.4.0 to 6.4.1

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42362: Assignee: Apache Spark > Upgrade kubernetes-client from 6.4.0 to 6.4.1 >

[jira] [Created] (SPARK-42362) Upgrade kubernetes-client from 6.4.0 to 6.4.1

2023-02-06 Thread Jira
Bjørn Jørgensen created SPARK-42362: --- Summary: Upgrade kubernetes-client from 6.4.0 to 6.4.1 Key: SPARK-42362 URL: https://issues.apache.org/jira/browse/SPARK-42362 Project: Spark Issue

[jira] [Created] (SPARK-42361) Add an option to use external storage to distribute JAR set in cluster mode on Kube

2023-02-06 Thread Holden Karau (Jira)
Holden Karau created SPARK-42361: Summary: Add an option to use external storage to distribute JAR set in cluster mode on Kube Key: SPARK-42361 URL: https://issues.apache.org/jira/browse/SPARK-42361

[jira] [Commented] (SPARK-36478) Removes outer join if all grouping and aggregate expressions are from the streamed side

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684906#comment-17684906 ] Apache Spark commented on SPARK-36478: -- User 'clubycoder' has created a pull request for this

[jira] [Comment Edited] (SPARK-41793) Incorrect result for window frames defined by a range clause on large decimals

2023-02-06 Thread Gera Shegalov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684903#comment-17684903 ] Gera Shegalov edited comment on SPARK-41793 at 2/6/23 7:38 PM: --- if the

[jira] [Commented] (SPARK-41793) Incorrect result for window frames defined by a range clause on large decimals

2023-02-06 Thread Gera Shegalov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684903#comment-17684903 ] Gera Shegalov commented on SPARK-41793: --- if the consensus is that it's not a correctness bug in

[jira] [Commented] (SPARK-24942) Improve cluster resource management with jobs containing barrier stage

2023-02-06 Thread manpreet singh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684895#comment-17684895 ] manpreet singh commented on SPARK-24942: [~gurwls223]  Any updates on this?  It seems like we

[jira] [Assigned] (SPARK-42357) Log `exitCode` when `SparkContext.stop` starts

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-42357: - Assignee: Dongjoon Hyun > Log `exitCode` when `SparkContext.stop` starts >

[jira] [Resolved] (SPARK-42357) Log `exitCode` when `SparkContext.stop` starts

2023-02-06 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-42357. --- Fix Version/s: 3.4.0 Resolution: Fixed Issue resolved by pull request 39900

[jira] [Commented] (SPARK-42337) Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684856#comment-17684856 ] Apache Spark commented on SPARK-42337: -- User 'allisonwang-db' has created a pull request for this

[jira] [Assigned] (SPARK-42337) Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42337: Assignee: Apache Spark > Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT >

[jira] [Assigned] (SPARK-42337) Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-42337: Assignee: (was: Apache Spark) > Add error class

[jira] [Commented] (SPARK-42337) Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684855#comment-17684855 ] Apache Spark commented on SPARK-42337: -- User 'allisonwang-db' has created a pull request for this

[jira] [Commented] (SPARK-42287) Optimize the packaging strategy of connect client module

2023-02-06 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684849#comment-17684849 ] Apache Spark commented on SPARK-42287: -- User 'zhenlineo' has created a pull request for this issue:

[jira] [Updated] (SPARK-42337) Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT

2023-02-06 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-42337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-42337: - Summary: Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT (was: Add the new error

[jira] [Updated] (SPARK-41470) SPJ: Spark shouldn't assume InternalRow implements equals and hashCode

2023-02-06 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun updated SPARK-41470: - Fix Version/s: 3.4.0 (was: 3.5.0) > SPJ: Spark shouldn't assume InternalRow

[jira] [Assigned] (SPARK-41470) SPJ: Spark shouldn't assume InternalRow implements equals and hashCode

2023-02-06 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-41470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun reassigned SPARK-41470: Assignee: Mars > SPJ: Spark shouldn't assume InternalRow implements equals and hashCode >

  1   2   >