[jira] [Created] (SPARK-36133) The catalog name keep consistent with the namespace naming rule

2021-07-13 Thread PengLei (Jira)
PengLei created SPARK-36133: --- Summary: The catalog name keep consistent with the namespace naming rule Key: SPARK-36133 URL: https://issues.apache.org/jira/browse/SPARK-36133 Project: Spark Issue

[jira] [Created] (SPARK-36132) Support initial state for flatMapGroupsWithState in batch mode

2021-07-13 Thread Rahul Shivu Mahadev (Jira)
Rahul Shivu Mahadev created SPARK-36132: --- Summary: Support initial state for flatMapGroupsWithState in batch mode Key: SPARK-36132 URL: https://issues.apache.org/jira/browse/SPARK-36132

[jira] [Assigned] (SPARK-35640) Refactor Parquet vectorized reader to remove duplicated code paths

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35640: - Assignee: Chao Sun > Refactor Parquet vectorized reader to remove duplicated code

[jira] [Assigned] (SPARK-35743) Improve Parquet vectorized reader

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-35743: - Assignee: Chao Sun > Improve Parquet vectorized reader >

[jira] [Assigned] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-36123: - Assignee: Chao Sun > Parquet vectorized reader doesn't skip null values correctly >

[jira] [Resolved] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-36129. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 3

[jira] [Resolved] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-36131. --- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 4

[jira] [Assigned] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-36131: - Assignee: Chao Sun > Refactor ParquetColumnIndexSuite >

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380334#comment-17380334 ] Apache Spark commented on SPARK-36130: -- User 'cfmcgrady' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36130: Assignee: Apache Spark > UnwrapCastInBinaryComparison fail when in.list contain

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380333#comment-17380333 ] Apache Spark commented on SPARK-36130: -- User 'cfmcgrady' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36130: Assignee: (was: Apache Spark) > UnwrapCastInBinaryComparison fail when in.list

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380326#comment-17380326 ] Chao Sun commented on SPARK-36128: -- Thanks, I'm slightly inclined to reuse the existing config but

[jira] [Updated] (SPARK-36034) Incorrect datetime filter when reading Parquet files written in legacy mode

2021-07-13 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-36034: Priority: Blocker (was: Major) > Incorrect datetime filter when reading Parquet files written in legacy

[jira] [Updated] (SPARK-36034) Incorrect datetime filter when reading Parquet files written in legacy mode

2021-07-13 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-36034: Target Version/s: 3.2.0 > Incorrect datetime filter when reading Parquet files written in legacy mode >

[jira] [Assigned] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36131: Assignee: (was: Apache Spark) > Refactor ParquetColumnIndexSuite >

[jira] [Assigned] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36131: Assignee: Apache Spark > Refactor ParquetColumnIndexSuite >

[jira] [Commented] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380315#comment-17380315 ] Apache Spark commented on SPARK-36131: -- User 'sunchao' has created a pull request for this issue:

[jira] [Commented] (SPARK-35743) Improve Parquet vectorized reader

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380313#comment-17380313 ] Apache Spark commented on SPARK-35743: -- User 'sunchao' has created a pull request for this issue:

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380307#comment-17380307 ] Hyukjin Kwon commented on SPARK-36128: -- That's okay. I was just thinking that we might have to have

[jira] [Created] (SPARK-36131) Refactor ParquetColumnIndexSuite

2021-07-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-36131: Summary: Refactor ParquetColumnIndexSuite Key: SPARK-36131 URL: https://issues.apache.org/jira/browse/SPARK-36131 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380299#comment-17380299 ] Chao Sun commented on SPARK-36128: -- [~hyukjin.kwon] you are right - I didn't know this config is

[jira] [Comment Edited] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380299#comment-17380299 ] Chao Sun edited comment on SPARK-36128 at 7/14/21, 4:24 AM: [~hyukjin.kwon]

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Fu Chen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380291#comment-17380291 ] Fu Chen commented on SPARK-36130: - Hi, [~hyukjin.kwon], Copy from

[jira] [Commented] (SPARK-36121) Write data loss caused by stage retry when enable v2 FileOutputCommitter

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380290#comment-17380290 ] Hyukjin Kwon commented on SPARK-36121: -- Can you see if this is fixed in Spark 3.1? SPARK-27194

[jira] [Commented] (SPARK-36121) Write data loss caused by stage retry when enable v2 FileOutputCommitter

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380289#comment-17380289 ] Hyukjin Kwon commented on SPARK-36121: -- did you enable speculation? > Write data loss caused by

[jira] [Commented] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380287#comment-17380287 ] Hyukjin Kwon commented on SPARK-36128: -- hm, isn't {{spark.sql.hive.metastorePartitionPruning}} only

[jira] [Commented] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380282#comment-17380282 ] Hyukjin Kwon commented on SPARK-36130: -- cc [~sunchao] FYI > UnwrapCastInBinaryComparison fail when

[jira] [Commented] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380273#comment-17380273 ] Apache Spark commented on SPARK-36129: -- User 'sarutak' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36129: Assignee: Apache Spark (was: Kousuke Saruta) > Upgrade commons-compress to 1.21 to deal

[jira] [Assigned] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36129: Assignee: Kousuke Saruta (was: Apache Spark) > Upgrade commons-compress to 1.21 to deal

[jira] [Commented] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380272#comment-17380272 ] Apache Spark commented on SPARK-36129: -- User 'sarutak' has created a pull request for this issue:

[jira] [Created] (SPARK-36130) UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression

2021-07-13 Thread Fu Chen (Jira)
Fu Chen created SPARK-36130: --- Summary: UnwrapCastInBinaryComparison fail when in.list contain CheckOverflow expression Key: SPARK-36130 URL: https://issues.apache.org/jira/browse/SPARK-36130 Project: Spark

[jira] [Created] (SPARK-36129) Upgrade commons-compress to 1.21 to deal with CVEs

2021-07-13 Thread Kousuke Saruta (Jira)
Kousuke Saruta created SPARK-36129: -- Summary: Upgrade commons-compress to 1.21 to deal with CVEs Key: SPARK-36129 URL: https://issues.apache.org/jira/browse/SPARK-36129 Project: Spark Issue

[jira] [Created] (SPARK-36128) CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning

2021-07-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-36128: Summary: CatalogFileIndex.filterPartitions should respect spark.sql.hive.metastorePartitionPruning Key: SPARK-36128 URL: https://issues.apache.org/jira/browse/SPARK-36128

[jira] [Assigned] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36125: Assignee: Apache Spark > Implement non-equality comparison operators between two

[jira] [Assigned] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36125: Assignee: (was: Apache Spark) > Implement non-equality comparison operators between

[jira] [Commented] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380222#comment-17380222 ] Apache Spark commented on SPARK-36125: -- User 'xinrong-databricks' has created a pull request for

[jira] [Reopened] (SPARK-36127) Adjust non-equality comparison operators to accept scalar

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng reopened SPARK-36127: -- > Adjust non-equality comparison operators to accept scalar >

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators of Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Summary: Implement non-equality comparison operators of Categoricals (was: Implement

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Summary: Implement non-equality comparison operators between two Categoricals (was: Implement

[jira] [Resolved] (SPARK-36127) Adjust non-equality comparison operators to accept scalar

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng resolved SPARK-36127. -- Resolution: Duplicate > Adjust non-equality comparison operators to accept scalar >

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Description: Implement non-equality comparison operators between two Categoricals (was:

[jira] [Updated] (SPARK-36125) Implement non-equality comparison operators between two Categoricals

2021-07-13 Thread Xinrong Meng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xinrong Meng updated SPARK-36125: - Summary: Implement non-equality comparison operators between two Categoricals (was: Implement

[jira] [Created] (SPARK-36127) Adjust non-equality comparison operators to accept scalar

2021-07-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36127: Summary: Adjust non-equality comparison operators to accept scalar Key: SPARK-36127 URL: https://issues.apache.org/jira/browse/SPARK-36127 Project: Spark

[jira] [Created] (SPARK-36126) Adjust equality comparison operators of Categorical to follow pandas

2021-07-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36126: Summary: Adjust equality comparison operators of Categorical to follow pandas Key: SPARK-36126 URL: https://issues.apache.org/jira/browse/SPARK-36126 Project: Spark

[jira] [Created] (SPARK-36125) Implement non-equality comparison operators between two categories

2021-07-13 Thread Xinrong Meng (Jira)
Xinrong Meng created SPARK-36125: Summary: Implement non-equality comparison operators between two categories Key: SPARK-36125 URL: https://issues.apache.org/jira/browse/SPARK-36125 Project: Spark

[jira] [Commented] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380214#comment-17380214 ] Apache Spark commented on SPARK-36123: -- User 'sunchao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36123: Assignee: Apache Spark > Parquet vectorized reader doesn't skip null values correctly >

[jira] [Commented] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380213#comment-17380213 ] Apache Spark commented on SPARK-36123: -- User 'sunchao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36123: Assignee: (was: Apache Spark) > Parquet vectorized reader doesn't skip null values

[jira] [Updated] (SPARK-36124) Support set operators to be on correlation paths

2021-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-36124: - Description: A correlation path is defined as the sub-tree of all the operators that are on

[jira] [Commented] (SPARK-35917) Disable push-based shuffle until the feature is complete

2021-07-13 Thread shubhangi priya (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380189#comment-17380189 ] shubhangi priya commented on SPARK-35917: - How user 'otterc' creates a pull request for the

[jira] [Assigned] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-28266: Assignee: Apache Spark > data duplication when `path` serde property is present >

[jira] [Assigned] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-28266: Assignee: (was: Apache Spark) > data duplication when `path` serde property is

[jira] [Updated] (SPARK-36124) Support set operators to be on correlation paths

2021-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-36124: - Summary: Support set operators to be on correlation paths (was: Support set operators to be on

[jira] [Updated] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-36109: -- Fix Version/s: 3.0.4 3.1.3 > Fix flaky KafkaSourceStressSuite >

[jira] [Updated] (SPARK-35553) Improve correlated subqueries

2021-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-35553: - Summary: Improve correlated subqueries (was: Improve correlated subquery) > Improve correlated

[jira] [Comment Edited] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Shardul Mahadik (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380170#comment-17380170 ] Shardul Mahadik edited comment on SPARK-28266 at 7/13/21, 9:27 PM: --- I

[jira] [Comment Edited] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Shardul Mahadik (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380170#comment-17380170 ] Shardul Mahadik edited comment on SPARK-28266 at 7/13/21, 9:27 PM: --- I

[jira] [Commented] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Shardul Mahadik (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380170#comment-17380170 ] Shardul Mahadik commented on SPARK-28266: - I would like to propose another angle to look at the

[jira] [Reopened] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Krogen reopened SPARK-28266: - Re-opening this issue based on [~shardulm]'s example above demonstrating that this is indeed a

[jira] [Updated] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun updated SPARK-36123: - Labels: correctness (was: ) > Parquet vectorized reader doesn't skip null values correctly >

[jira] [Created] (SPARK-36124) Support set operators to be on a correlation path

2021-07-13 Thread Allison Wang (Jira)
Allison Wang created SPARK-36124: Summary: Support set operators to be on a correlation path Key: SPARK-36124 URL: https://issues.apache.org/jira/browse/SPARK-36124 Project: Spark Issue

[jira] [Created] (SPARK-36123) Parquet vectorized reader doesn't skip null values correctly

2021-07-13 Thread Chao Sun (Jira)
Chao Sun created SPARK-36123: Summary: Parquet vectorized reader doesn't skip null values correctly Key: SPARK-36123 URL: https://issues.apache.org/jira/browse/SPARK-36123 Project: Spark Issue

[jira] [Commented] (SPARK-35917) Disable push-based shuffle until the feature is complete

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380099#comment-17380099 ] Apache Spark commented on SPARK-35917: -- User 'otterc' has created a pull request for this issue:

[jira] [Commented] (SPARK-28266) data duplication when `path` serde property is present

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380093#comment-17380093 ] Apache Spark commented on SPARK-28266: -- User 'shardulm94' has created a pull request for this

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380083#comment-17380083 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380081#comment-17380081 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380079#comment-17380079 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-36109) Fix flaky KafkaSourceStressSuite

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380080#comment-17380080 ] Apache Spark commented on SPARK-36109: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-36065) date_trunc returns incorrect output

2021-07-13 Thread Sumeet (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380050#comment-17380050 ] Sumeet commented on SPARK-36065: cc [~maxgekk] > date_trunc returns incorrect output >

[jira] [Updated] (SPARK-36065) date_trunc returns incorrect output

2021-07-13 Thread Sumeet (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sumeet updated SPARK-36065: --- Affects Version/s: 3.2.0 > date_trunc returns incorrect output > --- > >

[jira] [Updated] (SPARK-36108) Add error classes to QueryParsingErrors

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36108: --- Description: Add error classes to

[jira] [Updated] (SPARK-36107) Add error classes to QueryExecutionErrors

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36107: --- Description: Add error classes to

[jira] [Updated] (SPARK-36106) Add error classes to QueryCompilationErrors

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36106: --- Description: Add error classes to

[jira] [Updated] (SPARK-36094) Group SQL component error messages in Spark error class JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Description: To improve auditing, reduce duplication, and improve quality of error messages thrown

[jira] [Updated] (SPARK-36094) Group SQL component error messages in Spark error class JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Summary: Group SQL component error messages in Spark error class JSON file (was: Group error

[jira] [Updated] (SPARK-36094) Group error messages in JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Description: To improve auditing, reduce duplication, and improve quality of error messages thrown

[jira] [Updated] (SPARK-36094) Group error messages in JSON file

2021-07-13 Thread Karen Feng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karen Feng updated SPARK-36094: --- Description: To improve auditing, reduce duplication, and improve quality of error messages thrown

[jira] [Resolved] (SPARK-34891) Introduce state store manager for session window in streaming query

2021-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh resolved SPARK-34891. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 31989

[jira] [Assigned] (SPARK-34891) Introduce state store manager for session window in streaming query

2021-07-13 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-34891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] L. C. Hsieh reassigned SPARK-34891: --- Assignee: Jungtaek Lim > Introduce state store manager for session window in streaming

[jira] [Updated] (SPARK-35739) [Spark Sql] Add Java-comptable Dataset.join overloads

2021-07-13 Thread Brandon Dahler (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brandon Dahler updated SPARK-35739: --- Description: h2. Problem When using Spark SQL with Java, the required syntax to utilize

[jira] [Commented] (SPARK-35957) Cannot convert Avro schema to catalyst type because schema at path is not compatible

2021-07-13 Thread Erik Krogen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17379978#comment-17379978 ] Erik Krogen commented on SPARK-35957: - [~jkdll] would it be possible for you to try against the

[jira] [Commented] (SPARK-36076) [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17379954#comment-17379954 ] Apache Spark commented on SPARK-36076: -- User 'dgd-contributor' has created a pull request for this

[jira] [Created] (SPARK-36122) Spark does not passon needClientAuth to Jetty SSLContextFactory. Does not allow to configure mTLS authentication.

2021-07-13 Thread Seetharama Khandrika (Jira)
Seetharama Khandrika created SPARK-36122: Summary: Spark does not passon needClientAuth to Jetty SSLContextFactory. Does not allow to configure mTLS authentication. Key: SPARK-36122 URL:

[jira] [Resolved] (SPARK-36120) Support TimestampNTZ type in cache table

2021-07-13 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-36120. -- Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33322

[jira] [Assigned] (SPARK-36076) [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

2021-07-13 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang reassigned SPARK-36076: -- Assignee: dgd_contributor > [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

[jira] [Assigned] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36093: Assignee: Apache Spark > The result incorrect if the partition path case is inconsistent

[jira] [Commented] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17379905#comment-17379905 ] Apache Spark commented on SPARK-36093: -- User 'AngersZh' has created a pull request for this

[jira] [Assigned] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36093: Assignee: (was: Apache Spark) > The result incorrect if the partition path case is

[jira] [Assigned] (SPARK-36093) The result incorrect if the partition path case is inconsistent

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36093: Assignee: Apache Spark > The result incorrect if the partition path case is inconsistent

[jira] [Assigned] (SPARK-35739) [Spark Sql] Add Java-comptable Dataset.join overloads

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35739: Assignee: (was: Apache Spark) > [Spark Sql] Add Java-comptable Dataset.join

[jira] [Assigned] (SPARK-35739) [Spark Sql] Add Java-comptable Dataset.join overloads

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35739: Assignee: Apache Spark > [Spark Sql] Add Java-comptable Dataset.join overloads >

[jira] [Commented] (SPARK-35739) [Spark Sql] Add Java-comptable Dataset.join overloads

2021-07-13 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17379898#comment-17379898 ] Apache Spark commented on SPARK-35739: -- User 'brandondahler' has created a pull request for this

[jira] [Updated] (SPARK-36121) Write data loss caused by stage retry when enable v2 FileOutputCommitter

2021-07-13 Thread gaoyajun02 (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] gaoyajun02 updated SPARK-36121: --- Description: All our ETL scenarios are configured:

[jira] [Updated] (SPARK-36121) Write data loss caused by stage retry when enable v2 FileOutputCommitter

2021-07-13 Thread gaoyajun02 (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] gaoyajun02 updated SPARK-36121: --- Description: All our ETL scenarios are configured:

[jira] [Resolved] (SPARK-36033) Validate partitioning requirements in TPCDS tests

2021-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-36033. - Fix Version/s: 3.2.0 Resolution: Fixed Issue resolved by pull request 33248

[jira] [Assigned] (SPARK-36033) Validate partitioning requirements in TPCDS tests

2021-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-36033: --- Assignee: Wenchen Fan > Validate partitioning requirements in TPCDS tests >

[jira] [Resolved] (SPARK-36074) add error class for StructType.findNestedField

2021-07-13 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-36074. - Fix Version/s: 3.2.0 Resolution: Fixed > add error class for StructType.findNestedField

  1   2   >