[jira] [Resolved] (SPARK-35813) Add new adaptive config into sql-performance-tuning docs

2021-07-11 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-35813. Fix Version/s: 3.2.0. Resolution: Fixed. Issue resolved by pull request 32960

[jira] [Resolved] (SPARK-36071) Spark driver requires large memory space for serialized results even there are no data collected to the driver

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-36071. -- Resolution: Cannot Reproduce > Spark driver requires large memory space for serialized

[jira] [Commented] (SPARK-36071) Spark driver requires large memory space for serialized results even there are no data collected to the driver

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378935#comment-17378935 ] Hyukjin Kwon commented on SPARK-36071: -- [~vcshashank] can you show your codes? > Spark driver

[jira] [Resolved] (SPARK-36084) spark kafka offset missed some partition offset describle

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-36084. -- Resolution: Incomplete > spark kafka offset missed some partition offset describle >

[jira] [Updated] (SPARK-36071) Spark driver requires large memory space for serialized results even there are no data collected to the driver

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-36071: - Priority: Major (was: Critical) > Spark driver requires large memory space for serialized

[jira] [Commented] (SPARK-36084) spark kafka offset missed some partition offset describle

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378934#comment-17378934 ] Hyukjin Kwon commented on SPARK-36084: -- [~geekyouth] Spark 2.4 is EOL, and there will be no more

[jira] [Updated] (SPARK-36084) spark kafka offset missed some partition offset describle

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-36084: - Priority: Major (was: Critical) > spark kafka offset missed some partition offset describle >

[jira] [Assigned] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36085: Assignee: (was: Apache Spark) > Make broadcast query stage executionContext

[jira] [Commented] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378913#comment-17378913 ] Apache Spark commented on SPARK-36085: -- User 'ulysses-you' has created a pull request for this

[jira] [Assigned] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36085: Assignee: Apache Spark > Make broadcast query stage executionContext isolation from AQE

[jira] [Assigned] (SPARK-36076) [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36076: Assignee: (was: Apache Spark) > [SQL] ArrayIndexOutOfBounds in CAST string to

[jira] [Assigned] (SPARK-36076) [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36076: Assignee: Apache Spark > [SQL] ArrayIndexOutOfBounds in CAST string to timestamp >

[jira] [Commented] (SPARK-36076) [SQL] ArrayIndexOutOfBounds in CAST string to timestamp

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378911#comment-17378911 ] Apache Spark commented on SPARK-36076: -- User 'dgd-contributor' has created a pull request for this

[jira] [Created] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-11 Thread XiDuo You (Jira)
XiDuo You created SPARK-36085: - Summary: Make broadcast query stage executionContext isolation from AQE Key: SPARK-36085 URL: https://issues.apache.org/jira/browse/SPARK-36085 Project: Spark

[jira] [Commented] (SPARK-35561) partition result is incorrect when insert into partition table with int datatype partition column

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378876#comment-17378876 ] Apache Spark commented on SPARK-35561: -- User 'dgd-contributor' has created a pull request for this

[jira] [Commented] (SPARK-36069) spark function from_json should output field name, field type and field value when FAILFAST mode throw exception

2021-07-11 Thread geekyouth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1737#comment-1737 ] geekyouth commented on SPARK-36069: --- I also want to merge this feature into version 2.4.3. Now my

[jira] [Commented] (SPARK-36069) spark function from_json should output field name, field type and field value when FAILFAST mode throw exception

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378882#comment-17378882 ] Apache Spark commented on SPARK-36069: -- User 'geekyouth' has created a pull request for this issue:

[jira] [Commented] (SPARK-36069) spark function from_json should output field name, field type and field value when FAILFAST mode throw exception

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378883#comment-17378883 ] Apache Spark commented on SPARK-36069: -- User 'geekyouth' has created a pull request for this issue:

[jira] [Updated] (SPARK-27396) SPIP: Public APIs for extended Columnar Processing Support

2021-07-11 Thread zengrui (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zengrui updated SPARK-27396: Description: *SPIP: Columnar Processing Without Arrow Formatting Guarantees.* *Q1.* What

[jira] [Assigned] (SPARK-36069) spark function from_json should output field name, field type and field value when FAILFAST mode throw exception

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36069: Assignee: (was: Apache Spark) > spark function from_json should output field name,

[jira] [Assigned] (SPARK-36069) spark function from_json should output field name, field type and field value when FAILFAST mode throw exception

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36069: Assignee: Apache Spark > spark function from_json should output field name, field type

[jira] [Assigned] (SPARK-35561) partition result is incorrect when insert into partition table with int datatype partition column

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35561: Assignee: (was: Apache Spark) > partition result is incorrect when insert into

[jira] [Commented] (SPARK-35561) partition result is incorrect when insert into partition table with int datatype partition column

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378875#comment-17378875 ] Apache Spark commented on SPARK-35561: -- User 'dgd-contributor' has created a pull request for this

[jira] [Assigned] (SPARK-35561) partition result is incorrect when insert into partition table with int datatype partition column

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-35561: Assignee: Apache Spark > partition result is incorrect when insert into partition table

[jira] [Created] (SPARK-36084) spark kafka offset missed some partition offset describle

2021-07-11 Thread geekyouth (Jira)
geekyouth created SPARK-36084: - Summary: spark kafka offset missed some partition offset describle Key: SPARK-36084 URL: https://issues.apache.org/jira/browse/SPARK-36084 Project: Spark Issue

[jira] [Assigned] (SPARK-36064) Manage InternalField more in DataTypeOps.

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-36064: Assignee: Takuya Ueshin > Manage InternalField more in DataTypeOps. >

[jira] [Resolved] (SPARK-36064) Manage InternalField more in DataTypeOps.

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-36064. Fix Version/s: 3.2.0. Resolution: Fixed. Issue resolved by pull request 33275

[jira] [Updated] (SPARK-36037) Support ANSI SQL LOCALTIMESTAMP datetime value function

2021-07-11 Thread jiaan.geng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jiaan.geng updated SPARK-36037: --- Summary: Support ANSI SQL LOCALTIMESTAMP datetime value function (was: Support new function

[jira] [Commented] (SPARK-35508) job group and description do not apply on broadcasts

2021-07-11 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378815#comment-17378815 ] Hyukjin Kwon commented on SPARK-35508: -- I think we should probably have a way to set multiple job

[jira] [Resolved] (SPARK-36083) make_timestamp: return different result based on the default timestamp type

2021-07-11 Thread Max Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Max Gekk resolved SPARK-36083. Fix Version/s: 3.2.0. Resolution: Fixed. Issue resolved by pull request 33290

[jira] [Resolved] (SPARK-36036) Regression: Remote blocks stored on disk by BlockManager are not deleted

2021-07-11 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-36036. Fix Version/s: 3.3.0. Resolution: Fixed. Issue resolved by pull request 33251

[jira] [Assigned] (SPARK-36036) Regression: Remote blocks stored on disk by BlockManager are not deleted

2021-07-11 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-36036: Assignee: Denis Tarima > Regression: Remote blocks stored on disk by BlockManager are

[jira] [Updated] (SPARK-35743) Improve Parquet vectorized reader

2021-07-11 Thread Chao Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao Sun updated SPARK-35743: - Labels: parquet (was: ) > Improve Parquet vectorized reader > - > >

[jira] [Commented] (SPARK-36083) make_timestamp: return different result based on the default timestamp type

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378605#comment-17378605 ] Apache Spark commented on SPARK-36083: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-36083) make_timestamp: return different result based on the default timestamp type

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378604#comment-17378604 ] Apache Spark commented on SPARK-36083: -- User 'gengliangwang' has created a pull request for this

[jira] [Assigned] (SPARK-36083) make_timestamp: return different result based on the default timestamp type

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36083: Assignee: Apache Spark (was: Gengliang Wang) > make_timestamp: return different result

[jira] [Assigned] (SPARK-36083) make_timestamp: return different result based on the default timestamp type

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36083: Assignee: Gengliang Wang (was: Apache Spark) > make_timestamp: return different result

[jira] [Commented] (SPARK-36046) Support new function make_timestamp_ntz

2021-07-11 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378603#comment-17378603 ] Gengliang Wang commented on SPARK-36046: I will work on this after

[jira] [Updated] (SPARK-36046) Support new functions make_timestamp_ntz and make_timestamp_ltz

2021-07-11 Thread Gengliang Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-36046: --- Summary: Support new functions make_timestamp_ntz and make_timestamp_ltz (was: Support new

[jira] [Created] (SPARK-36083) make_timestamp: return different result based on the default timestamp type

2021-07-11 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-36083: -- Summary: make_timestamp: return different result based on the default timestamp type Key: SPARK-36083 URL: https://issues.apache.org/jira/browse/SPARK-36083

[jira] [Commented] (SPARK-36082) when the right side is small enough to use SingleColumn Null Aware Anti Join

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17378594#comment-17378594 ] Apache Spark commented on SPARK-36082: -- User 'mcdull-zhang' has created a pull request for this

[jira] [Assigned] (SPARK-36082) when the right side is small enough to use SingleColumn Null Aware Anti Join

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36082: Assignee: Apache Spark > when the right side is small enough to use SingleColumn Null

[jira] [Assigned] (SPARK-36082) when the right side is small enough to use SingleColumn Null Aware Anti Join

2021-07-11 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-36082: Assignee: (was: Apache Spark) > when the right side is small enough to use

[jira] [Created] (SPARK-36082) when the right side is small enough to use SingleColumn Null Aware Anti Join

2021-07-11 Thread mcdull_zhang (Jira)
mcdull_zhang created SPARK-36082: Summary: when the right side is small enough to use SingleColumn Null Aware Anti Join Key: SPARK-36082 URL: https://issues.apache.org/jira/browse/SPARK-36082