[jira] [Created] (SPARK-35786) Support adaptive repartition

2021-06-16 Thread XiDuo You (Jira)
XiDuo You created SPARK-35786: - Summary: Support adaptive repartition Key: SPARK-35786 URL: https://issues.apache.org/jira/browse/SPARK-35786 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-35786) Support optimize repartition by expression in AQE

2021-06-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35786: -- Summary: Support optimize repartition by expression in AQE (was: Support adaptive repartition by

[jira] [Updated] (SPARK-35786) Support adaptive repartition with repartition by expression

2021-06-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35786: -- Summary: Support adaptive repartition with repartition by expression (was: Support adaptive

[jira] [Updated] (SPARK-35786) Support adaptive repartition by expression

2021-06-16 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35786: -- Summary: Support adaptive repartition by expression (was: Support adaptive repartition with

[jira] [Created] (SPARK-35687) PythonUDFSuite move assume into its methods

2021-06-08 Thread XiDuo You (Jira)
XiDuo You created SPARK-35687: - Summary: PythonUDFSuite move assume into its methods Key: SPARK-35687 URL: https://issues.apache.org/jira/browse/SPARK-35687 Project: Spark Issue Type:

[jira] [Created] (SPARK-35675) EnsureRequirements remove shuffle should respect PartitioningCollection

2021-06-08 Thread XiDuo You (Jira)
XiDuo You created SPARK-35675: - Summary: EnsureRequirements remove shuffle should respect PartitioningCollection Key: SPARK-35675 URL: https://issues.apache.org/jira/browse/SPARK-35675 Project: Spark

[jira] [Commented] (SPARK-33832) Add an option in AQE to mitigate skew even if it causes an new shuffle

2021-06-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-33832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17359190#comment-17359190 ] XiDuo You commented on SPARK-33832: --- hi [~ekoifman] I created a PR for this issue, do you mind taking

[jira] [Updated] (SPARK-35675) EnsureRequirements remove shuffle should respect PartitioningCollection

2021-06-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35675: -- Description: Currently EnsureRequirements only check if child has semantic equal HashPartitioning

[jira] [Created] (SPARK-35813) Add new adaptive config into sql-performance-tuning docs

2021-06-17 Thread XiDuo You (Jira)
XiDuo You created SPARK-35813: - Summary: Add new adaptive config into sql-performance-tuning docs Key: SPARK-35813 URL: https://issues.apache.org/jira/browse/SPARK-35813 Project: Spark Issue

[jira] [Created] (SPARK-35853) Remark the shuffle origin to ENSURE_REQUIREMENTS as far as possible

2021-06-22 Thread XiDuo You (Jira)
XiDuo You created SPARK-35853: - Summary: Remark the shuffle origin to ENSURE_REQUIREMENTS as far as possible Key: SPARK-35853 URL: https://issues.apache.org/jira/browse/SPARK-35853 Project: Spark

[jira] [Updated] (SPARK-35853) Remark the shuffle origin to ENSURE_REQUIREMENTS as far as possible

2021-06-22 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35853: -- Description: In some queries, we might repartition by some columns with a large partition number

[jira] [Created] (SPARK-35888) Add dataSize field in CoalescedPartitionSpec

2021-06-24 Thread XiDuo You (Jira)
XiDuo You created SPARK-35888: - Summary: Add dataSize field in CoalescedPartitionSpec Key: SPARK-35888 URL: https://issues.apache.org/jira/browse/SPARK-35888 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-35888) Add dataSize field in CoalescedPartitionSpec

2021-06-24 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35888: -- Description: Currently, all test suite about `CoalescedPartitionSpec` do not check the data size due

[jira] [Updated] (SPARK-35888) Add dataSize field in CoalescedPartitionSpec

2021-06-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35888: -- Description: Currently, all test suite about `CoalescedPartitionSpec` do not check the data size due

[jira] [Updated] (SPARK-35786) Support optimize repartition by expression in AQE

2021-06-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35786: -- Description: Add a new hint to distingush if we can optimize it safely. was:Currently, we only

[jira] [Updated] (SPARK-35786) Add a new operator to distingush if AQE can optimize safely

2021-06-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35786: -- Summary: Add a new operator to distingush if AQE can optimize safely (was: Add a new hint to

[jira] [Updated] (SPARK-35786) Add a new hint to distingush if AQE can optimize safely

2021-06-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35786: -- Summary: Add a new hint to distingush if AQE can optimize safely (was: Support optimize repartition

[jira] [Created] (SPARK-35725) Support repartition expand partitions

2021-06-11 Thread XiDuo You (Jira)
XiDuo You created SPARK-35725: - Summary: Support repartition expand partitions Key: SPARK-35725 URL: https://issues.apache.org/jira/browse/SPARK-35725 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-35725) Support repartition expand partitions

2021-06-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35725: -- Parent: SPARK-33828 Issue Type: Sub-task (was: Improvement) > Support repartition expand

[jira] [Updated] (SPARK-35725) Support repartition expand partitions in AQE

2021-06-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35725: -- Summary: Support repartition expand partitions in AQE (was: Support repartition expand partitions)

[jira] [Created] (SPARK-35376) Fallback config should override defaultValue

2021-05-11 Thread XiDuo You (Jira)
XiDuo You created SPARK-35376: - Summary: Fallback config should override defaultValue Key: SPARK-35376 URL: https://issues.apache.org/jira/browse/SPARK-35376 Project: Spark Issue Type:

[jira] [Commented] (SPARK-35332) Not Coalesce shuffle partitions when cache table

2021-05-14 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17344501#comment-17344501 ] XiDuo You commented on SPARK-35332: --- [~luxianghao] Now you can `set

[jira] [Updated] (SPARK-35442) Eliminate unnecessary join through Aggregate

2021-05-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35442: -- Description: If Aggregate and Join have the same output partitioning, the plan will look like:

[jira] [Updated] (SPARK-35442) Eliminate unnecessary join through Aggregate

2021-05-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35442: -- Description: If Aggregate and Join have the same output partitioning, the plan will look like:

[jira] [Updated] (SPARK-35442) Eliminate unnecessary join through Aggregate

2021-05-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35442: -- Description: If Aggregate and Join have the same output partitioning, the plan will look like:

[jira] [Commented] (SPARK-35332) Not Coalesce shuffle partitions when cache table

2021-05-07 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17341107#comment-17341107 ] XiDuo You commented on SPARK-35332: --- The reason is Spark force disable the AQE during executing the

[jira] [Created] (SPARK-35455) Enhance EliminateUnnecessaryJoin

2021-05-19 Thread XiDuo You (Jira)
XiDuo You created SPARK-35455: - Summary: Enhance EliminateUnnecessaryJoin Key: SPARK-35455 URL: https://issues.apache.org/jira/browse/SPARK-35455 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-35332) Not Coalesce shuffle partitions when cache table

2021-05-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17341244#comment-17341244 ] XiDuo You commented on SPARK-35332: --- Adding a new cache-specific option in a CACHE statement seems a

[jira] [Updated] (SPARK-35455) Enhance EliminateUnnecessaryJoin

2021-05-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35455: -- Parent: SPARK-33828 Issue Type: Sub-task (was: Improvement) > Enhance

[jira] [Updated] (SPARK-35455) Enhance EliminateUnnecessaryJoin

2021-05-25 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35455: -- Priority: Major (was: Minor) > Enhance EliminateUnnecessaryJoin > >

[jira] [Resolved] (SPARK-35442) Eliminate unnecessary join through Aggregate

2021-05-26 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You resolved SPARK-35442. --- Resolution: Duplicate > Eliminate unnecessary join through Aggregate >

[jira] [Created] (SPARK-35608) Support AQE optimizer side transformUpWithPruning

2021-06-02 Thread XiDuo You (Jira)
XiDuo You created SPARK-35608: - Summary: Support AQE optimizer side transformUpWithPruning Key: SPARK-35608 URL: https://issues.apache.org/jira/browse/SPARK-35608 Project: Spark Issue Type:

[jira] [Created] (SPARK-35585) Support propagate empty relation through project/filter

2021-05-31 Thread XiDuo You (Jira)
XiDuo You created SPARK-35585: - Summary: Support propagate empty relation through project/filter Key: SPARK-35585 URL: https://issues.apache.org/jira/browse/SPARK-35585 Project: Spark Issue

[jira] [Reopened] (SPARK-35442) Eliminate unnecessary join through Aggregate

2021-05-27 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You reopened SPARK-35442: --- > Eliminate unnecessary join through Aggregate > > >

[jira] [Created] (SPARK-35552) Make query stage materialized more readable

2021-05-27 Thread XiDuo You (Jira)
XiDuo You created SPARK-35552: - Summary: Make query stage materialized more readable Key: SPARK-35552 URL: https://issues.apache.org/jira/browse/SPARK-35552 Project: Spark Issue Type:

[jira] [Created] (SPARK-35629) Drop database should check if exists

2021-06-03 Thread XiDuo You (Jira)
XiDuo You created SPARK-35629: - Summary: Drop database should check if exists Key: SPARK-35629 URL: https://issues.apache.org/jira/browse/SPARK-35629 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-35282) Support AQE side shuffled hash join formula

2021-05-26 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35282: -- Description: Use AQE runtime statistics to decide if we can use shuffled hash join instead of sort

[jira] [Created] (SPARK-35540) Make config maxShuffledHashJoinLocalMapThreshold fallback to advisoryPartitionSizeInBytes

2021-05-26 Thread XiDuo You (Jira)
XiDuo You created SPARK-35540: - Summary: Make config maxShuffledHashJoinLocalMapThreshold fallback to advisoryPartitionSizeInBytes Key: SPARK-35540 URL: https://issues.apache.org/jira/browse/SPARK-35540

[jira] [Updated] (SPARK-36014) Use uuid as app id in kubernetes client mode

2021-07-05 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36014: -- Description: Currently, spark on kubernetes with client mode would use `"spark-application-" +

[jira] [Created] (SPARK-36014) Use uuid as app id in kubernetes client mode

2021-07-05 Thread XiDuo You (Jira)
XiDuo You created SPARK-36014: - Summary: Use uuid as app id in kubernetes client mode Key: SPARK-36014 URL: https://issues.apache.org/jira/browse/SPARK-36014 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-35989) Do not remove REPARTITION_BY_NUM shuffle if AQE is enabled

2021-07-02 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35989: -- Description: The shuffle origin is `REPARTITION_BY_NUM` if user specify an exact partition number

[jira] [Updated] (SPARK-35989) Do not remove REPARTITION_BY_NUM shuffle if AQE is enabled

2021-07-02 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35989: -- Environment: (was: The shuffle origin is `REPARTITION_BY_NUM` if user specify an exact partition

[jira] [Created] (SPARK-35989) Do not remove REPARTITION_BY_NUM shuffle if AQE is enabled

2021-07-02 Thread XiDuo You (Jira)
XiDuo You created SPARK-35989: - Summary: Do not remove REPARTITION_BY_NUM shuffle if AQE is enabled Key: SPARK-35989 URL: https://issues.apache.org/jira/browse/SPARK-35989 Project: Spark Issue

[jira] [Created] (SPARK-35961) Only use local shuffle reader for REBALANCE_PARTITIONS_BY_NONE without CustomShuffleReaderExec

2021-06-30 Thread XiDuo You (Jira)
XiDuo You created SPARK-35961: - Summary: Only use local shuffle reader for REBALANCE_PARTITIONS_BY_NONE without CustomShuffleReaderExec Key: SPARK-35961 URL: https://issues.apache.org/jira/browse/SPARK-35961

[jira] [Updated] (SPARK-35961) Only use local shuffle reader for REBALANCE_PARTITIONS_BY_NONE without CustomShuffleReaderExec

2021-06-30 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35961: -- Parent: SPARK-35793 Issue Type: Sub-task (was: Improvement) > Only use local shuffle reader

[jira] [Updated] (SPARK-35961) Only use local shuffle reader for REBALANCE_PARTITIONS_BY_NONE without CustomShuffleReaderExec

2021-06-30 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35961: -- Description: After [SPARK-35725](https://issues.apache.org/jira/browse/SPARK-35725), we might expand

[jira] [Updated] (SPARK-35961) Only use local shuffle reader when REBALANCE_PARTITIONS_BY_NONE without CustomShuffleReaderExec

2021-06-30 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-35961: -- Summary: Only use local shuffle reader when REBALANCE_PARTITIONS_BY_NONE without

[jira] [Created] (SPARK-35923) Coalesce empty partition with mixed CoalescedPartitionSpec and PartialReducerPartitionSpec

2021-06-28 Thread XiDuo You (Jira)
XiDuo You created SPARK-35923: - Summary: Coalesce empty partition with mixed CoalescedPartitionSpec and PartialReducerPartitionSpec Key: SPARK-35923 URL: https://issues.apache.org/jira/browse/SPARK-35923

[jira] [Created] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-11 Thread XiDuo You (Jira)
XiDuo You created SPARK-36085: - Summary: Make broadcast query stage executionContext isolation from AQE Key: SPARK-36085 URL: https://issues.apache.org/jira/browse/SPARK-36085 Project: Spark

[jira] [Resolved] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You resolved SPARK-36085. --- Resolution: Won't Fix > Make broadcast query stage executionContext isolation from AQE >

[jira] [Updated] (SPARK-36085) Make broadcast query stage executionContext isolation from AQE

2021-07-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36085: -- Parent: (was: SPARK-33828) Issue Type: Improvement (was: Sub-task) > Make broadcast

[jira] [Created] (SPARK-36032) RemoveRedundantSorts should be applied after reOptimize in AQE

2021-07-07 Thread XiDuo You (Jira)
XiDuo You created SPARK-36032: - Summary: RemoveRedundantSorts should be applied after reOptimize in AQE Key: SPARK-36032 URL: https://issues.apache.org/jira/browse/SPARK-36032 Project: Spark

[jira] [Updated] (SPARK-36032) Use inputPlan instead of currentPhysicalPlan to initialize logical link

2021-07-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36032: -- Summary: Use inputPlan instead of currentPhysicalPlan to initialize logical link (was:

[jira] [Updated] (SPARK-36032) Use inputPlan instead of currentPhysicalPlan to initialize logical link

2021-07-08 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36032: -- Description: At {{initialPlan}} we may remove some Spark Plan with  {{queryStagePreparationRules}},

[jira] [Created] (SPARK-35442) Eliminate unnecessary join through Aggregate

2021-05-19 Thread XiDuo You (Jira)
XiDuo You created SPARK-35442: - Summary: Eliminate unnecessary join through Aggregate Key: SPARK-35442 URL: https://issues.apache.org/jira/browse/SPARK-35442 Project: Spark Issue Type:

[jira] [Created] (SPARK-36424) Support eliminate limits in AQE Optimizer

2021-08-05 Thread XiDuo You (Jira)
XiDuo You created SPARK-36424: - Summary: Support eliminate limits in AQE Optimizer Key: SPARK-36424 URL: https://issues.apache.org/jira/browse/SPARK-36424 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-36993) Fix json_tupe throw NPE if fields exist no foldable null value

2021-10-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36993: -- Affects Version/s: 3.0.3 > Fix json_tupe throw NPE if fields exist no foldable null value >

[jira] [Created] (SPARK-36993) Fix json_tupe throw NPE if fields exist no foldable null column

2021-10-12 Thread XiDuo You (Jira)
XiDuo You created SPARK-36993: - Summary: Fix json_tupe throw NPE if fields exist no foldable null column Key: SPARK-36993 URL: https://issues.apache.org/jira/browse/SPARK-36993 Project: Spark

[jira] [Updated] (SPARK-36993) Fix json_tupe throw NPE if fields exist no foldable null column

2021-10-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36993: -- Description: If json_tuple exists no foldable null field, Spark would throw NPE during eval

[jira] [Updated] (SPARK-36993) Fix json_tupe throw NPE if fields exist no foldable null field

2021-10-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36993: -- Summary: Fix json_tupe throw NPE if fields exist no foldable null field (was: Fix json_tupe throw

[jira] [Updated] (SPARK-36993) Fix json_tupe throw NPE if fields exist no foldable null value

2021-10-12 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36993: -- Summary: Fix json_tupe throw NPE if fields exist no foldable null value (was: Fix json_tupe throw

[jira] [Created] (SPARK-36992) Improve byte array sort perf by unify getPrefix function of UTF8String and ByteArray

2021-10-12 Thread XiDuo You (Jira)
XiDuo You created SPARK-36992: - Summary: Improve byte array sort perf by unify getPrefix function of UTF8String and ByteArray Key: SPARK-36992 URL: https://issues.apache.org/jira/browse/SPARK-36992

[jira] [Created] (SPARK-36979) Add RewriteLateralSubquery rule into nonExcludableRules

2021-10-11 Thread XiDuo You (Jira)
XiDuo You created SPARK-36979: - Summary: Add RewriteLateralSubquery rule into nonExcludableRules Key: SPARK-36979 URL: https://issues.apache.org/jira/browse/SPARK-36979 Project: Spark Issue

[jira] [Updated] (SPARK-37080) Add benchmark tool guide in pull request template

2021-10-20 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37080: -- Summary: Add benchmark tool guide in pull request template (was: Add benchmark guide in pull request

[jira] [Created] (SPARK-37080) Add benchmark guide in pull request template

2021-10-20 Thread XiDuo You (Jira)
XiDuo You created SPARK-37080: - Summary: Add benchmark guide in pull request template Key: SPARK-37080 URL: https://issues.apache.org/jira/browse/SPARK-37080 Project: Spark Issue Type:

[jira] [Commented] (SPARK-37063) SQL Adaptive Query Execution QA: Phase 2

2021-10-19 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17430838#comment-17430838 ] XiDuo You commented on SPARK-37063: --- thank you [~dongjoon] for creating this umbrella ! > SQL

[jira] [Created] (SPARK-37064) Fix outer join return the wrong max rows if other side is empty

2021-10-19 Thread XiDuo You (Jira)
XiDuo You created SPARK-37064: - Summary: Fix outer join return the wrong max rows if other side is empty Key: SPARK-37064 URL: https://issues.apache.org/jira/browse/SPARK-37064 Project: Spark

[jira] [Updated] (SPARK-37037) Improve byte array sort by unify compareTo function of UTF8String and ByteArray

2021-10-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37037: -- Description: BinaryType use `TypeUtils.compareBinary` to compare two byte array, however it's slow

[jira] [Updated] (SPARK-37037) Improve byte array sort by unify compareTo function of UTF8String and ByteArray

2021-10-18 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37037: -- Description: BinaryType use `TypeUtils.compareBinary` to compare two byte array, however it's slow

[jira] [Created] (SPARK-37037) Improve byte array sort by unify compareTo function of UTF8String and ByteArray

2021-10-17 Thread XiDuo You (Jira)
XiDuo You created SPARK-37037: - Summary: Improve byte array sort by unify compareTo function of UTF8String and ByteArray Key: SPARK-37037 URL: https://issues.apache.org/jira/browse/SPARK-37037 Project:

[jira] [Created] (SPARK-37043) Cancel all running job after AQE plan finished

2021-10-18 Thread XiDuo You (Jira)
XiDuo You created SPARK-37043: - Summary: Cancel all running job after AQE plan finished Key: SPARK-37043 URL: https://issues.apache.org/jira/browse/SPARK-37043 Project: Spark Issue Type:

[jira] [Updated] (SPARK-37043) Cancel all running job after AQE plan finished

2021-10-18 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37043: -- Description: We see stage was still running after AQE plan finished. This is because the plan which

[jira] [Updated] (SPARK-36424) Support eliminate limits in AQE Optimizer

2021-09-29 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36424: -- Parent: SPARK-33828 Issue Type: Sub-task (was: Improvement) > Support eliminate limits in

[jira] [Created] (SPARK-36822) BroadcastNestedLoopJoinExec should use all condition instead of non-equi condition

2021-09-21 Thread XiDuo You (Jira)
XiDuo You created SPARK-36822: - Summary: BroadcastNestedLoopJoinExec should use all condition instead of non-equi condition Key: SPARK-36822 URL: https://issues.apache.org/jira/browse/SPARK-36822

[jira] [Created] (SPARK-36823) Support broadcast nested loop join hint for equi-join

2021-09-21 Thread XiDuo You (Jira)
XiDuo You created SPARK-36823: - Summary: Support broadcast nested loop join hint for equi-join Key: SPARK-36823 URL: https://issues.apache.org/jira/browse/SPARK-36823 Project: Spark Issue Type:

[jira] [Updated] (SPARK-36823) Support broadcast nested loop join hint for equi-join

2021-09-21 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-36823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-36823: -- Description: For the join if one side is small and other side is large, the shuffle overhead is also

[jira] [Created] (SPARK-37098) Alter table properties should invalidate cache

2021-10-22 Thread XiDuo You (Jira)
XiDuo You created SPARK-37098: - Summary: Alter table properties should invalidate cache Key: SPARK-37098 URL: https://issues.apache.org/jira/browse/SPARK-37098 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-37559) ShuffledRowRDD get preferred locations order by reduce size

2021-12-06 Thread XiDuo You (Jira)
XiDuo You created SPARK-37559: - Summary: ShuffledRowRDD get preferred locations order by reduce size Key: SPARK-37559 URL: https://issues.apache.org/jira/browse/SPARK-37559 Project: Spark Issue

[jira] [Updated] (SPARK-37796) ByteArrayMethods arrayEquals should fast skip the check of aligning with unaligned platform

2021-12-30 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37796: -- Description: The method `arrayEquals` in `ByteArrayMethods` is critical function which is used in

[jira] [Updated] (SPARK-37796) ByteArrayMethods arrayEquals should fast skip the check of aligning with unaligned platform

2021-12-30 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37796: -- Summary: ByteArrayMethods arrayEquals should fast skip the check of aligning with unaligned platform

[jira] [Created] (SPARK-37796) ByteArrayMethods arrayEquals should fast skip the checking of aligned in unaligned platform

2021-12-30 Thread XiDuo You (Jira)
XiDuo You created SPARK-37796: - Summary: ByteArrayMethods arrayEquals should fast skip the checking of aligned in unaligned platform Key: SPARK-37796 URL: https://issues.apache.org/jira/browse/SPARK-37796

[jira] [Created] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
XiDuo You created SPARK-37357: - Summary: Add merged last partition factor for rebalance Key: SPARK-37357 URL: https://issues.apache.org/jira/browse/SPARK-37357 Project: Spark Issue Type:

[jira] [Updated] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: `Rebalance` provide a functionality that split the large reduce partition into smalls.

[jira] [Updated] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: `Rebalance` provide a functionality that split the large reduce partition into smalls.

[jira] [Updated] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: `Rebalance` provide a functionality that split the large reduce partition into smalls.

[jira] [Updated] (SPARK-37357) Add merged last partition factor for split skew partition

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Summary: Add merged last partition factor for split skew partition (was: Add merged last partition

[jira] [Updated] (SPARK-37357) Add merged last partition factor for split skew partition

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: For example `Rebalance` provide a functionality that split the large reduce partition

[jira] [Updated] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Summary: Add merged last partition factor for rebalance (was: Add merged last partition factor for

[jira] [Updated] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: `Rebalance` provide a functionality that split the large reduce partition into smalls.

[jira] [Created] (SPARK-37267) OptimizeSkewInRebalancePartitions support optimize non-root node

2021-11-10 Thread XiDuo You (Jira)
XiDuo You created SPARK-37267: - Summary: OptimizeSkewInRebalancePartitions support optimize non-root node Key: SPARK-37267 URL: https://issues.apache.org/jira/browse/SPARK-37267 Project: Spark

[jira] [Updated] (SPARK-37287) Pull out dynamic partition and bucket sort from FileFormatWriter

2021-11-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37287: -- Description: FileFormatWriter.write now is used by all V1 write which includes datasource and hive

[jira] [Updated] (SPARK-37287) Pull out dynamic partition and bucket sort from FileFormatWriter

2021-11-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37287: -- Description: `FileFormatWriter.write` now is used by all V1 write which includes datasource and hive

[jira] [Updated] (SPARK-37287) Pull out dynamic partition and bucket sort from FileFormatWriter

2021-11-11 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37287: -- Description: FileFormatWriter.write now is used by all V1 write which includes datasource and hive

[jira] [Updated] (SPARK-37357) Add small partition factor for rebalance partitions

2021-11-24 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Summary: Add small partition factor for rebalance partitions (was: Create skew partition specs

[jira] [Created] (SPARK-37333) Specify the required distribution at V1Write

2021-11-15 Thread XiDuo You (Jira)
XiDuo You created SPARK-37333: - Summary: Specify the required distribution at V1Write Key: SPARK-37333 URL: https://issues.apache.org/jira/browse/SPARK-37333 Project: Spark Issue Type:

[jira] [Created] (SPARK-37287) Pull out dynamic partition and bucket sort from FileFormatWriter

2021-11-11 Thread XiDuo You (Jira)
XiDuo You created SPARK-37287: - Summary: Pull out dynamic partition and bucket sort from FileFormatWriter Key: SPARK-37287 URL: https://issues.apache.org/jira/browse/SPARK-37287 Project: Spark

[jira] [Updated] (SPARK-37357) Add merged last partition factor for rebalance

2021-11-17 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: `Rebalance` provide a functionality that split the large reduce partition into smalls.

[jira] [Updated] (SPARK-37357) Create skew partition specs should respect min partition size

2021-11-18 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Description: For example `Rebalance` provide a functionality that split the large reduce partition

[jira] [Updated] (SPARK-37357) Create skew partition specs should respect min partition size

2021-11-18 Thread XiDuo You (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiDuo You updated SPARK-37357: -- Summary: Create skew partition specs should respect min partition size (was: Add merged last

[jira] [Created] (SPARK-37194) Avoid unnecessary sort in FileFormatWriter if it's not dynamic partition

2021-11-02 Thread XiDuo You (Jira)
XiDuo You created SPARK-37194: - Summary: Avoid unnecessary sort in FileFormatWriter if it's not dynamic partition Key: SPARK-37194 URL: https://issues.apache.org/jira/browse/SPARK-37194 Project: Spark

  1   2   3   4   5   >