[spark] branch branch-3.0 updated: [SPARK-33451][DOCS] Change to 'spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes' in documentation

gurwls223 Sun, 15 Nov 2020 17:56:02 -0800

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git



The following commit(s) were added to refs/heads/branch-3.0 by this push:
     new 265363d  [SPARK-33451][DOCS] Change to 'spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes' in documentation
265363d is described below

commit 265363d89e8ffc452e100289f8f1a94cd9d6cc44
Author: aof00 <[email protected]>
AuthorDate: Mon Nov 16 10:32:00 2020 +0900

    [SPARK-33451][DOCS] Change to 'spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes' in documentation
    
    ### What changes were proposed in this pull request?
    
    In the 'Optimizing Skew Join' section of the following two pages:
    1. [https://spark.apache.org/docs/3.0.0/sql-performance-tuning.html](https://spark.apache.org/docs/3.0.0/sql-performance-tuning.html)
    2. [https://spark.apache.org/docs/3.0.1/sql-performance-tuning.html](https://spark.apache.org/docs/3.0.1/sql-performance-tuning.html)
    
    The configuration 'spark.sql.adaptive.skewedPartitionThresholdInBytes' should be changed to 'spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes'. The former is missing the 'skewJoin' segment.
    
    ### Why are the changes needed?
    
    To document the correct name of the configuration.
    
    ### Does this PR introduce _any_ user-facing change?
    
    Yes, this is a user-facing doc change.
    
    ### How was this patch tested?
    
    Jenkins / CI builds in this PR.
    
    Closes #30376 from aof00/doc_change.
    
    Authored-by: aof00 <[email protected]>
    Signed-off-by: HyukjinKwon <[email protected]>
    (cherry picked from commit 0933f1c6c27c7a087c8dbe485bd08371ce5f5695)
    Signed-off-by: HyukjinKwon <[email protected]>
---
 docs/sql-performance-tuning.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/sql-performance-tuning.md b/docs/sql-performance-tuning.md
index 5d8c3b6..49b32e7 100644
--- a/docs/sql-performance-tuning.md
+++ b/docs/sql-performance-tuning.md
@@ -280,7 +280,7 @@ Data skew can severely downgrade the performance of join queries. This feature d
        <td><code>spark.sql.adaptive.skewJoin.skewedPartitionFactor</code></td>
        <td>10</td>
        <td>
-         A partition is considered as skewed if its size is larger than this factor multiplying the median partition size and also larger than <code>spark.sql.adaptive.skewedPartitionThresholdInBytes</code>.
+         A partition is considered as skewed if its size is larger than this factor multiplying the median partition size and also larger than <code>spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes</code>.
        </td>
        <td>3.0.0</td>
      </tr>
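
The skew rule described in the changed doc line combines two conditions: a partition must exceed the factor times the median partition size, and it must also exceed the byte threshold. The sketch below illustrates that rule in plain Python. It is not Spark's implementation: the function names are hypothetical, the 256 MB threshold is an assumed value chosen for illustration, and `statistics.median` stands in for however Spark computes the median internally. Only the factor of 10 comes from the table row shown in the diff.

```python
from statistics import median

MB = 1024 * 1024


def is_skewed(size_bytes, median_bytes, factor=10, threshold_bytes=256 * MB):
    """Return True when both skew conditions from the doc text hold.

    factor          -> models spark.sql.adaptive.skewJoin.skewedPartitionFactor
    threshold_bytes -> models spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes
                       (256 MB is an assumption for this sketch)
    """
    return size_bytes > factor * median_bytes and size_bytes > threshold_bytes


def skewed_partitions(sizes, factor=10, threshold_bytes=256 * MB):
    """Return the indices of partitions that satisfy the skew rule."""
    med = median(sizes)
    return [
        i for i, size in enumerate(sizes)
        if is_skewed(size, med, factor, threshold_bytes)
    ]


# Hypothetical partition sizes; the median is 80 MB, so factor * median is 800 MB.
sizes = [64 * MB, 80 * MB, 72 * MB, 900 * MB, 300 * MB]
print(skewed_partitions(sizes))  # prints [3]
```

The 300 MB partition exceeds the assumed threshold but not 10 times the median, so it is not flagged. Both conditions have to hold, which is why the doc needs to name the threshold setting correctly.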


