[spark] branch branch-3.0 updated: [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics

wenchen Wed, 01 Jul 2020 00:14:53 -0700

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git



The following commit(s) were added to refs/heads/branch-3.0 by this push:
     new 49855e7  [SPARK-32095][SQL] Update documentation to reflect usage of 
updated statistics
49855e7 is described below

commit 49855e78a3ae430cd67c50d6dd81f6688fb3de15
Author: Micah Kornfield <mic...@google.com>
AuthorDate: Wed Jul 1 07:13:10 2020 +0000

    [SPARK-32095][SQL] Update documentation to reflect usage of updated 
statistics
    
    ### What changes were proposed in this pull request?
    Update documentation to reflect changes in 
https://github.com/apache/spark/commit/faf220aad9051c224a630e678c54098861f6b482
    
    I've changed the documentation to reflect updated statistics may be used to 
improve query plan.
    
    ### Why are the changes needed?
    I believe the documentation is stale and misleading.
    
    ### Does this PR introduce _any_ user-facing change?
    Yes, this is a javadoc documentation fix.
    
    ### How was this patch tested?
    Doc fix.
    
    Closes #28925 from emkornfield/spark-32095.
    
    Authored-by: Micah Kornfield <mic...@google.com>
    Signed-off-by: Wenchen Fan <wenc...@databricks.com>
    (cherry picked from commit a4ba34434c55c7f0eafa32d8c7ea6e052813f293)
    Signed-off-by: Wenchen Fan <wenc...@databricks.com>
---
 .../apache/spark/sql/connector/read/SupportsReportStatistics.java   | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git 
a/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
 
b/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
index b839fd5..1e0c9ca 100644
--- 
a/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
+++ 
b/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
@@ -23,9 +23,9 @@ import org.apache.spark.annotation.Evolving;
  * A mix in interface for {@link Scan}. Data sources can implement this 
interface to
  * report statistics to Spark.
  *
- * As of Spark 2.4, statistics are reported to the optimizer before any 
operator is pushed to the
- * data source. Implementations that return more accurate statistics based on 
pushed operators will
- * not improve query performance until the planner can push operators before 
getting stats.
+ * As of Spark 3.0, statistics are reported to the optimizer after operators 
are pushed to the
+ * data source. Implementations may return more accurate statistics based on 
pushed operators
+ * which may improve query performance by providing better information to the 
optimizer.
  *
  * @since 3.0.0
  */


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

[spark] branch branch-3.0 updated: [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics

Reply via email to