This is an automated email from the ASF dual-hosted git repository. wenchen pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/branch-3.0 by this push: new 49855e7 [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics 49855e7 is described below commit 49855e78a3ae430cd67c50d6dd81f6688fb3de15 Author: Micah Kornfield <mic...@google.com> AuthorDate: Wed Jul 1 07:13:10 2020 +0000 [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics ### What changes were proposed in this pull request? Update documentation to reflect changes in https://github.com/apache/spark/commit/faf220aad9051c224a630e678c54098861f6b482 I've changed the documentation to reflect updated statistics may be used to improve query plan. ### Why are the changes needed? I believe the documentation is stale and misleading. ### Does this PR introduce _any_ user-facing change? Yes, this is a javadoc documentation fix. ### How was this patch tested? Doc fix. Closes #28925 from emkornfield/spark-32095. Authored-by: Micah Kornfield <mic...@google.com> Signed-off-by: Wenchen Fan <wenc...@databricks.com> (cherry picked from commit a4ba34434c55c7f0eafa32d8c7ea6e052813f293) Signed-off-by: Wenchen Fan <wenc...@databricks.com> --- .../apache/spark/sql/connector/read/SupportsReportStatistics.java | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java b/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java index b839fd5..1e0c9ca 100644 --- a/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java +++ b/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java @@ -23,9 +23,9 @@ import org.apache.spark.annotation.Evolving; * A mix in interface for {@link Scan}. Data sources can implement this interface to * report statistics to Spark. * - * As of Spark 2.4, statistics are reported to the optimizer before any operator is pushed to the - * data source. Implementations that return more accurate statistics based on pushed operators will - * not improve query performance until the planner can push operators before getting stats. + * As of Spark 3.0, statistics are reported to the optimizer after operators are pushed to the + * data source. Implementations may return more accurate statistics based on pushed operators + * which may improve query performance by providing better information to the optimizer. * * @since 3.0.0 */ --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org