[GitHub] [spark] viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate in explode

GitBox Sun, 07 Jul 2019 18:31:03 -0700

viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune 
unnecessary nested fields from Generate in explode
URL: https://github.com/apache/spark/pull/24637#discussion_r300897753


 ##########
 File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala
 ##########
 @@ -1576,7 +1576,8 @@ object SQLConf {
       .doc("Prune nested fields from a logical relation's output which are 
unnecessary in " +
         "satisfying a query. This optimization allows columnar file format 
readers to avoid " +
         "reading unnecessary nested column data. Currently Parquet and ORC are 
the " +
-        "data sources that implement this optimization.")
+        "data sources that implement this optimization. This optimization also 
allows pruning " +
+        "unnecessary nested fields in expressions of operator.")
 
 Review comment:
   We talked about it in previous discussion, that is an option. Reusing 
`nestedSchemaPruning` was to simplify configs. Let me create a separate config 
for this.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] viirya commented on a change in pull request #24637: [SPARK-27707][SQL] Prune unnecessary nested fields from Generate in explode

Reply via email to