Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/14660#discussion_r74889938
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala
---
@@ -224,7 +226,16 @@ class ParquetFileFormat
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/14660#discussion_r74889501
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala
---
@@ -794,13 +805,44 @@ object
GitHub user HyukjinKwon opened a pull request:
https://github.com/apache/spark/pull/14660
[SPARK-17071][SQL] Fetch Parquet schema without another Spark job when it
is a single file to touch
## What changes were proposed in this pull request?
It seems Spark executes