Hi Andrew, This blog gives an idea how to schema is resolved: https://blog.godatadriven.com/multiformat-spark-partition There is some optimisation going on when reading Parquet using Spark. Hope this helps.
Cheers, Fokko Op wo 22 aug. 2018 om 23:59 schreef t4 <reubensaw...@hotmail.com>: > https://issues.apache.org/jira/browse/SPARK-23576 ? > > > > -- > Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ > > --------------------------------------------------------------------- > To unsubscribe e-mail: dev-unsubscr...@spark.apache.org > >