[ https://issues.apache.org/jira/browse/SPARK-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14038661#comment-14038661 ]
Andre Schumacher commented on SPARK-2195:
-----------------------------------------

Since commit https://github.com/apache/spark/commit/f479cf3743e416ee08e62806e1b34aff5998ac22 the path is no longer stored in the extraMetadata. So I guess this issue can be closed?

> Parquet extraMetadata can contain key information
> -------------------------------------------------
>
> Key: SPARK-2195
> URL: https://issues.apache.org/jira/browse/SPARK-2195
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 1.0.0
> Reporter: Michael Armbrust
> Priority: Blocker
>
> {code}
> 14/06/19 01:52:05 INFO NewHadoopRDD: Input split: ParquetInputSplit{part:
> file:/Users/pat/Projects/spark-summit-training-2014/usb/data/wiki-parquet/part-r-1.parquet
> start: 0 length: 24971040 hosts: [localhost] blocks: 1 requestedSchema: same
> as file fileSchema: message root {
>   optional int32 id;
>   optional binary title;
>   optional int64 modified;
>   optional binary text;
>   optional binary username;
> }
> extraMetadata:
> {org.apache.spark.sql.parquet.row.metadata=StructType(List(StructField(id,IntegerType,true),
> StructField(title,StringType,true), StructField(modified,LongType,true),
> StructField(text,StringType,true), StructField(username,StringType,true))),
> path=************************ MY AWS KEYS!!! ************************}
> readSupportMetadata:
> {org.apache.spark.sql.parquet.row.metadata=StructType(List(StructField(id,IntegerType,true),
> StructField(title,StringType,true), StructField(modified,LongType,true),
> StructField(text,StringType,true), StructField(username,StringType,true))),
> path=**************************** MY AWS KEYS!!!!
> ***********************************}}
> {code}

--
This message was sent by Atlassian JIRA
(v6.2#6252)