[ https://issues.apache.org/jira/browse/SPARK-3833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen updated SPARK-3833:
-----------------------------
    Assignee: Michael Armbrust

> Allow Spark SQL SchemaRDDs to be merged
> ---------------------------------------
>
>                 Key: SPARK-3833
>                 URL: https://issues.apache.org/jira/browse/SPARK-3833
>             Project: Spark
>          Issue Type: Wish
>          Components: SQL
>            Reporter: Chris Wood
>            Assignee: Michael Armbrust
>             Fix For: 1.4.0
>
>
> We have JSON flowing into Spark SQL.
> I can successfully store them as parquet and read them with
> sqlContext.jsonRDD, but the inferred schemas cannot be merged into a single
> table to do queries.
> I'd like a way to allow for parquet file schemas to be merged, whether they
> match or not, since we know the schema should be a union of the schemas from
> the files.
> This will allow us to have the data define the schema and new columns will
> just appear.
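
To illustrate the "union of the schemas" behaviour requested above, here is a minimal plain-Python sketch. It does not use Spark and is not the change made for this issue. The helpers `merge_schemas` and `conform`, the field names, and the type names are all hypothetical. Raising an error when two files disagree on a column's type is an assumption, since the issue does not say how such conflicts should be handled.

```python
def merge_schemas(*schemas):
    """Union several {column name: type name} mappings into one schema.

    Assumption: a column seen with two different types is an error.
    """
    merged = {}
    for schema in schemas:
        for name, dtype in schema.items():
            if name in merged and merged[name] != dtype:
                raise ValueError(
                    f"conflicting types for {name!r}: {merged[name]} vs {dtype}"
                )
            merged.setdefault(name, dtype)
    return merged


def conform(record, schema):
    """Project a record onto the merged schema, using None for missing columns."""
    return {name: record.get(name) for name in schema}


# Hypothetical schemas inferred from two batches of JSON records.
old_schema = {"id": "long", "name": "string"}
new_schema = {"id": "long", "email": "string"}

merged = merge_schemas(old_schema, new_schema)

# Records from either batch can now be queried as one table.
rows = [
    conform(record, merged)
    for record in ({"id": 1, "name": "a"}, {"id": 2, "email": "b@example.com"})
]
```

With this approach a column that first appears in a later batch shows up in the merged schema, and rows written before it existed carry None for it.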