Re: Spark SQL / Parquet - Dynamic Schema detection

Michael Armbrust Mon, 14 Mar 2016 11:00:57 -0700

>
> Each json file is of a single object and has the potential to have
> variance in the schema.
>
How much variance are we talking?  JSON->Parquet is going to do well with
100s of different columns, but at 10,000s many things will probably start
breaking.

Re: Spark SQL / Parquet - Dynamic Schema detection

Reply via email to