Hi, In my opinion, compatibility is the main thing to consider here. Some applications (Impala being a notable example) only support v1 at the moment. You should carefully consider what applications you might want to use in the future to process the data and check whether they all support v2.
Regards, Zoltan On Wed, Nov 15, 2017 at 3:07 AM Ivan Gozali <[email protected]> wrote: > Hi Parquet maintainers, > > I was wondering if there are any advantages (e.g. performance increases) or > disadvantages (e.g. any stability issues) for setting the configuration > parquet.writer.version=v2 in apache-parquet-1.8.2 (particularly curious > about this version since Spark 2.2.0 uses it) or above? > > Thank you in advance! > > -- > Regards, > > > Ivan Gozali > Lecida > Email: [email protected] >
