Mich,

Here are the benchmarks that I did using three different types of data:
http://www.slideshare.net/HadoopSummit/file-format-benchmark-avro-json-orc-parquet

I assume you are comparing parquet-snappy vs parquet-none.

.. Owen

On Wed, Jan 25, 2017 at 1:37 PM, Mich Talebzadeh <mich.talebza...@gmail.com> wrote:

> Hi,
>
> Has there been any study of how much compressing Hive Parquet tables with
> snappy reduces storage space or simply the table size in quantitative terms?
>
> Thanks
>
> Dr Mich Talebzadeh
>
> LinkedIn
> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
>
> http://talebzadehmich.wordpress.com
>
> *Disclaimer:* Use it at your own risk. Any and all responsibility for any
> loss, damage or destruction of data or any other property which may arise
> from relying on this email's technical content is explicitly disclaimed.
> The author will in no case be liable for any monetary damages arising from
> such loss, damage or destruction.