subject:"Writing parquet table using spark"

Re: Writing parquet table using spark

2016-11-16 Thread Dirceu Semighini Filho

Hello,

Have you configured this property?
spark.sql.parquet.compression.codec

2016-11-16 6:40 GMT-02:00 Vaibhav Sinha:
> Hi,
> I am using hiveContext.sql() method to select data from source table and
> insert into parquet tables.
> The query executed from spark takes
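For readers following the suggestion above, here is a minimal sketch of setting that property from PySpark before running the insert. It assumes a Spark 1.x HiveContext like the one in the question; the helper name `insert_with_codec`, the `KNOWN_CODECS` tuple, and the table names in the usage comment are hypothetical and not from the thread. The codec list is the one given in the Spark 1.6 SQL documentation. Whether changing the codec accounts for the size difference reported here is not established by the thread.

```python
# Codec names listed for spark.sql.parquet.compression.codec in Spark 1.6
# (assumption: later releases accept additional values).
KNOWN_CODECS = ("uncompressed", "snappy", "gzip", "lzo")


def insert_with_codec(sql_context, insert_sql, codec="snappy"):
    """Set the Parquet compression codec on the context, then run the query.

    sql_context is expected to be a SQLContext or HiveContext; only its
    setConf() and sql() methods are used.
    """
    if codec not in KNOWN_CODECS:
        raise ValueError("unexpected Parquet codec: %r" % codec)
    # setConf applies the option to SQL run through this context.
    sql_context.setConf("spark.sql.parquet.compression.codec", codec)
    return sql_context.sql(insert_sql)


# Usage with an existing HiveContext (table names are hypothetical):
# insert_with_codec(
#     hiveContext,
#     "INSERT INTO TABLE target_parquet SELECT * FROM source_table",
#     codec="snappy",
# )
```

Comparing the size of the output directory after running the same insert with different codecs would show how much of the gap is due to compression.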

Writing parquet table using spark

2016-11-16 Thread Vaibhav Sinha

Hi, I am using the hiveContext.sql() method to select data from a source table and insert it into Parquet tables. The query executed from Spark takes about 3x more disk space to write the same number of rows than when it is run from Impala. Just wondering if this is normal behaviour and if there's a way