Hello all, 
I have a dataset consisting of about 16 GB of CSV files.  I am looking to do 
some time series analysis of this data, and created a view but when I started 
doing aggregate queries using components of the date, the performance was 
disappointing.  Would it be better to do a CTAS and partition by components of 
the date?  If so, would parquet be the best format?  
Would anyone have other suggestions of things I could do to improve performance?
Thanks,
— C

Reply via email to