Re: write parquet with statistics min max with binary field

2018-01-28 Thread Stephen Joung
For the reference, this was intended symptom by PARQUET-686 [1]. [1] https://www.mail-archive.com/commits@parquet.apache.org/msg00491.html 2018-01-24 10:31 GMT+09:00 Stephen Joung <step...@vcnc.co.kr>: > How can I write parquet file with min/max statistic? > > 2018-01-24

Re: parquet vs orc files

2018-02-21 Thread Stephen Joung
In case of parquet, best source for me to configure and to ensure "min/max statistics" was https://www.slideshare.net/mobile/RyanBlue3/parquet-performance-tuning-the-missing-guide --- I don't have any experience in orc. 2018년 2월 22일 (목) 오전 6:59, Kane Kim 님이 작성: >

Re: write parquet with statistics min max with binary field

2018-01-23 Thread Stephen Joung
How can I write parquet file with min/max statistic? 2018-01-24 10:30 GMT+09:00 Stephen Joung <step...@vcnc.co.kr>: > Hi, I am trying to use spark sql filter push down. and specially want to > use row group skipping with parquet file. > > And I guessed that I need parquet fi

write parquet with statistics min max with binary field

2018-01-23 Thread Stephen Joung
Hi, I am trying to use spark sql filter push down. and specially want to use row group skipping with parquet file. And I guessed that I need parquet file with statistics min/max. On spark master branch - I tried to write single column with "a", "b", "c" to parquet file f1 scala>