Re: [VOTE] Add BYTE_STREAM_SPLIT encoding to Apache Parquet

2019-10-10 Thread Radev, Martin
Dear Ryan Blue and other Parquet developers, I tested Ryan's proposal for modifying the encoding. The short answer is that it doesn't perform well in my tests. The encoding, code and results can be viewed below. The current implementation only handles 32-bit IEEE754 floats in the following

[jira] [Updated] (PARQUET-319) Define the parquet bloom filter statistics in parquet format

2019-10-10 Thread Junjie Chen (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Junjie Chen updated PARQUET-319: Fix Version/s: format-2.7.0 > Define the parquet bloom filter statistics in parquet format >