liyunzhang_intel created PARQUET-1117:
-
Summary: ParquetRecordWriter does not provide interface like
getRowCount(),getRawDataSize() like org.apache.orc.Writer
Key: PARQUET-1117
URL:
For anyone that would also like to test the compression codecs, I’ve
uploaded a copy of parquet-cli that can read and write zstd, lz4, and
brotli to my Apache public folder:
http://home.apache.org/~blue/
There’s also a copy of hadoop-common that has all the codec bits for
testing zstd. LZ4
Hi everyone,
I ran some tests using 4 of our large tables to compare compression codecs.
I tested gzip, brotli, lz4, and zstd, all with the default configuration.
You can find the raw data and summary tables/graphs in this spreadsheet:
Zoltan Ivanfi created PARQUET-1116:
--
Summary: Add Yetus InterfaceAudience annotations to Parquet
Key: PARQUET-1116
URL: https://issues.apache.org/jira/browse/PARQUET-1116
Project: Parquet
Zoltan Ivanfi created PARQUET-1115:
--
Summary: Prevent users from misusing parquet-tools merge
Key: PARQUET-1115
URL: https://issues.apache.org/jira/browse/PARQUET-1115
Project: Parquet
starting now at:
https://meet.google.com/wgv-qske-hzs