Their implementations are actually in parquet-mr:
https://github.com/apache/parquet-mr/tree/apache-parquet-1.8.1/parquet-column/src/test/java/org/apache/parquet/column/values
And in Parquet terms, they are usually called encodings instead of
compression schemes/algorithms. This can be a little bit confusing.
https://github.com/apache/parquet-format/blob/master/Encodings.md
Cheng
On 9/7/15 11:46 AM, Wangchangchun (A) wrote:
Thanks for your reply.
But in this file, it just a thrift file definition.
And I have generated code through thrift compiler,
I want to find where encoding algorithms be implemented and be called, such as
RLE be implemented and be called,
Can you give me a guide?
-----邮件原件-----
发件人: Jason Altekruse [mailto:[email protected]]
发送时间: 2015年9月7日 0:11
收件人: [email protected]
主题: Re: [compression implemention of Parquet]
There are a few compression algorithms that can be configured for use with
Parquet. Here is the reference in the parquet metadata definition (all of the
metadata of the file is written with thrift) for the compression algorithms
currently supported.
https://github.com/apache/parquet-format/blob/master/src/thrift/parquet.thrift#L320
On Sun, Sep 6, 2015 at 12:21 AM, Wangchangchun (A) <
[email protected]> wrote:
Hi everyone, can anybody answer me a question?
I want to find the compression implemention of Parquet, But I can not
find it.
I think it should be in parquet-format project, but I didn't find it.