Hi all,

I see parquet-mr has been using ZSTD-JNI <https://github.com/luben/zstd-jni> for
the parquet-cli
<https://github.com/apache/parquet-mr/blob/master/parquet-cli/pom.xml#L48>
project. Using this JNI binding is a clean way to test ZSTD without relying
on the Hadoop implementation, especially when testing locally. As ZSTD
becomes more and more popular, I am wondering whether we can promote
ZSTD-JNI to the parquet-hadoop project. I have a prototype working, but I
would like to ask whether anybody knows of any issues (performance,
reliability, etc.) with ZSTD-JNI <https://github.com/luben/zstd-jni>. Any
feedback on using this JNI binding is welcome.

BTW, I am also trying out the AirCompressor
<https://github.com/airlift/aircompressor> approach, but its ZSTD
compression level does not seem to be adjustable.

-- 
Xinli Shang
