William Lee created ARROW-2677:
----------------------------------
Summary: Missing pyarrow ZSTD support for Parquet
Key: ARROW-2677
URL: https://issues.apache.org/jira/browse/ARROW-2677
Project: Apache Arrow
Issue Type: Improvement
Components: Python
Affects Versions: 0.9.0
Reporter: William Lee
In python/pyarrow/_parquet.pyx there appears to be a check for the supported
compression schemes:
{code}
cdef int check_compression_name(name) except -1:
    if name.upper() not in ['NONE', 'SNAPPY', 'GZIP', 'LZO', 'BROTLI', 'LZ4']:
        raise ArrowException("Unsupported compression: " + name)
    return 0
{code}
which does not include ZSTD (Zstandard). As far as I understand, the
underlying C++ library already supports it. Would it be possible to add that
support?
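
As a rough sketch of the requested change, the check might look like the following once 'ZSTD' is accepted. This is a plain-Python mirror of the Cython function, not the actual pyarrow patch: the SUPPORTED_COMPRESSIONS name is hypothetical, ValueError stands in for ArrowException so the snippet runs on its own, and it assumes the C++ layer really does handle ZSTD.

```python
# Hypothetical plain-Python mirror of check_compression_name from _parquet.pyx.
# Assumption: 'ZSTD' is simply added to the existing list of accepted names.
SUPPORTED_COMPRESSIONS = ('NONE', 'SNAPPY', 'GZIP', 'LZO', 'BROTLI', 'LZ4', 'ZSTD')


def check_compression_name(name):
    """Return 0 if the codec name is accepted, otherwise raise."""
    # Comparison is case-insensitive, as in the original check.
    if name.upper() not in SUPPORTED_COMPRESSIONS:
        # ValueError stands in for pyarrow's ArrowException here.
        raise ValueError("Unsupported compression: " + name)
    return 0
```

With a check like this, a call such as check_compression_name('zstd') would pass validation instead of raising, while unknown names would still be rejected.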