wgtmac commented on code in PR #36027: URL: https://github.com/apache/arrow/pull/36027#discussion_r1226380305
########## docs/source/status.rst: ########## @@ -348,3 +348,107 @@ Notes: * \(1) Through JNI bindings. (Provided by ``org.apache.arrow.orc:arrow-orc``) * \(2) Through JNI bindings to Arrow C++ Datasets. (Provided by ``org.apache.arrow:arrow-dataset``) + + +Parquet format public API details +================================= + ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Format | C++ | Python | Java | Go | Rust | +| | | | | | | ++===========================================+=======+========+========+=======+=======+ +| Basic compression | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Brotli, LZ4, ZSTD | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| LZ4_RAW | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Hive-style partitioning | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| File metadata | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| RowGroup metadata | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Column metadata | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Chunk metadta | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Sorting column | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| ColumnIndex statistics | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Page statistics | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| Statistics min_value | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| xxHash based bloom filter | | | | | | ++-------------------------------------------+-------+--------+--------+-------+-------+ +| bloom filter length | | | | | | Review Comment: > OMG, they finally added it - amazing, will get that incorporated into the rust writer/reader I just added it recently :) Please note that the latest format is not released yet so the parquet-mr does not know `bloom_filter_length` now. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
