etseidl commented on code in PR #101:
URL: https://github.com/apache/parquet-site/pull/101#discussion_r1954826000
##########
content/en/docs/File Format/implementationstatus.md:
##########
@@ -45,66 +45,66 @@ Implementations:
 | Data type | C++ | Java | Go | Rust | cuDF |
 | ----------------------------------------- | ----- | ----- | ----- | ----- | ----- |
-| STRING | ✅ | ✅ | | | ✅ |
-| ENUM | ❌ | ✅ | | | ❌ |
-| UUID | ❌ | ✅ | | | ❌ |
-| 8, 16, 32, 64 bit signed and unsigned INT | ✅ | ✅ | | | ✅ |
-| DECIMAL (INT32) | ✅ | ✅ | | | ✅ |
-| DECIMAL (INT64) | ✅ | ✅ | | | ✅ |
-| DECIMAL (BYTE_ARRAY) | ✅ | ✅ | | | ✅ |
-| DECIMAL (FIXED_LEN_BYTE_ARRAY) | ✅ | ✅ | | | ✅ |
-| DATE | ✅ | ✅ | | | ✅ |
-| TIME (INT32) | ✅ | ✅ | | | ✅ |
-| TIME (INT64) | ✅ | ✅ | | | ✅ |
-| TIMESTAMP (INT64) | ✅ | ✅ | | | ✅ |
-| INTERVAL | ✅ | ✅(*)| | | ❌ |
-| JSON | ✅ | ✅(*)| | | ❌ |
-| BSON | ❌ | ✅(*)| | | ❌ |
-| LIST | ✅ | ✅ | | | ✅ |
-| MAP | ✅ | ✅ | | | ✅ |
-| UNKNOWN (always null) | ✅ | ✅ | | | ✅ |
-| FLOAT16 | ✅ | ✅(*)| | | ✅ |
+| STRING | ✅ | ✅ | | ✅ | ✅ |
+| ENUM | ❌ | ✅ | | ✅(*)| ❌ |
+| UUID | ❌ | ✅ | | ✅(*)| ❌ |
+| 8, 16, 32, 64 bit signed and unsigned INT | ✅ | ✅ | | ✅ | ✅ |
+| DECIMAL (INT32) | ✅ | ✅ | | ✅ | ✅ |
+| DECIMAL (INT64) | ✅ | ✅ | | ✅ | ✅ |
+| DECIMAL (BYTE_ARRAY) | ✅ | ✅ | | ✅ | ✅ |
+| DECIMAL (FIXED_LEN_BYTE_ARRAY) | ✅ | ✅ | | ✅ | ✅ |
+| DATE | ✅ | ✅ | | ✅ | ✅ |
+| TIME (INT32) | ✅ | ✅ | | ✅ | ✅ |
+| TIME (INT64) | ✅ | ✅ | | ✅ | ✅ |
+| TIMESTAMP (INT64) | ✅ | ✅ | | ✅ | ✅ |
+| INTERVAL | ✅ | ✅(*)| | ✅ | ❌ |
+| JSON | ✅ | ✅(*)| | ✅(*)| ❌ |
+| BSON | ❌ | ✅(*)| | ✅(*)| ❌ |
+| LIST | ✅ | ✅ | | ✅ | ✅ |
+| MAP | ✅ | ✅ | | ✅ | ✅ |
+| UNKNOWN (always null) | ✅ | ✅ | | ✅ | ✅ |
+| FLOAT16 | ✅ | ✅(*)| | ✅ | ✅ |

 (*): Only supported to use its annotated physical type

 ### Encodings

 | Encoding | C++ | Java | Go | Rust | cuDF |
 | ----------------------------------------- | ----- | ----- | ----- | ----- | ----- |
-| PLAIN | ✅ | ✅ | | | ✅ |
-| PLAIN_DICTIONARY | ✅ | ✅ | | | ✅ |
-| RLE_DICTIONARY | ✅ | ✅ | | | ✅ |
-| RLE | ✅ | ✅ | | | ✅ |
-| BIT_PACKED (deprecated) | ✅ | ✅ | | | (R) |
-| DELTA_BINARY_PACKED | ✅ | ✅ | | | ✅ |
-| DELTA_LENGTH_BYTE_ARRAY | ✅ | ✅ | | | ✅ |
-| DELTA_BYTE_ARRAY | ✅ | ✅ | | | ✅ |
-| BYTE_STREAM_SPLIT | ✅ | ✅ | | | ✅ |
+| PLAIN | ✅ | ✅ | | ✅ | ✅ |
+| PLAIN_DICTIONARY | ✅ | ✅ | | ✅ | ✅ |
+| RLE_DICTIONARY | ✅ | ✅ | | ✅ | ✅ |
+| RLE | ✅ | ✅ | | ✅ | ✅ |
+| BIT_PACKED (deprecated) | ✅ | ✅ | | ❌ | (R) |
+| DELTA_BINARY_PACKED | ✅ | ✅ | | ✅ | ✅ |
+| DELTA_LENGTH_BYTE_ARRAY | ✅ | ✅ | | ✅ | ✅ |
+| DELTA_BYTE_ARRAY | ✅ | ✅ | | ✅ | ✅ |
+| BYTE_STREAM_SPLIT | ✅ | ✅ | | ✅ | ✅ |

 ### Compressions

 | Compression | C++ | Java | Go | Rust | cuDF |
 | ----------------------------------------- | ----- | ----- | ----- | ----- | ----- |
-| UNCOMPRESSED | ✅ | ✅ | | | ✅ |
-| BROTLI | ✅ | ✅ | | | (R) |
-| GZIP | ✅ | ✅ | | | (R) |
-| LZ4 (deprecated) | ✅ | ❌ | | | ❌ |
-| LZ4_RAW | ✅ | ✅ | | | ✅ |
-| LZO | ❌ | ❌ | | | ❌ |
-| SNAPPY | ✅ | ✅ | | | ✅ |
-| ZSTD | ✅ | ✅ | | | ✅ |
+| UNCOMPRESSED | ✅ | ✅ | | ✅ | ✅ |
+| BROTLI | ✅ | ✅ | | ✅ | (R) |
+| GZIP | ✅ | ✅ | | ✅ | (R) |
+| LZ4 (deprecated) | ✅ | ❌ | | ✅ | ❌ |
+| LZ4_RAW | ✅ | ✅ | | ✅ | ✅ |
+| LZO | ❌ | ❌ | | ❌ | ❌ |
+| SNAPPY | ✅ | ✅ | | ✅ | ✅ |
+| ZSTD | ✅ | ✅ | | ✅ | ✅ |

 ### Other format level features

 | | C++ | Java | Go | Rust | cuDF |
 | ----------------------------------------- | ----- | ----- | ----- | ----- | ----- |
-| xxHash-based bloom filters | (R) | ✅ | | | (R) |
-| Bloom filter length (1) | (R) | ✅ | | | (R) |
-| Statistics min_value, max_value | ✅ | ✅ | | | ✅ |
-| Page index | ✅ | ✅ | | | ✅ |
-| Page CRC32 checksum | ✅ | ✅ | | | ❌ |
-| Modular encryption | ✅ | ✅ | | | ❌ |
-| Size statistics (2) | ✅ | ✅ | | | ✅ |
+| xxHash-based bloom filters | (R) | ✅ | | ✅ | (R) |
+| Bloom filter length (1) | (R) | ✅ | | ✅ | (R) |
+| Statistics min_value, max_value | ✅ | ✅ | | ✅ | ✅ |
+| Page index | ✅ | ✅ | | ✅ | ✅ |
+| Page CRC32 checksum | ✅ | ✅ | | ✅ | ❌ |
+| Modular encryption | ✅ | ✅ | | ❌ | ❌ |

Review Comment:
   Have to remember to update this cell when https://github.com/apache/arrow-rs/pull/6637 is merged.
