This is an automated email from the ASF dual-hosted git repository.
gangwu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-format.git
The following commit(s) were added to refs/heads/master by this push:
new dac5a35 Finalize Variant and shredding specs (#509)
dac5a35 is described below
commit dac5a35040ab57000b84246746c5c9cb25267261
Author: Aihua Xu <[email protected]>
AuthorDate: Sun Aug 24 19:03:18 2025 -0700
Finalize Variant and shredding specs (#509)
---
VariantEncoding.md | 3 ---
VariantShredding.md | 3 ---
2 files changed, 6 deletions(-)
diff --git a/VariantEncoding.md b/VariantEncoding.md
index 2d3a0c8..d393d43 100644
--- a/VariantEncoding.md
+++ b/VariantEncoding.md
@@ -19,9 +19,6 @@
# Variant Binary Encoding
-> [!IMPORTANT]
-> **This specification is still under active development, and has not been
formally adopted.**
-
A Variant represents a type that contains one of:
- Primitive: A type and corresponding value (e.g. INT, STRING)
- Array: An ordered list of Variant values
diff --git a/VariantShredding.md b/VariantShredding.md
index d8d9ddf..9f59838 100644
--- a/VariantShredding.md
+++ b/VariantShredding.md
@@ -19,9 +19,6 @@
# Variant Shredding
-> [!IMPORTANT]
-> **This specification is still under active development, and has not been
formally adopted.**
-
The Variant type is designed to store and process semi-structured data
efficiently, even with heterogeneous values.
Query engines encode each Variant value in a self-describing format, and store
it as a group containing `value` and `metadata` binary fields in Parquet.
Since data is often partially homogeneous, it can be beneficial to extract
certain fields into separate Parquet columns to further improve performance.