This is an automated email from the ASF dual-hosted git repository.
apitrou pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git
The following commit(s) were added to refs/heads/master by this push:
new 3edb72a Add parquet file with optional metadata (#22)
3edb72a is described below
commit 3edb72ae36305b24bf45dde4af41c92e54be85cf
Author: Shani Solomon <[email protected]>
AuthorDate: Tue Feb 15 17:34:14 2022 +0200
Add parquet file with optional metadata (#22)
Co-authored-by: Shani Solomon
<[email protected]>
---
data/README.md | 15 ++++++++-------
data/data_index_bloom_encoding_stats.parquet | Bin 0 -> 1643 bytes
2 files changed, 8 insertions(+), 7 deletions(-)
diff --git a/data/README.md b/data/README.md
index 37f4289..78bb2ba 100644
--- a/data/README.md
+++ b/data/README.md
@@ -19,13 +19,14 @@
# Test data files for Parquet compatibility and regression testing
-| File | Description |
-|---|---|
-| delta_byte_array.parquet | string columns with DELTA_BYTE_ARRAY encoding.
See [delta_byte_array.md](delta_byte_array.md) for details. |
-| delta_binary_packed.parquet | INT32 and INT64 columns with
DELTA_BINARY_PACKED encoding. See
[delta_binary_packed.md](delta_binary_packed.md) for details. |
-| delta_encoding_required_column.parquet | required INT32 and STRING columns
with delta encoding. See
[delta_encoding_required_column.md](delta_encoding_required_column.md) for
details. |
-| delta_encoding_optional_column.parquet | optional INT64 and STRING columns
with delta encoding. See
[delta_encoding_optional_column.md](delta_encoding_optional_column.md) for
details. |
-| nested_structs.rust.parquet | Used to test that the Rust Arrow reader can
lookup the correct field from a nested struct. See
[ARROW-11452](https://issues.apache.org/jira/browse/ARROW-11452) |
+| File | Description
|
+|----------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------|
+| delta_byte_array.parquet | string columns with
DELTA_BYTE_ARRAY encoding. See [delta_byte_array.md](delta_byte_array.md) for
details. |
+| delta_binary_packed.parquet | INT32 and INT64 columns with
DELTA_BINARY_PACKED encoding. See
[delta_binary_packed.md](delta_binary_packed.md) for details.
|
+| delta_encoding_required_column.parquet | required INT32 and STRING
columns with delta encoding. See
[delta_encoding_required_column.md](delta_encoding_required_column.md) for
details. |
+| delta_encoding_optional_column.parquet | optional INT64 and STRING
columns with delta encoding. See
[delta_encoding_optional_column.md](delta_encoding_optional_column.md) for
details. |
+| nested_structs.rust.parquet | Used to test that the Rust
Arrow reader can lookup the correct field from a nested struct. See
[ARROW-11452](https://issues.apache.org/jira/browse/ARROW-11452) |
+| data_index_bloom_encoding_stats.parquet | optional STRING column. Contains
optional metadata: bloom filters, column index, offset index and encoding
stats. |
TODO: Document what each file is in the table above.
diff --git a/data/data_index_bloom_encoding_stats.parquet
b/data/data_index_bloom_encoding_stats.parquet
new file mode 100644
index 0000000..14ad191
Binary files /dev/null and b/data/data_index_bloom_encoding_stats.parquet differ