This is an automated email from the ASF dual-hosted git repository.
apitrou pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git
The following commit(s) were added to refs/heads/master by this push:
new d69d979 Add bloom-filter with length (#43)
d69d979 is described below
commit d69d979223e883faef9dc6fe3cf573087243c28a
Author: mwish <[email protected]>
AuthorDate: Fri Nov 24 00:41:57 2023 +0800
Add bloom-filter with length (#43)
---------
Co-authored-by: Gang Wu <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
---
data/README.md | 1 +
data/data_index_bloom_encoding_with_length.parquet | Bin 0 -> 2885 bytes
2 files changed, 1 insertion(+)
diff --git a/data/README.md b/data/README.md
index 761c7b1..69c5e94 100644
--- a/data/README.md
+++ b/data/README.md
@@ -28,6 +28,7 @@
| delta_encoding_optional_column.parquet | optional INT64 and STRING
columns with delta encoding. See
[delta_encoding_optional_column.md](delta_encoding_optional_column.md) for
details. |
| nested_structs.rust.parquet | Used to test that the Rust
Arrow reader can lookup the correct field from a nested struct. See
[ARROW-11452](https://issues.apache.org/jira/browse/ARROW-11452) |
| data_index_bloom_encoding_stats.parquet | optional STRING column. Contains
optional metadata: bloom filters, column index, offset index and encoding
stats. |
+| data_index_bloom_encoding_with_length.parquet | Same as
`data_index_bloom_encoding_stats.parquet` but has `bloom_filter_length`
populated in the ColumnMetaData |
| null_list.parquet | an empty list. Generated from this
json `{"emptylist":[]}` and for the purposes of testing correct read/write
behaviour of this base case. |
| alltypes_tiny_pages.parquet | small page sizes with dictionary
encoding with page index from
[impala](https://github.com/apache/impala/tree/master/testdata/data/alltypes_tiny_pages.parquet).
|
| alltypes_tiny_pages_plain.parquet | small page sizes with plain
encoding with page index
[impala](https://github.com/apache/impala/tree/master/testdata/data/alltypes_tiny_pages.parquet).
|
diff --git a/data/data_index_bloom_encoding_with_length.parquet
b/data/data_index_bloom_encoding_with_length.parquet
new file mode 100644
index 0000000..5b18eb6
Binary files /dev/null and b/data/data_index_bloom_encoding_with_length.parquet
differ