This is an automated email from the ASF dual-hosted git repository.
gangwu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git
The following commit(s) were added to refs/heads/master by this push:
new 550368c Add repeated_primitive_no_list.parquet (#61)
550368c is described below
commit 550368ca77b97231efead39251a96bd6f8f08c6e
Author: Ze'ev Maor <[email protected]>
AuthorDate: Thu Oct 31 16:32:17 2024 +0200
Add repeated_primitive_no_list.parquet (#61)
---
data/README.md | 18 ++++++++++++++++++
data/repeated_primitive_no_list.parquet | Bin 0 -> 1296 bytes
2 files changed, 18 insertions(+)
diff --git a/data/README.md b/data/README.md
index 2c93965..f821ccb 100644
--- a/data/README.md
+++ b/data/README.md
@@ -54,6 +54,7 @@
| column_chunk_key_value_metadata.parquet | two INT32 columns, one with column
chunk key-value metadata {"foo": "bar", "thisiskeywithoutvalue": null} note
that the second key "thisiskeywithoutvalue", does not have a value, but the
value can be mapped to an empty string "" when read depending on the client |
| sorting_columns.parquet | INT64 and BYTE_ARRAY columns with first column
with nulls first and descending, second column with nulls last and ascending.
This file contains two row groups with same data and sorting columns. |
| old_list_structure.parquet | Single LIST<LIST<INT32>> column with legacy
two-level list structure. See [old_list_structure.md](old_list_structure.md) |
+| repeated_primitive_no_list.parquet | REPEATED INT32 and BYTE_ARRAY fields
without LIST annotation. See
[note](#REPEATED-primitive-fields-with-no-LIST-annotation) |
TODO: Document what each file is in the table above.
@@ -430,3 +431,20 @@ message hive_schema {
}
}
```
+
+## REPEATED primitive fields with no LIST annotation
+```
+Metadata for file: repeated_primitive_no_list.parquet
+
+version: 1
+num of rows: 4
+created by: parquet-rs version 53.2.0
+message schema {
+ REPEATED INT32 Int32_list;
+ REPEATED BYTE_ARRAY String_list (UTF8);
+ REQUIRED group group_of_lists {
+ REPEATED INT32 Int32_list_in_group;
+ REPEATED BYTE_ARRAY String_list_in_group (UTF8);
+ }
+}
+```
diff --git a/data/repeated_primitive_no_list.parquet
b/data/repeated_primitive_no_list.parquet
new file mode 100644
index 0000000..e1b29a6
Binary files /dev/null and b/data/repeated_primitive_no_list.parquet differ