This is an automated email from the ASF dual-hosted git repository.
maplefu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git
The following commit(s) were added to refs/heads/master by this push:
new f4d7ed7 GH-45185: Add bad_data file with invalid repetition levels (#67)
f4d7ed7 is described below
commit f4d7ed772a62a95111db50fbcad2460833e8c882
Author: Adam Reeve <[email protected]>
AuthorDate: Tue Jan 14 00:43:53 2025 +1300
GH-45185: Add bad_data file with invalid repetition levels (#67)
* Add example file with bad repetition levels
* Simplify and improve test file
* Reduce row count
* Use int32 values
* Disable dictionary encoding and statistics
* Use correct list structure with logical type annotation
---
bad_data/ARROW-GH-45185.parquet | Bin 0 -> 264 bytes
bad_data/README.md | 2 ++
2 files changed, 2 insertions(+)
diff --git a/bad_data/ARROW-GH-45185.parquet b/bad_data/ARROW-GH-45185.parquet
new file mode 100644
index 0000000..dea95fb
Binary files /dev/null and b/bad_data/ARROW-GH-45185.parquet differ
diff --git a/bad_data/README.md b/bad_data/README.md
index 52a4818..0a030a0 100644
--- a/bad_data/README.md
+++ b/bad_data/README.md
@@ -31,3 +31,5 @@ These are files used for reproducing various bugs that have been reported.
* ARROW-GH-41317.parquet: test case of https://github.com/apache/arrow/issues/41317
where all columns have not the same size.
* ARROW-GH-43605.parquet: dictionary index page uses rle encoding but 0 as rle bit-width.
+* ARROW-GH-45185.parquet: test case of https://github.com/apache/arrow/issues/45185
+ where repetition levels start with a 1 instead of 0.
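
For readers unfamiliar with why a leading repetition level of 1 is invalid: in Parquet's nested encoding, a repetition level of 0 marks the start of a new top-level row, so the first level in a column chunk has to be 0. The sketch below is illustrative only and is not taken from the Arrow or Parquet code bases; the helper name `split_rows_by_repetition_level` and its parameters are hypothetical. It groups value positions into rows and rejects a stream like the one in the new test file.

```python
from typing import List, Sequence


def split_rows_by_repetition_level(
    rep_levels: Sequence[int], max_rep_level: int
) -> List[List[int]]:
    """Group value positions into rows using repetition levels.

    Hypothetical helper for illustration: a level of 0 starts a new row,
    and any higher level continues the current row.
    """
    rows: List[List[int]] = []
    for position, level in enumerate(rep_levels):
        if level < 0 or level > max_rep_level:
            raise ValueError(
                f"repetition level {level} at position {position} "
                f"is outside [0, {max_rep_level}]"
            )
        if level == 0:
            # A new top-level row begins here.
            rows.append([position])
        elif not rows:
            # A non-zero level with no open row: this is the malformed case
            # described for ARROW-GH-45185.parquet.
            raise ValueError("first repetition level must be 0")
        else:
            # Continue the most recently started row.
            rows[-1].append(position)
    return rows
```

With a maximum repetition level of 1, the levels `[0, 1, 1, 0, 1]` describe two rows covering positions 0 to 2 and 3 to 4, while `[1, 0]` is rejected because no row has been started when the first level is read.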