This is an automated email from the ASF dual-hosted git repository.
maplefu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git
The following commit(s) were added to refs/heads/master by this push:
new cb7a967 adding some bad parquet files (#58)
cb7a967 is described below
commit cb7a9674142c137367bf75a01b79c6e214a73199
Author: Jinpeng <[email protected]>
AuthorDate: Thu Aug 15 03:00:42 2024 -0700
adding some bad parquet files (#58)
* adding some bad parquet files
* move to bad-data
* fmt?
---------
Co-authored-by: jp0317 <[email protected]>
Co-authored-by: mwish <[email protected]>
---
bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet | Bin 0 -> 533 bytes
bad_data/ARROW-RS-GH-6229-LEVELS.parquet | Bin 0 -> 609 bytes
bad_data/README.md | 6 +++++-
3 files changed, 5 insertions(+), 1 deletion(-)
diff --git a/bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet
b/bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet
new file mode 100755
index 0000000..7d14d5e
Binary files /dev/null and b/bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet differ
diff --git a/bad_data/ARROW-RS-GH-6229-LEVELS.parquet
b/bad_data/ARROW-RS-GH-6229-LEVELS.parquet
new file mode 100644
index 0000000..110b783
Binary files /dev/null and b/bad_data/ARROW-RS-GH-6229-LEVELS.parquet differ
diff --git a/bad_data/README.md b/bad_data/README.md
index 885af61..30802a5 100644
--- a/bad_data/README.md
+++ b/bad_data/README.md
@@ -21,7 +21,11 @@
These are files used for reproducing various bugs that have been reported.
* PARQUET-1481.parquet: tests a case where a schema Thrift value has been
- corrupted
+ corrupted.
+* ARROW-RS-GH-6229-DICTHEADER.parquet: tests a case where the number of values
+ stored in dictionary page header is negative.
+* ARROW-RS-GH-6229-LEVELS.parquet: tests a case where a page has insufficient
+ repetition levels.
* ARROW-GH-41321.parquet: test case of
https://github.com/apache/arrow/issues/41321
where decoded rep / def levels is less than num_values in page_header.
* ARROW-GH-41317.parquet: test case of
https://github.com/apache/arrow/issues/41317