This is an automated email from the ASF dual-hosted git repository.

maplefu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git


The following commit(s) were added to refs/heads/master by this push:
     new cb7a967  adding some bad parquet files (#58)
cb7a967 is described below

commit cb7a9674142c137367bf75a01b79c6e214a73199
Author: Jinpeng <[email protected]>
AuthorDate: Thu Aug 15 03:00:42 2024 -0700

    adding some bad parquet files (#58)
    
    * adding some bad parquet files
    
    * move to bad-data
    
    * fmt?
    
    ---------
    
    Co-authored-by: jp0317 <[email protected]>
    Co-authored-by: mwish <[email protected]>
---
 bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet | Bin 0 -> 533 bytes
 bad_data/ARROW-RS-GH-6229-LEVELS.parquet     | Bin 0 -> 609 bytes
 bad_data/README.md                           |   6 +++++-
 3 files changed, 5 insertions(+), 1 deletion(-)

diff --git a/bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet 
b/bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet
new file mode 100755
index 0000000..7d14d5e
Binary files /dev/null and b/bad_data/ARROW-RS-GH-6229-DICTHEADER.parquet differ
diff --git a/bad_data/ARROW-RS-GH-6229-LEVELS.parquet 
b/bad_data/ARROW-RS-GH-6229-LEVELS.parquet
new file mode 100644
index 0000000..110b783
Binary files /dev/null and b/bad_data/ARROW-RS-GH-6229-LEVELS.parquet differ
diff --git a/bad_data/README.md b/bad_data/README.md
index 885af61..30802a5 100644
--- a/bad_data/README.md
+++ b/bad_data/README.md
@@ -21,7 +21,11 @@
 These are files used for reproducing various bugs that have been reported.
 
 * PARQUET-1481.parquet: tests a case where a schema Thrift value has been
-  corrupted
+  corrupted.
+* ARROW-RS-GH-6229-DICTHEADER.parquet: tests a case where the number of values
+  stored in dictionary page header is negative.
+* ARROW-RS-GH-6229-LEVELS.parquet: tests a case where a page has insufficient 
+  repetition levels.
 * ARROW-GH-41321.parquet: test case of 
https://github.com/apache/arrow/issues/41321
   where decoded rep / def levels is less than num_values in page_header.
 * ARROW-GH-41317.parquet: test case of 
https://github.com/apache/arrow/issues/41317

Reply via email to