wgtmac commented on code in PR #111:
URL: https://github.com/apache/parquet-testing/pull/111#discussion_r3265060888


##########
bad_data/README.md:
##########
@@ -33,3 +33,5 @@ These are files used for reproducing various bugs that have 
been reported.
 * ARROW-GH-43605.parquet: dictionary index page uses rle encoding but 0 as rle 
bit-width.
 * ARROW-GH-45185.parquet: test case of 
https://github.com/apache/arrow/issues/45185
   where repetition levels start with a 1 instead of 0.
+* ARROW-GH-47662.parquet: test case identified in 
https://github.com/apache/arrow/issues/47662

Review Comment:
   Just curious why the new file (102 B) is way smaller than the old one (4.23 
KB)?



##########
bad_data/README.md:
##########
@@ -33,3 +33,5 @@ These are files used for reproducing various bugs that have 
been reported.
 * ARROW-GH-43605.parquet: dictionary index page uses rle encoding but 0 as rle 
bit-width.
 * ARROW-GH-45185.parquet: test case of 
https://github.com/apache/arrow/issues/45185
   where repetition levels start with a 1 instead of 0.
+* ARROW-GH-47662.parquet: test case identified in 
https://github.com/apache/arrow/issues/47662

Review Comment:
   Do we want to mention that this file was supposed to serve the purpose of 
data/fixed_length_byte_array.parquet before this bug has been spotted?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to