novemberkilo opened a new issue #1399:
URL: https://github.com/apache/arrow-rs/issues/1399


   **Describe the bug**
   Parquet reader fails to read a parquet file that is generated from this json:
   ```
   {"emptylist":[]}
   ```
   
   **To Reproduce**
   I have written a test on a branch to reproduce this failure. 
   
   1. Generated a parquet-file from the aforementioned json. See 
https://github.com/apache/arrow-rs/pull/1063#issuecomment-1053939744 and the 
related issue for context. Verified that this parquet file can be read using 
pyarrow as the following arrow table:
   
   ```
   >>> empty_table = pq.read_table("empty_table.parquet")
   >>> empty_table
   pyarrow.Table
   emptylist: list<item: null>
     child 0, item: null
   ----
   emptylist: [[0 nulls]]
   ```
   2. Added this parquet file to a branch of my fork of `parquet-testing` 
https://github.com/novemberkilo/parquet-testing/commit/a827b12cd0985ef2d6d89534a405e64d55a40140
   
   3. Wrote a test that reads this file. 
   
   Test: 
https://github.com/novemberkilo/arrow-rs/commit/e5952ae74344d253fad7212ec46cb913d5e4f1cb
   Failure: 
https://github.com/novemberkilo/arrow-rs/runs/5431110483?check_suite_focus=true#step:4:1999
   
   
   **Expected behavior**
   Should read the file (like pyarrow can)
   
   **Additional context**
   This is all related to this original issue #1036 // @alamb 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to