mapleFU opened a new issue, #8263:
URL: https://github.com/apache/arrow-rs/issues/8263

   **Describe the bug**
   
   In Parquet, page size cannot exceeds i32, since it uses thirft to store 
`uncompressed_page_size` and `compressed_page_size`.
   
   See: 
https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift#L802
   
   It's unlikely to happen, since arrow-rs change page-size to 1MiB by default. 
However, when we enlarge batch-size and page size limit, it's likely to happen
   
   
   **To Reproduce**
   
   Trying to write huge blob to parquet
   
   **Expected behavior**
   
   Switching to smaller boundery > Throw error > Leaving bad parquet page
   
   **Additional context**
   
   No


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to