etseidl opened a new issue, #6778:
URL: https://github.com/apache/arrow-rs/issues/6778

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   Currently, when writing Parquet files, the [offset 
index](https://github.com/apache/parquet-format/blob/master/PageIndex.md) 
structures are always populated and written, whether or not statistics and 
column indexes are written. It is unclear whether this behavior is intentional.
   
   **Describe the solution you'd like**
   Add a writer option to disable the collection and writing of the offset 
index. Of course the offset index is required if the column index is written, 
so this option would probably only be useful when not writing column indexes 
(i.e. when the statistics level is `None` or `Chunk`).
   
   **Describe alternatives you've considered**
   Alternatively, the writing of the offset index could be disabled whenever 
the column index is disabled (i.e. when the stats level is not `Page`). This 
solution assumes the current behavior is not intentional.
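
   To make the difference between the proposed option and this alternative 
concrete, here is a minimal, self-contained sketch of the two policies. It does 
not use the `parquet` crate: the `StatsLevel` enum, the `offset_index_disabled` 
flag, and all function names are hypothetical stand-ins chosen for 
illustration, not existing API.

```rust
/// Hypothetical mirror of the statistics levels named in this issue
/// (`None`, `Chunk`, `Page`); not the crate's actual type.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum StatsLevel {
    None,
    Chunk,
    Page,
}

/// Per the issue text, the column index is written only at the `Page` level.
pub fn writes_column_index(level: StatsLevel) -> bool {
    level == StatsLevel::Page
}

/// Proposed solution: an explicit opt-out flag (hypothetical name).
/// The flag is ignored when the column index is written, because the
/// column index requires the offset index.
pub fn writes_offset_index_with_option(level: StatsLevel, offset_index_disabled: bool) -> bool {
    writes_column_index(level) || !offset_index_disabled
}

/// Alternative: no new option; the offset index simply follows the
/// column index and is skipped whenever the level is not `Page`.
pub fn writes_offset_index_alternative(level: StatsLevel) -> bool {
    writes_column_index(level)
}
```

   Under the first policy the default (flag unset) keeps today's behavior of 
always writing the offset index, so existing files are unaffected unless a 
user opts out. Under the alternative, files written at the `None` or `Chunk` 
level would stop carrying an offset index without any configuration change.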
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
