etseidl opened a new issue, #6778: URL: https://github.com/apache/arrow-rs/issues/6778
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** As of now, when writing Parquet files, the [offset index](https://github.com/apache/parquet-format/blob/master/PageIndex.md) structures are populated and written regardless of whether statistics and column indexes are written. It is unclear if this behavior is intended or not. **Describe the solution you'd like** Add a writer option to disable the collection and writing of the offset index. Of course the offset index is required if the column index is written, so this option would probably only be useful when not writing column indexes (i.e. when the statistics level is `None` or `Chunk`). **Describe alternatives you've considered** Alternatively, the writing of the offset index could be disabled whenever the column index is disabled (i.e. when the stats level is not `Page`). This solution assumes the current behavior is not intentional. **Additional context** <!-- Add any other context or screenshots about the feature request here. --> -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
