rdettai opened a new pull request #8682: URL: https://github.com/apache/arrow/pull/8682
> Getting the range of bytes of a column chunk inside a parquet file can be useful for external crates (for instance if they want to pre-fetch the columns), and is not completely obvious (it is enough to take a look at [1] and [2] to see that things can quickly get messy). > > I think it would be nice to move this logic in the metadata definition rather than have lost it in the middle of the reader implem. > > [1] https://stackoverflow.com/questions/55225108/why-is-dictionary-page-offset-0-for-plain-dictionary-encoding/ > [2] https://issues.apache.org/jira/browse/PARQUET-816 https://issues.apache.org/jira/browse/ARROW-10620 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
