[GitHub] [arrow] rdettai opened a new pull request #8682: ARROW-10620: [Rust][Parquet] move column chunk range logic to metadata.rs

GitBox Mon, 16 Nov 2020 10:20:22 -0800


rdettai opened a new pull request #8682:
URL: https://github.com/apache/arrow/pull/8682



   > Getting the range of bytes of a column chunk inside a parquet file can be 
useful for external crates (for instance if they want to pre-fetch the 
columns), and is not completely obvious (it is enough to take a look at [1] and 
[2] to see that things can quickly get messy).
   > 
   > I think it would be nice to move this logic in the metadata definition 
rather than have lost it in the middle of the reader implem.
   > 
   > [1] 
https://stackoverflow.com/questions/55225108/why-is-dictionary-page-offset-0-for-plain-dictionary-encoding/
   > [2] https://issues.apache.org/jira/browse/PARQUET-816
   
   https://issues.apache.org/jira/browse/ARROW-10620


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow] rdettai opened a new pull request #8682: ARROW-10620: [Rust][Parquet] move column chunk range logic to metadata.rs

Reply via email to