zhuqi-lucas commented on issue #7363:
URL: https://github.com/apache/arrow-rs/issues/7363#issuecomment-2857020809

   Thank you @alamb for the suggestion, i agree, this is the perfect solution, 
but it seems no row count information available for the parquet default V1 
version page it only has num_values, i will investigate if we can do it.
   
   ```rust
   #[derive(Clone)]
   pub enum Page {
       /// Data page Parquet format v1.
       DataPage {
           /// The underlying data buffer
           buf: Bytes,
           /// Number of values in this page
           num_values: u32,
           /// Encoding for values in this page
           encoding: Encoding,
           /// Definition level encoding
           def_level_encoding: Encoding,
           /// Repetition level encoding
           rep_level_encoding: Encoding,
           /// Optional statistics for this page
           statistics: Option<Statistics>,
       },
       /// Data page Parquet format v2.
       DataPageV2 {
           /// The underlying data buffer
           buf: Bytes,
           /// Number of values in this page
           num_values: u32,
           /// Encoding for values in this page
           encoding: Encoding,
           /// Number of null values in this page
           num_nulls: u32,
           /// Number of rows in this page
           num_rows: u32,
           /// Length of definition levels
           def_levels_byte_len: u32,
           /// Length of repetition levels
           rep_levels_byte_len: u32,
           /// Is this page compressed
           is_compressed: bool,
           /// Optional statistics for this page
           statistics: Option<Statistics>,
       },
       /// Dictionary page.
       DictionaryPage {
           /// The underlying data buffer
           buf: Bytes,
           /// Number of values in this page
           num_values: u32,
           /// Encoding for values in this page
           encoding: Encoding,
           /// Is dictionary page sorted
           is_sorted: bool,
       },
   }
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to