zhuqi-lucas commented on issue #7363: URL: https://github.com/apache/arrow-rs/issues/7363#issuecomment-2857020809
Thank you @alamb for the suggestion, i agree, this is the perfect solution, but it seems no row count information available for the parquet default V1 version page it only has num_values, i will investigate if we can do it. ```rust #[derive(Clone)] pub enum Page { /// Data page Parquet format v1. DataPage { /// The underlying data buffer buf: Bytes, /// Number of values in this page num_values: u32, /// Encoding for values in this page encoding: Encoding, /// Definition level encoding def_level_encoding: Encoding, /// Repetition level encoding rep_level_encoding: Encoding, /// Optional statistics for this page statistics: Option<Statistics>, }, /// Data page Parquet format v2. DataPageV2 { /// The underlying data buffer buf: Bytes, /// Number of values in this page num_values: u32, /// Encoding for values in this page encoding: Encoding, /// Number of null values in this page num_nulls: u32, /// Number of rows in this page num_rows: u32, /// Length of definition levels def_levels_byte_len: u32, /// Length of repetition levels rep_levels_byte_len: u32, /// Is this page compressed is_compressed: bool, /// Optional statistics for this page statistics: Option<Statistics>, }, /// Dictionary page. DictionaryPage { /// The underlying data buffer buf: Bytes, /// Number of values in this page num_values: u32, /// Encoding for values in this page encoding: Encoding, /// Is dictionary page sorted is_sorted: bool, }, } ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org