Hi all,

I am pretty new to Parquet and trying to learn Parquet structure.

I assume that min, max and etc information has been stored for both
ColumnMetaData and also DataPageHeader since 1.6.0 (
https://github.com/Parquet/parquet-mr/pull/338)

I see the statistics in ColumnMetaData is used to filter blocks (or row
groups) as filter2 at RowGroupFilter by calling canDrop().

I though the statistics in DataPageHeader is used to not to read a page by
reading the statistics.
However, my question is, I could not find where to use statistics in
DataPageHeader for filter1 and also filter2.
​

Could you give me some comments on this please?

Reply via email to