[ https://issues.apache.org/jira/browse/PARQUET-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mars updated PARQUET-2237: -------------------------- Description: Bloomfilter needs to load from filesystem, it may costs time and space. If we can When the minMax values in StatisticsFilter is same, we can exactly determine the existence/nonexistence of the value. When we have page dictionaries, we can also determine the existence/nonexistence of the value. > Improve performance when filters in RowGroupFilter can match exactly > -------------------------------------------------------------------- > > Key: PARQUET-2237 > URL: https://issues.apache.org/jira/browse/PARQUET-2237 > Project: Parquet > Issue Type: Improvement > Reporter: Mars > Priority: Major > > Bloomfilter needs to load from filesystem, it may costs time and space. If we > can > > When the minMax values in StatisticsFilter is same, we can exactly determine > the existence/nonexistence of the value. > When we have page dictionaries, we can also determine the > existence/nonexistence of the value. -- This message was sent by Atlassian Jira (v8.20.10#820010)