[ 
https://issues.apache.org/jira/browse/PARQUET-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mars updated PARQUET-2237:
--------------------------
    Description: 
Bloomfilter needs to load from filesystem, it may costs time and space. If we 
can 
 
When the minMax values in  StatisticsFilter is same, we can exactly determine 
the existence/nonexistence of the value.
When we have page dictionaries, we can also determine the 
existence/nonexistence of the value.

> Improve performance when filters in RowGroupFilter can match exactly
> --------------------------------------------------------------------
>
>                 Key: PARQUET-2237
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2237
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Mars
>            Priority: Major
>
> Bloomfilter needs to load from filesystem, it may costs time and space. If we 
> can 
>  
> When the minMax values in  StatisticsFilter is same, we can exactly determine 
> the existence/nonexistence of the value.
> When we have page dictionaries, we can also determine the 
> existence/nonexistence of the value.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to