[ 
https://issues.apache.org/jira/browse/IMPALA-10709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Qifan Chen updated IMPALA-10709:
--------------------------------
    Description: 
Currently, the min/max filter feature is turned off by default 
(MINMAX_FILTER_THRESHOLD=0). 

When joining into sorted columns in a fact Parquet table created by Imoala, the 
feature can be turned on by default. This is because Impala sorts the data in 
sort by columns in each data file during population. A min/max filter can be 
used to easily reject pages not overlap with the search region specified in the 
filter. 

  was:
Currently, the min/max filter feature is turned off by default 
(MINMAX_FILTER_THRESHOLD=0). 

When joining into sorted columns in a fact Parquet table, the feature can be 
turned on by default since the filter can achieve maximum filtering efficiency. 


> Min/max filters should be enabled for joins on sorted columns in Parquet 
> tables 
> --------------------------------------------------------------------------------
>
>                 Key: IMPALA-10709
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10709
>             Project: IMPALA
>          Issue Type: Test
>            Reporter: Qifan Chen
>            Assignee: Qifan Chen
>            Priority: Major
>
> Currently, the min/max filter feature is turned off by default 
> (MINMAX_FILTER_THRESHOLD=0). 
> When joining into sorted columns in a fact Parquet table created by Imoala, 
> the feature can be turned on by default. This is because Impala sorts the 
> data in sort by columns in each data file during population. A min/max filter 
> can be used to easily reject pages not overlap with the search region 
> specified in the filter. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to