[
https://issues.apache.org/jira/browse/IMPALA-10709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Qifan Chen updated IMPALA-10709:
--------------------------------
Description:
Currently, the min/max filter feature is turned off by default
(MINMAX_FILTER_THRESHOLD=0).
When joining into sorted columns in a fact Parquet table created by Imoala, the
feature can be turned on by default. This is because Impala sorts the data in
sort by columns in each data file during population. A min/max filter can be
used to easily reject pages not overlap with the search region specified in the
filter.
was:
Currently, the min/max filter feature is turned off by default
(MINMAX_FILTER_THRESHOLD=0).
When joining into sorted columns in a fact Parquet table, the feature can be
turned on by default since the filter can achieve maximum filtering efficiency.
> Min/max filters should be enabled for joins on sorted columns in Parquet
> tables
> --------------------------------------------------------------------------------
>
> Key: IMPALA-10709
> URL: https://issues.apache.org/jira/browse/IMPALA-10709
> Project: IMPALA
> Issue Type: Test
> Reporter: Qifan Chen
> Assignee: Qifan Chen
> Priority: Major
>
> Currently, the min/max filter feature is turned off by default
> (MINMAX_FILTER_THRESHOLD=0).
> When joining into sorted columns in a fact Parquet table created by Imoala,
> the feature can be turned on by default. This is because Impala sorts the
> data in sort by columns in each data file during population. A min/max filter
> can be used to easily reject pages not overlap with the search region
> specified in the filter.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]