Quanlong Huang created IMPALA-13553:
---------------------------------------

             Summary: Always evaluate small runtime filters on parquet 
dictionary
                 Key: IMPALA-13553
                 URL: https://issues.apache.org/jira/browse/IMPALA-13553
             Project: IMPALA
          Issue Type: Improvement
            Reporter: Quanlong Huang


IMPALA-5509 added support for evaluating runtime filters on Parquet 
dictionaries to filter out RowGroups. It also added a query option, 
PARQUET_DICTIONARY_RUNTIME_FILTER_ENTRY_LIMIT (defaults to 1024), which sets 
how large a dictionary must be before this optimization is disabled. The 
motivation is to avoid false positives.

However, for runtime filters generated by small build sides, e.g. those 
containing just one value, it is still worthwhile to evaluate them on the 
dictionaries. This won't take long compared to evaluating the small runtime 
filters on all rows.
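
The proposed behaviour could be sketched roughly as follows. This is only an 
illustration, not Impala's implementation: the function names, the 
SMALL_FILTER_MAX_VALUES threshold, and the modelling of a runtime filter as an 
exact set of values are all assumptions made for the example.

```python
from typing import Hashable, Iterable, Set

# Default of PARQUET_DICTIONARY_RUNTIME_FILTER_ENTRY_LIMIT, as stated above.
DEFAULT_ENTRY_LIMIT = 1024
# Hypothetical cut-off for what counts as a "small" runtime filter.
SMALL_FILTER_MAX_VALUES = 16


def should_check_dictionary(
    dict_size: int,
    filter_value_count: int,
    entry_limit: int = DEFAULT_ENTRY_LIMIT,
    small_filter_max: int = SMALL_FILTER_MAX_VALUES,
) -> bool:
    """Decide whether to evaluate a runtime filter on a column's dictionary."""
    # Proposed change: small filters are always evaluated on the dictionary,
    # regardless of how many entries the dictionary has.
    if filter_value_count <= small_filter_max:
        return True
    # Otherwise fall back to the existing dictionary size limit.
    return dict_size <= entry_limit


def row_group_can_be_skipped(
    dictionary: Iterable[Hashable], filter_values: Set[Hashable]
) -> bool:
    """A RowGroup can be skipped if no dictionary entry passes the filter."""
    return not any(entry in filter_values for entry in dictionary)
```

With these assumptions, a one-value filter is checked against a 5000-entry 
dictionary even though 5000 exceeds the default limit, while a 1000-value 
filter is not.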


