Qifan Chen created IMPALA-10325: ----------------------------------- Summary: Parquet scan should use min/max statistics to skip pages based on equi-join predicate Key: IMPALA-10325 URL: https://issues.apache.org/jira/browse/IMPALA-10325 Project: IMPALA Issue Type: Bug Reporter: Qifan Chen
Parquet stores min/max stats for pages which can be used to skip certain pages if they don't qualify an equi-join predicate. The query below ends up scanning all rows for table a, which may not be needed if the min/max of b.ss_addr_sk can be detected and used during the scan of a. {code:java} select a.ss_sold_time_sk from store_sales a join [SHUFFLE] store_sales b where a.ss_addr_sk = b.ss_addr_sk and b.ss_customer_sk < 10 ; {code} -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org