[ 
https://issues.apache.org/jira/browse/DRILL-2568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14381248#comment-14381248
 ] 

Jacques Nadeau commented on DRILL-2568:
---------------------------------------

Can you please post the patch on Review Board?

> New partition pruning prevents the optimization for trivial COUNT(*) queries
> ----------------------------------------------------------------------------
>
>                 Key: DRILL-2568
>                 URL: https://issues.apache.org/jira/browse/DRILL-2568
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 0.8.0
>            Reporter: Aman Sinha
>            Assignee: Aman Sinha
>         Attachments: 
> 0001-DRILL-2568-Drop-filter-plan-node-if-all-conjuncts-ha.patch
>
>
> With the new interpreter-based partition pruning, if the query has only 
> partition filters and they are pushed into the Scan, we don't drop the Filter 
> node from the plan. This prevents the optimization for COUNT(*) queries 
> against Parquet files, where we read the count values directly from the 
> Parquet files instead of scanning and aggregating. The 
> ConvertCountToDirectScan rule is not applied if there is an intervening 
> Filter between the Scan and the Aggregate nodes, as in this plan:
> {code}
> 0: jdbc:drill:zk=local> explain plan for select count(*) from dfs.`/Users/asinha/data/multilevel/parquet` where dir0=1995;
> +------------+------------+
> |    text    |    json    |
> +------------+------------+
> | 00-00    Screen
> 00-01      StreamAgg(group=[{}], EXPR$0=[COUNT()])
> 00-02        Project($f0=[0])
> 00-03          SelectionVectorRemover
> 00-04            Filter(condition=[=($0, 1995)])
> 00-05              Scan(groupscan=[ParquetGroupScan 
> [entries=[ReadEntryWithPath 
> [path=file:/Users/asinha/data/multilevel/parquet/1995/Q1/orders_95_q1.parquet],
>  ReadEntryWithPath 
> [path=file:/Users/asinha/data/multilevel/parquet/1995/Q2/orders_95_q2.parquet],
>  ReadEntryWithPath 
> [path=file:/Users/asinha/data/multilevel/parquet/1995/Q3/orders_95_q3.parquet],
>  ReadEntryWithPath 
> [path=file:/Users/asinha/data/multilevel/parquet/1995/Q4/orders_95_q4.parquet]],
>  selectionRoot=/Users/asinha/data/multilevel/parquet, numFiles=4, 
> columns=[`dir0`]]])
> {code}
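
To illustrate the behavior described in the issue, here is a minimal sketch using a toy plan model. It is not Drill's planner code: the classes `Scan`, `Filter`, `CountStar`, `DirectCountScan` and the methods `dropRedundantFilter` and `convertCountToDirectScan` are hypothetical names invented for this example, and the toy plan omits the Project and SelectionVectorRemover nodes shown in the plan above. It assumes that a Filter is redundant when every conjunct references a partition column the Scan has already pruned on.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class FilterDropSketch {

    /** Toy plan node with at most one input (hypothetical, not a Drill class). */
    static abstract class Node {
        final Node input;
        Node(Node input) { this.input = input; }
    }

    /** Scan that remembers which partition columns pruning was already applied to. */
    static final class Scan extends Node {
        final Set<String> prunedColumns;
        Scan(Set<String> prunedColumns) { super(null); this.prunedColumns = prunedColumns; }
    }

    /** Filter described only by the column each of its conjuncts references. */
    static final class Filter extends Node {
        final List<String> conjunctColumns;
        Filter(List<String> conjunctColumns, Node input) { super(input); this.conjunctColumns = conjunctColumns; }
    }

    static final class CountStar extends Node {
        CountStar(Node input) { super(input); }
    }

    /** Stands in for answering COUNT(*) without scanning and aggregating. */
    static final class DirectCountScan extends Node {
        DirectCountScan() { super(null); }
    }

    /** Assumption: drop a Filter over a Scan when pruning already covers every conjunct. */
    static Node dropRedundantFilter(Node node) {
        if (node instanceof CountStar) {
            return new CountStar(dropRedundantFilter(node.input));
        }
        if (node instanceof Filter && node.input instanceof Scan) {
            Filter filter = (Filter) node;
            Scan scan = (Scan) node.input;
            if (scan.prunedColumns.containsAll(filter.conjunctColumns)) {
                return scan;
            }
        }
        return node;
    }

    /** Toy counterpart of the rule: it matches only a COUNT(*) sitting directly on a Scan. */
    static Node convertCountToDirectScan(Node node) {
        if (node instanceof CountStar && node.input instanceof Scan) {
            return new DirectCountScan();
        }
        return node;
    }

    /** Renders the plan from root to leaf, e.g. "CountStar <- Filter <- Scan". */
    static String describe(Node node) {
        StringBuilder sb = new StringBuilder();
        for (Node n = node; n != null; n = n.input) {
            if (sb.length() > 0) sb.append(" <- ");
            sb.append(n.getClass().getSimpleName());
        }
        return sb.toString();
    }

    /** COUNT(*) over a filter on dir0, with dir0 already pruned in the Scan. */
    static Node samplePlan() {
        Set<String> pruned = new HashSet<>(Arrays.asList("dir0"));
        return new CountStar(new Filter(Arrays.asList("dir0"), new Scan(pruned)));
    }

    public static void main(String[] args) {
        Node plan = samplePlan();
        // The intervening Filter blocks the conversion.
        System.out.println(describe(convertCountToDirectScan(plan)));
        // Once the redundant Filter is dropped, the conversion applies.
        System.out.println(describe(convertCountToDirectScan(dropRedundantFilter(plan))));
    }
}
```

In this sketch the first line printed is `CountStar <- Filter <- Scan` and the second is `DirectCountScan`. A Filter with any conjunct on a non-partition column is left in place, so the conversion still does not apply in that case.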



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
